perfplot | create performance and roofline plots | Performance Testing library

by GeorgOfenbeck C++ Version: Current License: No License

X-Ray Key Features Code Snippets Community Discussions(7)Vulnerabilities Install Support

kandi X-RAY | perfplot Summary

perfplot is a C++ library typically used in Testing, Performance Testing applications. perfplot has no bugs, it has no vulnerabilities and it has low support. You can download it from GitHub.

Perfplot is a collection of scripts and tools that allow a user to instrument performance counters on a recent Intel platform, measure them and use the results to generate roofline and performance plots.

Support

Quality

Security

License

Reuse

Support

perfplot has a low active ecosystem.

It has 46 star(s) with 11 fork(s). There are 13 watchers for this library.

It had no major release in the last 6 months.

There are 2 open issues and 0 have been closed. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of perfplot is current.

Quality

perfplot has no bugs reported.

Security

perfplot has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

perfplot does not have a standard license declared.

Check the repository for any license declaration and review the terms closely.

Without a license, all rights are reserved, and you cannot use the library in your applications.

Reuse

perfplot releases are not available. You will need to build from source code and install.

Installation instructions, examples and code snippets are available.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of perfplot

Get all kandi verified functions for this library.

perfplot Key Features

No Key Features are available at this moment for perfplot.

perfplot Examples and Code Snippets

No Code Snippets are available at this moment for perfplot.

Community Discussions

Trending Discussions on perfplot

Precomputing strided access pattern to array gives worse performance?

How to prevent perfplot (matplotlib) graph labels from being truncated?

Perfplot raised a "TypeError: bench() got an unexpected keyword argument 'logx'". How to fix?

Create dict from a string or list

Why `vectorize` is outperformed by `frompyfunc`?

Perfplot bench() raises "TypeError: ufunc 'isfinite' not supported for the input types, and the input types"

Seeking explanation to Dask vs Numpy vs Pandas benchmark results

QUESTION

Precomputing strided access pattern to array gives worse performance?

Asked 2021-Jun-03 at 15:36

I have a written a c-extension for the numpy library which is used for computing a specific type of bincount. From the lack of a better name, let's call it fast_compiled and place the method signature in numpy/core/src/multiarray/multiarraymodule.c inside array_module_methods:

...

ANSWER

Answered 2021-Jun-01 at 14:18

fast_compiled is faster than fast_compiled_strides because it works on contiguous data known at compile time enabling compilers to use SIMD instructions (eg. typically SSE on x86-like platforms or NEON on ARM ones). It should also be faster because of less data cache to retrieve from the L1 cache (more fetches are needed due to the indirection).

Indeed, dans[j] += weights[k] can be vectorized by loading m items of dans and m items of weights adding the m items using one instruction and storing the m items back in dans. This solution is efficient and cache friendly.

dans[strides[i]] += weights[i] cannot be efficiently vectorized on most mainstream hardware. The processor need to perform a costly gather from the memory hierarchy due to the indirection, then do the sum and then perform a scatter store which is also expensive. Even if strides would contain contiguous indices, the instructions are generally much more expensive than loading a contiguous block of data from memory. Moreover, compiler often fail to vectorize the code or just find that this is not worth using SIMD instruction in that case. As a result the generated code is likely a less efficient scalar code.

Actually, the performance difference between the two codes should be bigger on modern processors with good compilation flags. I suspect you only use SSE on a x86 processor here and so the speed up is close to 2 theoretically since 2 double-precision floating-point numbers can be computed in a row. However, using AVX/AVX-2 would lead to a speed up close to 4 theoretically (as 4 numbers can be computed in a row). Very recent Intel processors can even compute 8 double-precision floating-point numbers in a row. Note that computing simple-precision floating-point numbers can also results in a theoretical 2x speed up. The same apply for other architecture like ARM with NEON and SVE instruction sets or POWER. Since future processors will likely use wider SIMD registers (because of their efficiency), it is very important to write SIMD-friendly codes.

Source https://stackoverflow.com/questions/67787501

QUESTION

How to prevent perfplot (matplotlib) graph labels from being truncated?

Asked 2020-Dec-26 at 13:31

I'm working with the perfplot library (which you can pip-install) which benchmarks functions and plots their performance.

When observing the plotted graphs, the labels are truncated. How can I prevent this?

Here's a simple MCVE:

...

ANSWER

Answered 2020-Dec-26 at 13:31

perfplot seems to use matplotlib for the display. According to the github site, you can separate calculation and plotting, giving you the possibility to inject an autoformat (basically plt.tight_layout()) with rcParams for this graph.

You can add the following before your script:

Source https://stackoverflow.com/questions/65456241

QUESTION

Perfplot raised a "TypeError: bench() got an unexpected keyword argument 'logx'". How to fix?

Asked 2020-May-10 at 12:14

After a search on SO for numpy array mixed dtype filling I found a nice little numpy array fill performance tester perfplot. When the posted code answer from Nico Schlömer was ran, I saw a dip in the performance chart. So I changed the perflot.show(..snippet..) to perflot.bench(..snippet..) as suggest here and got the following error:

...

ANSWER

Answered 2020-Jan-15 at 23:56

After a dive into perfplot main.py I figured out there is no logx' and logy **kwargs available.

My solution:

Source https://stackoverflow.com/questions/59761149

QUESTION

Create dict from a string or list

Asked 2020-Feb-12 at 01:40

Background

I want to generate a hash table for a given string or given list. The hash table treat element as key and showup times as value. For instance:

...

ANSWER

Answered 2020-Feb-11 at 13:03

The best way would be to use the built in counter, otherwise, you may use defualtdict which is quite similar to your second attempt

Source https://stackoverflow.com/questions/60169387

QUESTION

Why `vectorize` is outperformed by `frompyfunc`?

Asked 2020-Jan-15 at 23:57

Numpy offers vectorize and frompyfunc with similar functionalies.

As pointed out in this SO-post, vectorize wraps frompyfunc and handles the type of the returned array correctly, while frompyfunc returns an array of np.object.

However, frompyfunc outperforms vectorize consistently by 10-20% for all sizes, which can also not be explained with different return types.

Consider the following variants:

...

ANSWER

Answered 2019-Jul-29 at 21:39

Following the hints of @hpaulj we can profile the vectorize-function:

Source https://stackoverflow.com/questions/57253839

QUESTION

Perfplot bench() raises "TypeError: ufunc 'isfinite' not supported for the input types, and the input types"

Asked 2020-Jan-15 at 23:56

I am using perpflot library to test the effect of DatetimeIndex on searching for a pandas dataframe.

I have defined a setup function to cretate 2 dataframes. One with datetime index and other with time as a column. I have also defined 2 functions which uses .loc in index and on column respectively and returns the subdata. However, it shows me a typeError.

...

ANSWER

Answered 2019-Jun-21 at 20:49

The bench() and show() methods by default compare the kernel outputs to ensure that all the methods produce the same output (for correctness). The check is done using numpy functions which may not apply to all cases or all kernel outputs.

What you want to do is specify an equality_check argument, which allows some flexibility in how the output is compared. This is especially useful when comparing things such as iterables of strings or dictionaries, which numpy cannot handle well.

Set equality_check to None if you're confident your functions are correct, or otherwise pass some callable which implements your own checking logic.

Source https://stackoverflow.com/questions/56708872

QUESTION

Seeking explanation to Dask vs Numpy vs Pandas benchmark results

Asked 2018-Sep-05 at 12:09

I am trying to benchmark the performance of dask vs pandas.

...

ANSWER

Answered 2018-Sep-05 at 12:09

The chunks keyword is short for chunksize, not number of chunks

Source https://stackoverflow.com/questions/52182901

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install perfplot

First make sure that PCM works on your system. To do so go into the folder perfplot/pcm and follow the instructions for your OS. Short version for linux: cd perfplot/pcm make ./pcm.x 1. This will build and run the original pcm and run it outputting the timing at fixed intervals. If you see the output the pcm is ready to use. Otherwise resolve the issues as output by the program. If it fails to execute even after you fixed the permissons, try if it will work using root. If so it might be a problem that seems to occur with a recent kernel patch in linux - at the very bottom of this document is a suggested fix.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: