im2col | Python implementation to illustrate im2col
kandi X-RAY | im2col Summary
Python implementation to illustrate im2col, which is used in Conv2D computation.
im2col Examples and Code Snippets
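The page lists no snippets itself, so here is a minimal NumPy sketch of the im2col idea the project illustrates (the function name and shapes are my own illustration, not the project's actual API):

```python
import numpy as np

def im2col(image, kh, kw, stride=1):
    """Rearrange the (C, H, W) image into a column matrix.

    Each output column holds one kh x kw receptive field flattened,
    so a convolution becomes a single matrix multiplication.
    """
    c, h, w = image.shape
    out_h = (h - kh) // stride + 1
    out_w = (w - kw) // stride + 1
    cols = np.empty((c * kh * kw, out_h * out_w), dtype=image.dtype)
    for i in range(out_h):
        for j in range(out_w):
            # flatten one receptive field into one column
            patch = image[:, i*stride:i*stride+kh, j*stride:j*stride+kw]
            cols[:, i * out_w + j] = patch.ravel()
    return cols
```

With a 2-channel 4x4 input and a 3x3 window, this produces an 18x4 matrix: 4 output positions, each described by 2*3*3 = 18 input values.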
Community Discussions
Trending Discussions on im2col
QUESTION
I was trying to speed up a forward pass in a convolutional layer by converting my image data into a column matrix, essentially turning the convolution into a matrix multiplication problem.
[idea from https://sahnimanas.github.io/post/anatomy-of-a-high-performance-convolution/]
I did so by first implementing an im2col function from Caffe's official GitHub.
...ANSWER
Answered 2020-Dec-18 at 02:50
Oh, I found out what the problem was. In my original code, I set the bias as
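The answer above points at the bias term: in the im2col formulation, the bias must be shaped so it broadcasts per output filter, not per element. A hedged NumPy sketch of the whole forward pass (names and shapes are illustrative, not taken from the asker's code):

```python
import numpy as np

def conv2d_im2col(image, weights, bias, stride=1):
    """Forward conv as one GEMM: (F, C*kh*kw) @ (C*kh*kw, out_h*out_w)."""
    c, h, w = image.shape
    f, _, kh, kw = weights.shape
    out_h = (h - kh) // stride + 1
    out_w = (w - kw) // stride + 1
    # build the column matrix (one flattened receptive field per column)
    cols = np.empty((c * kh * kw, out_h * out_w), dtype=image.dtype)
    for i in range(out_h):
        for j in range(out_w):
            patch = image[:, i*stride:i*stride+kh, j*stride:j*stride+kw]
            cols[:, i * out_w + j] = patch.ravel()
    # bias must be (F, 1) so each filter's bias broadcasts across positions
    out = weights.reshape(f, -1) @ cols + bias.reshape(f, 1)
    return out.reshape(f, out_h, out_w)
```

A quick check against a naive triple loop over filters and output positions confirms the two formulations agree.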
QUESTION
My intention is to prepare a sound wave file and to arrange the training and testing process via sound.c. I ran into an error while compiling darknet; I need your help!
make: gcc: command not found
Makefile:175: recipe for target 'obj/sound.o' failed
make: *** [obj/sound.o] Error 127
Ubuntu LTS 18.04, CUDA 11.1
@wbcalex-desktop:~$ sudo apt install gcc
[sudo] password for wbcalex:
Reading package lists... Done
Building dependency tree
Reading state information... Done
gcc is already the newest version (4:7.4.0-1ubuntu2.3).
The following package was automatically installed and is no longer required:
linux-hwe-5.4-headers-5.4.0-47
Use 'sudo apt autoremove' to remove it.
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
desktop:~$ cd darknet_tmp
desktop:~/darknet_tmp$ make
gcc -Iinclude/ -I3rdparty/stb/include -DGPU -I/usr/local/cuda/include/ -DCUDNN -Wall -Wfatal-errors -Wno-unused-result -Wno-unknown-pragmas -fPIC -Ofast -DGPU -DCUDNN -I/usr/local/cudnn/include -fPIC -c ./src/sound.c -o obj/sound.o
make: gcc: Command not found
Makefile:175: recipe for target 'obj/sound.o' failed
make: *** [obj/sound.o] Error 127
ANSWER
Answered 2020-Nov-30 at 18:34
The PATH variable isn't updated correctly; $PATH is not Makefile syntax (in a Makefile, a variable is referenced as $(PATH)). Fix:
QUESTION
I'm making 'conditional mini pooling'.
However, applying this to Keras' CNN results in the following error:
TypeError: The added layer must be an instance of class Layer. Found: <__main__.MiniPooling2D object at 0x0000020E0BDD8B00>
Here is my conditional mini pooling:
MiniPooling
...ANSWER
Answered 2020-Sep-09 at 05:14
Your problem is that you do not subclass the Layer class from Keras.
Have a look at this example (from the official documentation):
https://www.tensorflow.org/guide/keras/custom_layers_and_models
So you will have to implement something like:
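The linked guide shows the subclass pattern: define the arithmetic in the layer's call() method and inherit from tf.keras.layers.Layer. The pooling arithmetic itself can be prototyped separately; below is a NumPy sketch of plain k×k max pooling via im2col-style patch extraction (my own illustration of the mechanics, not the asker's conditional variant):

```python
import numpy as np

def max_pool2d(image, k=2, stride=2):
    """Max pooling over a (C, H, W) image via patch extraction.

    Each column of `cols` is one k*k window, so pooling reduces
    to a max over the window axis.
    """
    c, h, w = image.shape
    out_h = (h - k) // stride + 1
    out_w = (w - k) // stride + 1
    cols = np.empty((c, k * k, out_h * out_w), dtype=image.dtype)
    for i in range(out_h):
        for j in range(out_w):
            window = image[:, i*stride:i*stride+k, j*stride:j*stride+k]
            cols[:, :, i * out_w + j] = window.reshape(c, -1)
    return cols.max(axis=1).reshape(c, out_h, out_w)
```

Once the arithmetic works, wrapping it in a class that subclasses tf.keras.layers.Layer (per the documentation above) is what resolves the TypeError.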
QUESTION
I'm trying to train YOLOv4 with darknet on a computing cluster, but when I make darknet, this error occurred:
ANSWER
Answered 2020-Jul-20 at 20:22
On a machine with a GPU (and driver) installed, the -lcuda dependency can usually be satisfied because the driver installs libcuda.so (or the equivalent on Windows) in the link search path (typically).
However, on a machine with no GPU installed (e.g. a login node or build machine in a cluster), the driver won't be installed, and therefore libcuda.so won't be in the "usual place".
In these situations, "stub" libraries are provided, usually in the /stubs directory off the CUDA toolkit library install directory (e.g. /usr/local/cuda/lib64).
Therefore, if you change your Makefile at this line to read:
QUESTION
I am writing to request guidance in optimizing my solution / method "CalculateConvolutionOutputTensor__im2col". I would like help determining the best strategy for moving beyond my naive approach; offerings of intuition about any relevant GPU processes and how they apply (e.g., bank conflicts); and help interpreting the above profile in terms of what I can tweak.
A first run of the method takes 0.774 seconds using a GeForce 2080 Ti. I have included a screenshot of the Nsight Compute profile of the only CUDA C++ kernel I have written: im2col.
Things I Could Do
I could have each GPU thread access shared memory instead of global memory. I could transfer GPU "heap" variables to the kernel "stack" instead of dereferencing them for every thread and in-kernel for-loop iteration. I could put small parameters into arrays in GPU memory and pass single pointers to those arrays. I could use a more sophisticated version of im2col.
Things I Have Tried
I would prefer not to use cuDNN 7.6.5; when I use cuDNN 7.6.5 and write the statement "cudnnCreate(&cudnnHandle);", Nsight Compute suggests that method cuModuleGetFunction returns CUDA_ERROR_NOT_FOUND.
Recreating Solution
The procedure I used to create this project was to create a new CUDA 10.2 Runtime project using Visual Studio Community 2019, rename the default source file to "main.cu", replace all contents with the first code block below, add "CalculateConvolutionOutputTensor__im2col.h" to my project, add the second code block below, add "CalculateConvolutionOutputTensor__im2col.cu" to my project, add the third code block below, and add "cublas.lib;" to Project Properties -> Linker -> Input -> Additional Dependencies.
main.cu
...ANSWER
Answered 2020-May-12 at 06:31
After reading through the NVIDIA articles that Robert Crovella provided me, I rewrote my solution "CalculateConvolutionOutputTensor__im2col" to have threads in each block load from contiguous global memory. I used less indexing arithmetic and fewer parameters. I saw a method speed-up of (1 method / 0.445 s) / (1 method / 0.774 s) = 1.7, and an im2col kernel speed-up of (1 kernel / 35.27 ms) / (1 kernel / 128.15 ms) = 3.6. Thanks for pointing me to useful specific reading.
im2col used to take 128.15 ms; now it takes only 32.12 ms. Sgemm takes 6.34 ms now; probably took about the same then. Their total is 38.46 ms. The pair is run four times, for a total of 153.84 ms. I wonder how to speed up im2col more, and to reduce the 274.16 ms in "overhead".
To sculpt an image into matrix col, I had the (3*590/2) threads in each of (2*590*19*19) blocks transfer half cross sections of a filter-shaped portion of an image sequentially to col. I believe that each thread loaded from global memory physically adjacent to the memory accessed by the previous thread, and that each thread stored to global memory physically adjacent to the memory stored to by the previous thread. I did notice that 11 threads in the last warp in each block went unused.
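The same contiguity concern has an analogue in NumPy, the language of the reference implementation this page describes: sliding_window_view exposes every receptive field as a strided view, and the final reshape copies the patches in one contiguous pass rather than scattered per-element stores (a sketch under the layout assumptions above; requires NumPy >= 1.20):

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def im2col_vectorized(image, kh, kw):
    """im2col with no Python-level loop over output pixels.

    sliding_window_view yields shape (c, out_h, out_w, kh, kw); the
    transpose orders rows as (c, kh, kw) to match a loop-based im2col,
    and reshape materializes the column matrix in one contiguous copy.
    """
    c, h, w = image.shape
    out_h, out_w = h - kh + 1, w - kw + 1
    windows = sliding_window_view(image, (kh, kw), axis=(1, 2))
    return windows.transpose(0, 3, 4, 1, 2).reshape(c * kh * kw, out_h * out_w)
```

On a 1x3x4 input with a 2x2 window this yields a 4x6 matrix whose first column is the top-left window, flattened.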
I think I might take th31 up on their suggestion and move this optimization thread to Code Review.
Nsight Compute profile of im2col with coalesced global memory loads and stores
main.cu
Community Discussions and Code Snippets contain sources that include Stack Exchange Network.
Vulnerabilities
No vulnerabilities reported
Install im2col
You can use im2col like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.