CUDA-based NumPy
R interface to use GPU's
somoclu by peterwittek
Massively parallel self-organizing maps: accelerate training on multicore CPUs, GPUs, and clusters
C 239 Updated: 2 y ago License: Permissive (MIT)
Juice-Labs by Juice-Labs
Juice Community Version Public Release
JavaScript 238 Updated: 2 y ago License: Permissive (MIT)
LuisaRender by LuisaGroup
High-Performance Cross-Platform Monte Carlo Renderer Based on LuisaCompute
C++ 237 Updated: 2 y ago License: Permissive (BSD-3-Clause)
GPUZen by wolfgangfengel
contains the source code accompanying the book GPU Zen.
C 236 Updated: 4 y ago License: No License (No License)
core by mnmlstc
C++14 (and beyond) library features implemented in C++11
C++ 236 Updated: 4 y ago License: Proprietary (Proprietary)
mixbench by ekondis
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
C++ 234 Updated: 2 y ago License: Strong Copyleft (GPL-2.0)
antares by microsoft
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.
Python 233 Updated: 2 y ago License: Proprietary (Proprietary)
orrb by openai
Code for the paper "OpenAI Remote Rendering Backend"
C# 233 Updated: 2 y ago License: Proprietary (Proprietary)
rwkv-cpp-accelerated by harrisonvanderbyl
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependencies
C++ 232 Updated: 2 y ago License: Permissive (MIT)
CuAssembler by cloudcores
An unofficial cuda assembler, for all generations of SASS, hopefully :)
Python 231 Updated: 2 y ago License: Permissive (MIT)
Volumetric-Path-Tracer by sergeneren
Volumetric path tracer using cuda
C++ 231 Updated: 2 y ago License: Permissive (BSD-3-Clause)
ssd-gpu-dma by enfiskutensykkel
Build userspace NVMe drivers and storage applications with CUDA support
C 231 Updated: 2 y ago License: Permissive (BSD-2-Clause)
RayTracing by AlexanderVeselov
Realtime GPU Path tracer based on OpenCL and OpenGL
C++ 230 Updated: 2 y ago License: Permissive (MIT)
GPU-Pathtracer by jan-van-bergen
GPU Pathtracer from scratch in C++/CUDA
C++ 230 Updated: 3 y ago License: Permissive (MIT)
ccminer by cbuchner1
Christian Buchner's & Christian H.'s CUDA miner project
C 229 Updated: 4 y ago License: Proprietary (Proprietary)
torchsynth by torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
Python 229 Updated: 2 y ago License: Permissive (Apache-2.0)
nvidia-clerk by ianmarmour
A cross-platform go bot that tracks for availability of stock from Nvidia's store and adds a cart to your checkout.
Go 227 Updated: 4 y ago License: Permissive (BSD-3-Clause)
z3.rs by prove-rs
Rust bindings for the Z3 solver.
Rust 226 Updated: 2 y ago License: No License (No License)
BabelStream by UoB-HPC
STREAM, for lots of devices written in many programming models
C++ 225 Updated: 2 y ago License: Proprietary (Proprietary)
NVIDIA-vBIOS-VFIO-Patcher by Matoking
A Python script to patch NVIDIA vBIOS dumps into a format compatible with VFIO passthrough
Python 223 Updated: 2 y ago License: Permissive (CC0-1.0)
3DUNDERWORLD-SLS-GPU_CPU by theICTlab
A structured light scanner
C++ 222 Updated: 2 y ago License: Weak Copyleft (LGPL-3.0)
dask-cuda by rapidsai
Utilities for Dask and CUDA interactions
Python 221 Updated: 2 y ago License: Permissive (Apache-2.0)
mochi-diffusion by godly-devotion
Run Stable Diffusion on Apple Silicon Macs natively
Swift 220 Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
azure-gaming by ecalder6
Cloud Gaming Made Easy
PowerShell 218 Updated: 4 y ago License: Permissive (MIT)
pytorch-syncbn by tamakoji
Synchronized Multi-GPU Batch Normalization
Python 217 Updated: 4 y ago License: Permissive (MIT)
GPU-Pro-6 by wolfgangfengel
This is the software repository for the book GPU Pro 6.
C++ 217 Updated: 4 y ago License: No License (No License)
hybridizer-basic-samples by altimesh
Examples of C# code compiled to GPU by hybridizer
C# 215 Updated: 2 y ago License: Permissive (MIT)
libhermit by hermitcore
HermitCore: A C-based, lightweight unikernel
C 215 Updated: 2 y ago License: Permissive (BSD-3-Clause)
RTXDI by NVIDIAGameWorks
C++ 215 Updated: 2 y ago License: Proprietary (Proprietary)
kmeans by serban
A CUDA implementation of the k-means clustering algorithm
C 214 Updated: 4 y ago License: Permissive (MIT)
CUDA-by-Example-source-code-for-the-book-s-examples- by CodedK
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through working examples.
C 213 Updated: 2 y ago License: Permissive (MIT)
nv_peer_memory by Mellanox
C 213 Updated: 2 y ago License: No License (No License)
genv by run-ai
GPU environment management and cluster orchestration
Python 213 Updated: 2 y ago License: Strong Copyleft (AGPL-3.0)
stardust-core by stardustjs
Stardust: Create GPU-based Visualizations
TypeScript 210 Updated: 4 y ago License: No License (No License)
bohrium by bh107
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
C++ 210 Updated: 4 y ago License: Permissive (Apache-2.0)
fastflow by fastflow
FastFlow pattern-based parallel programming framework (formerly on sourceforge)
C++ 209 Updated: 2 y ago License: Strong Copyleft (GPL-2.0)
OpenCL-ICD-Loader by KhronosGroup
The OpenCL ICD Loader project.
C 208 Updated: 2 y ago License: Permissive (Apache-2.0)
primus_vk by felixdoerre
Vulkan GPU-offloading layer
C++ 207 Updated: 4 y ago License: Permissive (BSD-2-Clause)
AutoDock-GPU by ccsb-scripps
AutoDock for GPUs and other accelerators
C++ 205 Updated: 2 y ago License: Strong Copyleft (GPL-2.0)
gmonitor by mountassir
gmonitor is a GPU monitor (Nvidia only at the moment)
C++ 204 Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
jetson-nano-image by pythops
Create a minimalist, Ubuntu based image for Nvidia jetson nano board
Shell 204 Updated: 2 y ago License: No License (No License)
PointPillars_MultiHead_40FPS by hova88
A REAL-TIME 3D detection network [Pointpillars] compiled by CUDA/TensorRT/C++.
C++ 204 Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
copperhead by bryancatanzaro
Data Parallel Python
Python 203 Updated: 4 y ago License: Permissive (Apache-2.0)
person-reid-3d by layumi
Parameter-Efficient Person Re-identification in the 3D Space
Python 203 Updated: 4 y ago License: Permissive (MIT)
msdfgl by nyyManni
OpenGL implementation of the MSDF algorithm
C 203 Updated: 2 y ago License: Permissive (MIT)
Single-GPU-passthrough-amd-nvidia by ilayna
My way of doing single gpu passthrough the simplest way, I've gathered many sources together to make the perfect Single GPU passthrough guide the simplest and easiest way.
Shell 201 Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
faiss-wheels by kyamagu
Unofficial faiss wheel builder
Python 200 Updated: 2 y ago License: Permissive (MIT)
genn by genn-team
GeNN is a GPU-enhanced Neuronal Network simulation environment based on code generation for Nvidia CUDA.
C++ 200 Updated: 2 y ago License: Strong Copyleft (GPL-2.0)