composable_kernel by ROCmSoftwarePlatform
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
C++ 126 Updated: 1 y ago License: Proprietary (Proprietary)
go-nvml by NVIDIA
Go Bindings for the NVIDIA Management Library (NVML)
C 125 Updated: 2 y ago License: Permissive (Apache-2.0)
dryad by m4b
An almost-parallel, semi-functioning, dynamic linker experiment, written in Rust
Rust 125 Updated: 3 y ago License: Permissive (BSD-3-Clause)
uVkCompute by google
A micro Vulkan compute pipeline and a collection of benchmarking compute shaders
C++ 125 Updated: 2 y ago License: Permissive (Apache-2.0)
cudarc by coreylowman
Safe Rust wrapper around the CUDA toolkit
Rust 125 Updated: 1 y ago License: Permissive (Apache-2.0)
GPUZen2 by wolfgangfengel
Example programs and source code for GPU Zen 2
C++ 124 Updated: 3 y ago License: No License (No License)
Yoda-Scheduler by Mr-Linus
Yoda is a Kubernetes scheduler based on GPU metrics.
Go 124 Updated: 2 y ago License: No License (No License)
Sniff CUDA ioctls
OpenCL-examples by Dakkers
Basic examples of OpenCL with the C++ API
C 122 Updated: 3 y ago License: No License (No License)
nvme-kmod by kaigai
A kernel module to support SSD-to-GPU direct DMA
C 122 Updated: 3 y ago License: Strong Copyleft (GPL-2.0)
opencl-test-ios by linusyang
Simple OpenCL demos for iOS and more
C 122 Updated: 3 y ago License: No License (No License)
A GPU profiling tool
k8s-device-plugin by 4paradigm
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access a larger memory space than the device's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
Go 122 Updated: 3 y ago License: Permissive (Apache-2.0)
Fractional-GPUs by sakjain92
Splits a single Nvidia GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions
C 121 Updated: 1 y ago License: No License (No License)
rocFFT by ROCmSoftwarePlatform
Next generation FFT implementation for ROCm
C++ 121 Updated: 1 y ago License: Permissive (MIT)
spoa by rvaser
SIMD partial order alignment tool/library
C++ 121 Updated: 1 y ago License: Permissive (MIT)
experiments by erwincoumans
Testbeds, random bits, and snippets, mainly for real-time physics/graphics development. The GPU rigid body pipeline has moved to a separate repository at http://github.com/bulletphysics/bullet3
C 121 Updated: 4 y ago License: No License (No License)
Hands-On-GPU-Programming-with-Python-and-CUDA by PacktPublishing
Hands-On GPU Programming with Python and CUDA, published by Packt
Python 120 Updated: 3 y ago License: Permissive (MIT)
Ray by sergcpp
Small pathtracing library with GPU and CPU backends
C++ 120 Updated: 1 y ago License: Permissive (MIT)
atelier-sync-fix by doitsujin
Workaround for low GPU utilization in recent Atelier games
C++ 119 Updated: 1 y ago License: Permissive (Zlib)
HolisticTraceAnalysis by facebookresearch
A library to analyze PyTorch traces.
Python 119 Updated: 1 y ago License: Permissive (MIT)
GPUSmoke by michal1000w
C++ 118 Updated: 1 y ago License: No License (No License)
libvc by uNetworking
Vulkan Compute for C++ (experimentation project)
C++ 118 Updated: 3 y ago License: Permissive (Zlib)
enoki by wjakob
C++ 117 Updated: 3 y ago License: Permissive (BSD-3-Clause)
JAXFLUIDS by tumaer
Differentiable Fluid Dynamics Package
Python 117 Updated: 2 y ago License: Proprietary (Proprietary)
Vulpes by fsprojects
Vulpes: a Deep Belief Net written in F#, using Alea.cuBase to access the GPU.
JavaScript 116 Updated: 3 y ago License: Permissive (MIT)
QuIP by nasa
QuIP provides an interactive environment for computing and presenting images and image sequences, manipulating and storing arbitrary data, and general scientific computing and plotting. The current release supports Unix-like operating systems (tested on Linux and Mac OSX) and Apple's iOS mobile operating system. GPU acceleration is supported with either CUDA or OpenCL. There is built-in support for psychophysical experimentation, with general-purpose staircase routines and analysis of psychometric functions.
C 116 Updated: 3 y ago License: Proprietary (Proprietary)
Fuser by NVIDIA
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++ 116 Updated: 1 y ago License: Proprietary (Proprietary)
tf_gpu_manager by QuantumLiu
A GPU device manager that chooses the freest GPU.
Python 115 Updated: 3 y ago License: Permissive (MIT)
pytorch-unflow by sniklaus
A reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Python 115 Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Marx by RUB-SysSec
Uncovering Class Hierarchies in C++ Programs
C++ 115 Updated: 2 y ago License: Permissive (MIT)
npcuda-example by rmcgibbo
Example Python (NumPy) -- CUDA installable package with a C-extension library
C 115 Updated: 3 y ago License: Permissive (BSD-2-Clause)
light-the-torch by pmeier
Install PyTorch distributions with computation backend auto-detection
Python 113 Updated: 1 y ago License: Permissive (BSD-3-Clause)
trove by bryancatanzaro
Full-speed Array of Structures access
C++ 113 Updated: 3 y ago License: Permissive (BSD-3-Clause)
GPU accelerated JPEG decoder
TAN by GPUOpen-LibrariesAndSDKs
AMD TrueAudio Next is a software development kit for GPU accelerated audio signal processing
C++ 113 Updated: 3 y ago License: Permissive (MIT)
training-material by gjbex
A collection of code examples as well as presentations for training purposes
Jupyter Notebook 113 Updated: 2 y ago License: Permissive (CC-BY-4.0)
pytorch-bilstmcrf by kaniblu
Python 112 Updated: 3 y ago License: Permissive (MIT)
DifferentSLIAuto by EmberVulpix
C# 112 Updated: 2 y ago License: No License (No License)
arrayfire-js by arrayfire
ArrayFire.js - ArrayFire for Node.js
C++ 112 Updated: 3 y ago License: Permissive (BSD-3-Clause)
rocPRIM by ROCmSoftwarePlatform
ROCm Parallel Primitives
C++ 112 Updated: 1 y ago License: Permissive (MIT)
celerity-runtime by celerity
High-level C++ for Accelerator Clusters
C++ 111 Updated: 1 y ago License: Permissive (MIT)
pymapd by omnisci
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
Python 110 Updated: 3 y ago License: Permissive (Apache-2.0)
PBF-CUDA by naeioi
Position Based Fluids CUDA implementation
C++ 110 Updated: 1 y ago License: No License (No License)
mpr by mkeeter
Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
C++ 109 Updated: 3 y ago License: No License (No License)
GooFit by GooFit
Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
C++ 109 Updated: 2 y ago License: Weak Copyleft (LGPL-3.0)
ArborX by arborx
Performance-portable geometric search library
C++ 109 Updated: 1 y ago License: Permissive (BSD-3-Clause)
wspe by am0nsec
Windows System Programming Experiments
C 108 Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
encog-c by jeffheaton
The Encog project for C/C++
C 108 Updated: 4 y ago License: No License (No License)
NumpyWDL by stasi009
Implementation of the Wide & Deep algorithm using NumPy
Python 107 Updated: 3 y ago License: No License (No License)