80
195
285
Our great sponsors
Mentions
@
|
Stars | Project | Description |
---|---|---|---|
147 | 15,260 | Instant neural graphics primitives: lightning fast NeRF and more | |
3 | 15,106 | LLM training in simple, raw C/CUDA | |
2 | 6,091 | Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189 | |
4 | 4,190 | The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation" | |
1 | 3,264 | Squeeze-and-Excitation Networks | |
6 | 1,566 | cuGraph - RAPIDS Graph Analytics Library | |
5 | 1,217 | CUDA Library Samples | |
2 | 1,035 | FSA/FST algorithms, differentiable, with PyTorch compatibility. | |
1 | 1,001 | Efficient GPU kernels for block-sparse matrix multiplication and convolution | |
1 | 748 | Automatically exported from code.google.com/p/cuda-convnet2 | |
1 | 652 | NCCL Tests | |
1 | 622 | Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch | |
3 | 608 | RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications. | |
1 | 557 | Fast, gpu-based CSV parser | |
12 | 486 | Instant neural graphics primitives: lightning fast NeRF and more | |
1 | 461 | Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs | |
1 | 430 | MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment | |
2 | 364 | Flash Attention in ~100 lines of CUDA (forward pass only) | |
4 | 294 | GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications. | |
1 | 269 | Neighborhood Attention Extension. Bringing attention to a neighborhood near you! |
Popular Cuda Topics
Latest Mentions
Latest mentioned Cuda repos
Stars | Project |
---|---|
15,106 | llm.c |
1 | cuda-1brc |
294 | dietgpu |
364 | flash-attention-minimal |
0 | blog-code |
608 | raft |
5 | tuna |
269 | NATTEN |
4 | build-nccl-tests-with-pytorch |
23 | GPUODEBenchmarks |
186 | RWKV-CUDA |
153 | causal-conv1d |
67 | ABMGPU |
7 | gpu-desktop-calculator |
1 | DOKSparse |
57 | gdlog |
1,566 | cugraph |
22 | Harmonia_for_B_plus_trees |
0 | MandelbrotExplorer |
105 | Parallel-Computing-Cuda-C |
Latest Discoveries
Latest discovered Cuda repos
Stars | Project |
---|---|
1 | cuda-1brc |
15,106 | llm.c |
364 | flash-attention-minimal |
0 | blog-code |
5 | tuna |
269 | NATTEN |
4 | build-nccl-tests-with-pytorch |
23 | GPUODEBenchmarks |
153 | causal-conv1d |
67 | ABMGPU |
7 | gpu-desktop-calculator |
57 | gdlog |
22 | Harmonia_for_B_plus_trees |
0 | MandelbrotExplorer |
105 | Parallel-Computing-Cuda-C |
7 | GCGT |
652 | nccl-tests |
622 | cuda_programming |
1 | DOKSparse |
7 | Scalix |