143
293
530
|
Mentions
@
|
Stars | Project | Description |
|---|---|---|---|
| 6 | 2,884 | This package contains the original 2012 AlexNet code. | |
| 1 | 31 | RLVR training for LLM in CUDA/C++ | |
| 1 | 4 | ||
| 1 | 0 | High-Performance GPU Bloom Filter |
Popular Cuda Topics
Latest Mentions
Latest mentioned Cuda repos
| Stars | Project |
|---|---|
| 31 | RL.cu |
| 0 | cuSBF |
| 4 | ThriftAttention |
| 2,884 | AlexNet-Source-Code |
| 7,334 | DeepGEMM |
| 87 | gdlog |
| 278 | parrot |
| 23 | FlashKDA |
| 30,117 | llm.c |
| 2 | luce-megakernel |
| 23 | llama.cpp-jetson |
| 9 | polar_quant |
| 39 | Cuckoo-GPU |
| 54 | KernelBlaster |
| 1 | cunningham-chain-search |
| 4 | cuda-unified-memory-analyzer |
| 896 | causal-conv1d |
| 2 | gpu-pcie-path-validator |
| 226 | bam |
| 52 | qwen_megakernel |
Latest Discoveries
Latest discovered Cuda repos
| Stars | Project |
|---|---|
| 31 | RL.cu |
| 0 | cuSBF |
| 4 | ThriftAttention |
| 23 | FlashKDA |
| 2 | luce-megakernel |
| 23 | llama.cpp-jetson |
| 9 | polar_quant |
| 54 | KernelBlaster |
| 1 | cunningham-chain-search |
| 4 | cuda-unified-memory-analyzer |
| 2 | gpu-pcie-path-validator |
| 226 | bam |
| 52 | qwen_megakernel |
| 24 | cuda-fp8-ampere |
| 13 | yali |
| 6 | libcusort |
| 2 | Ricci-curvature-clustering-CUDA |
| 39 | Cuckoo-GPU |
| 4 | USM-Core |
| 0 | hello-cuda |
Recently updated posts
-
Rl.cu: Training LLM RL with Pure CUDA
-
Show HN: CuSBF – Faster GPU Bloom Filter for Sequence Data
-
ThriftAttention: Selective Mixed Precision for Long-Context FP4 Attention
-
AlexNet Source Code
-
Optimizing Datalog for the GPU