Project | Mentions | Stars | Description |
---|---|---|---|
dietgpu | 5 | 325 | GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications. |
Latest Mentions
Latest mentioned CUDA repos
Stars | Project |
---|---|
325 | dietgpu |
2,564 | AlexNet-Source-Code |
4 | prospero.vm |
1,397 | SageAttention |
491 | SpargeAttn |
95 | cuda_examples |
327 | GPUSorting |
3 | cuda_bwt |
7,454 | DeepEP |
123 | BenchmarkCustomPTX |
11,492 | FlashMLA |
57 | optimized-fused-ssim |
3 | cpp4fun |
1,532 | nunchaku |
47 | BinaryGPUIndex |
173 | array-language-comparisons |
1,189 | k2 |
26,399 | llm.c |
16,534 | instant-ngp |
47 | llama3.cu |
Latest Discoveries
Latest discovered CUDA repos
Stars | Project |
---|---|
4 | prospero.vm |
491 | SpargeAttn |
1,397 | SageAttention |
2,564 | AlexNet-Source-Code |
95 | cuda_examples |
3 | cuda_bwt |
327 | GPUSorting |
123 | BenchmarkCustomPTX |
57 | optimized-fused-ssim |
3 | cpp4fun |
7,454 | DeepEP |
11,492 | FlashMLA |
1,532 | nunchaku |
47 | BinaryGPUIndex |
47 | llama3.cu |
37 | cudacracker |
1 | uuidminer |
1 | cuda-hello-world |
799 | Nanoflow |
5 | cuda-utils |
Recently updated posts
- 70% Size, 100% Accuracy: Lossless LLM Compression via Dynamic-Length Float
- Package contains the original 2012 AlexNet code
- The Original 2012 AlexNet Is Open Source Now
- AlexNet-Source-Code: The original 2012 AlexNet code
- Jagged Flash Attention Optimization