Cuda Posts

Latest Cuda related posts with mentions of open-source projects
  • Meta Open-Sources Megalodon LLM for Efficient Long Sequence Modeling – InfoQ

    1 project | news.ycombinator.com | 1 day ago
  • Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

    2 projects | news.ycombinator.com | 1 day ago
  • So you want to rent an Nvidia H100 cluster? 2024 Consumer Guide

    1 project | news.ycombinator.com | 5 days ago
  • Karpathy: Let's reproduce GPT-2 (1.6B): one 8XH100 node 24h $672 in llm.c

    1 project | news.ycombinator.com | 7 days ago
  • Show HN: UNet diffusion model in pure CUDA

    2 projects | news.ycombinator.com | 20 days ago
  • Fork of Llm.c for AMD Devices

    1 project | news.ycombinator.com | about 1 month ago
  • ThunderKittens: A framework to write fast deep learning kernels in CUDA

    1 project | news.ycombinator.com | about 1 month ago
  • Grokfast: Accelerated Grokking by Amplifying Slow Gradients

    2 projects | news.ycombinator.com | about 1 month ago
  • Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20

    1 project | news.ycombinator.com | about 2 months ago
  • Jaxsplat: 3D Gaussian Splatting for Jax

    2 projects | news.ycombinator.com | about 2 months ago
  • Welcome to the Parallel Future of Computation

    5 projects | news.ycombinator.com | about 2 months ago
  • Bend a Parallel Language

    1 project | news.ycombinator.com | 2 months ago
  • Bend: A higher order language for the GPU

    1 project | news.ycombinator.com | 2 months ago
  • ThunderKittens: Tile Primitives for Speedy Kernels

    1 project | news.ycombinator.com | 2 months ago
  • Bend: A High-Level GPU Language Powered by HVM2

    1 project | news.ycombinator.com | 2 months ago
  • Bend: A Python-Like Parallel Language for GPUs and Multicore CPUs

    1 project | news.ycombinator.com | 2 months ago
  • SimpleGEMM

    1 project | news.ycombinator.com | 2 months ago
  • How hard can generating 1024-bit primes be?

    4 projects | news.ycombinator.com | 2 months ago
  • Llm.c State of the Union

    1 project | news.ycombinator.com | 2 months ago
  • CUDA Checkpoint and Restore

    1 project | news.ycombinator.com | 3 months ago
  • Ask HN: Yo Nephew, in E. Africa, wants to train an LLM with on disk Wikipedia

    1 project | news.ycombinator.com | 3 months ago
  • Show HN: One Billion Rows in CUDA

    1 project | news.ycombinator.com | 3 months ago
  • The Simple Beauty of XOR Floating Point Compression

    1 project | news.ycombinator.com | 3 months ago
  • Show HN: Faster sorting with register shuffling in CUDA

    1 project | news.ycombinator.com | 4 months ago
  • Raft: Fundamental widely-used algorithms and primitives for machine learning

    1 project | news.ycombinator.com | 5 months ago
  • A Fast FP16xFP4 Gemm CUDA Kernel

    1 project | news.ycombinator.com | 6 months ago
  • Direct Pixel-Space Megapixel Image Generation with Diffusion Models

    1 project | news.ycombinator.com | 6 months ago
  • Show HN: Build NCCL-Tests and Configure SSHD in PyTorch Container

    1 project | news.ycombinator.com | 6 months ago
  • Show HN: Demo of Agent Based Model on GPU with CUDA and OpenGL (Windows/Linux)

    1 project | /r/hypeurls | 8 months ago
  • Show HN: GPU Desktop Calculator

    1 project | news.ycombinator.com | 8 months ago
  • Punica: Serving multiple LoRA finetuned LLM as one

    1 project | news.ycombinator.com | 8 months ago
  • CuGraph – GPU-accelerated graph analytics

    1 project | news.ycombinator.com | 9 months ago
  • A High Throughput B+tree for SIMD Architectures [pdf]

    2 projects | news.ycombinator.com | 10 months ago
  • Parallel Computing Using Cuda-C

    1 project | /r/CUDA | 12 months ago
  • I want a 3d scanner...

    1 project | /r/3Dprinting | about 1 year ago
  • Has anyone tried out Squeezellm?

    1 project | /r/LocalLLaMA | about 1 year ago
  • Scanning in real life environments to be viewed in VR >>> taking pictures. Simple process from video -> render, using instant-ngp

    1 project | /r/virtualreality | about 1 year ago
  • How about Ranger Green?

    1 project | /r/airsoft | about 1 year ago
  • Roast my MC kit

    1 project | /r/airsoft | about 1 year ago
  • I started reading about CUDA programming and I don't see what makes it better than CPU programming

    1 project | /r/learnprogramming | about 1 year ago
  • Has anyone tried to generate images from enough angles to feed Nvidia Nerf to make 3D models?

    1 project | /r/StableDiffusion | about 1 year ago
  • Instant NPG: how do minimize noise and maximize quality? Tips welcome!

    1 project | /r/computervision | about 1 year ago
  • tensor.to_sparse() Memory Allocation

    1 project | /r/pytorch | about 1 year ago
  • GPU implementation of shortest path?

    1 project | /r/learnpython | over 1 year ago
  • I NeRF'd the new Taco Bell on Rt. 40

    1 project | /r/Delaware | over 1 year ago
  • [Mediasynthesis] Les meilleurs modèles d’IA pour un upscaling de résolution d’image ?

    1 project | /r/enfrancais | over 1 year ago
  • Scalix: A Data Parallel Compute Framework w/ Automatic Scaling

    1 project | /r/HPC | over 1 year ago
  • How ? Title: A glitch in the Matrix discovered.

    1 project | /r/CaptainDisillusion | over 1 year ago
  • [P] Clustering face embeddings (512d) using GCN's (not knowing the amount of needed clusters)

    1 project | /r/MachineLearning | over 1 year ago
  • Why this video is sooo good but he's sooo underrated... Y'all should watch it, it's perfect.

    1 project | /r/AyyMD | over 1 year ago