Rust-CUDA: writing and executing extremely fast GPU code fully in Rust

rust-gpu

Mentions: 82 | Stars: 6,930 | Activity: 8.2 | Language: Rust

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

The best way to do it is probably the way rust-gpu does it: https://github.com/EmbarkStudios/rust-gpu/blob/main/docs/src...
The entry point of the kernel would supply any objects that have special properties.

Rust-CUDA

Mentions: 5 | Stars: 1,120 | Activity: 8.8 | Language: Rust

Discontinued. Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust. [Moved to: https://github.com/Rust-GPU/Rust-CUDA] (by RDambrosio016)

https://github.com/RDambrosio016/Rust-CUDA/blob/master/guide...
* Missing atomics -- gamebreaker IMO. Atomics are absolutely essential when you regularly deal with 10,000+ threads. You'll inevitably come across a shared data structure that needs write access from every thread, and that requires some coordination mechanism. Atomics are one important fit for that.
Ironically, a few days ago I argued for fork-join parallelism in most cases (aka kernel launch / synchronized kernel exit). Now, in a topic about missing atomics, I find myself arguing the opposite. Atomics need to be used very, very rarely, but those rare uses are incredibly important.
* Warp vote / match / reduce / shuffle missing (very useful tools for highly optimized code, but you can write slower code that does the same thing through __shared__ memory just fine).
------
Wait, does this support __shared__ memory at all? Raw access to memory is not really amenable to Rust's programming style, but it's absolutely necessary for high-performance GPU programming.
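
To make the atomics point concrete, here is a minimal CPU-side sketch in plain Rust. It is not Rust-CUDA code and uses no GPU API. It assumes std scoped threads as a stand-in for a kernel launch (fork) and synchronized exit (join), and `fetch_add` as a stand-in for CUDA's `atomicAdd`. The `histogram` function and its parameters are hypothetical names chosen for illustration.

```rust
use std::sync::atomic::{AtomicU32, Ordering};
use std::thread;

/// Hypothetical helper: counts `data` into `bins` buckets using `workers` threads.
/// Every thread writes to the same shared bucket array, so the increments
/// must be atomic; this is the coordination mechanism the comment refers to.
fn histogram(data: &[u32], bins: usize, workers: usize) -> Vec<u32> {
    let counts: Vec<AtomicU32> = (0..bins).map(|_| AtomicU32::new(0)).collect();
    // Ceiling division so every element lands in exactly one chunk;
    // the `max(1)` calls guard against zero workers and empty input.
    let workers = workers.max(1);
    let chunk = ((data.len() + workers - 1) / workers).max(1);

    // Fork: one scoped thread per chunk (loosely analogous to a kernel launch).
    thread::scope(|s| {
        for part in data.chunks(chunk) {
            let counts = &counts;
            s.spawn(move || {
                for &value in part {
                    // Relaxed ordering is enough for independent counters.
                    counts[value as usize % bins].fetch_add(1, Ordering::Relaxed);
                }
            });
        }
    });
    // Join: `scope` returns only after every spawned thread has finished.

    counts.into_iter().map(|c| c.into_inner()).collect()
}

fn main() {
    let data: Vec<u32> = (0..10_000).collect();
    let hist = histogram(&data, 4, 8);
    println!("{:?}", hist); // prints [2500, 2500, 2500, 2500]
}
```

The fork-join structure handles the synchronization at the boundaries, and the atomic add covers the one place inside the parallel region where threads genuinely share a write target, which matches the "rare but important" framing above.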

wgpu

Mentions: 195 | Stars: 10,846 | Activity: 9.9 | Language: Rust

Cross-platform, safe, pure-Rust graphics API.

> "Extremely fast"
When people make claims like this, it would be good if they put the benchmarks on the first page. E.g., how does it compare with https://github.com/gfx-rs/wgpu, which lets you target Vulkan, Metal, DX, GL or WASM+WebGPU with Rust?

gpgpu-rs

Mentions: 8 | Stars: 135 | Activity: 3.8 | Language: Rust

Simple experimental async GPGPU framework for Rust

Would be really nice to have an actual cross-platform GPGPU library. Having only vendor-locked options is holding back every kind of progress.
Maybe WebGPU will be capable of compute to the extent that CUDA isn't necessary. https://github.com/UpsettingBoy/gpgpu-rs

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
