Our great sponsors
-
Be warned, NON_BLOCKING streams do not fully synchronize with sync host to device copies. They are not guaranteed to actually finish by the time they return. Meaning its possible to initiate a copy, then initiate a kernel launch, and have the copy be unfinished by the time the kernel is launched. This caused so many confusing bugs that i personally decided to stop using NON_BLOCKING altogether in rust-cuda. https://github.com/Rust-GPU/Rust-CUDA/issues/15
-
Awesome, I added an issue here https://github.com/coreylowman/dfdx/issues/597. We can discuss more there! The first step will just be adding the device and implementing tensor creation methods for it.
-
SonarQube
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
-
All the public methods and modules should be documented with example snippets in docs.rs (https://docs.rs/dfdx/latest/dfdx/). What are you looking at that doesn't have that?
Related posts
- rapl: Rank Polymorphic array library for Rust.
- Tensor shapes with both const generic and run time dimensions
- Oxidizing Machine Learning
- dfdx v0.9.0 - nightly convs & transformers, broadcasting/reducing/selecting from any axis, and more!
- llm: a Rust crate/CLI for CPU inference of LLMs, including LLaMA, GPT-NeoX, GPT-J and more