I noticed that Facebook's reference code https://github.com/facebookresearch/llama/blob/main/llama/model.py is surprisingly short and easy to understand, so I decided to try to replicate it in Rust.
The project that inspired me was this one: https://github.com/ggerganov/llama.cpp. I decided to be a rebel and try different technologies, though, since the ML community is all about using PyTorch or CUDA or Apple Silicon or something.
Here's my implementation if you're interested: https://github.com/philpax/ggllama
OP's implementation runs OpenCL kernels on the GPU, not Rust. You could use rust-gpu to re-implement the kernels in Rust, which would then be compiled to SPIR-V and executed via Vulkan.
I also noticed someone shared https://github.com/setzer22/llama-rs/ on this subreddit. I predict there will be more implementations; I followed Stable Diffusion developments back when it was released, and it didn't take very long for the community to optimize the crap out of it.