Pure Rust CPU and OpenCL implementation of the LLaMA language model

This page summarizes the projects mentioned and recommended in the original post on /r/rust.

  • rllama

    Rust+OpenCL+AVX2 implementation of LLaMA inference code

  • llama

    Inference code for Llama models

  • I noticed that Facebook's reference code https://github.com/facebookresearch/llama/blob/main/llama/model.py is surprisingly short and easy to understand. So I decided to try to replicate it in Rust.

  • llama.cpp

    LLM inference in C/C++

  • The project that inspired me was this one: https://github.com/ggerganov/llama.cpp, although I decided to be a rebel and try different technologies, since the ML community is all about using PyTorch or CUDA or Apple Silicon or something.

  • ggllama

    Discontinued. `ggllama` is a Rust port of ggerganov's llama.cpp.

  • Here's my implementation if you're interested: https://github.com/philpax/ggllama

  • rust-gpu

    🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

  • OP's implementation runs OpenCL kernels on the GPU, not Rust. You could use rust-gpu to re-implement the kernels in Rust, which are then compiled to SPIR-V and executed via Vulkan.

  • llm

    An ecosystem of Rust libraries for working with large language models

  • I also noticed someone shared https://github.com/setzer22/llama-rs/ on this subreddit. I predict there will be more implementations; I followed Stable Diffusion developments back when it was released and it didn't take very long for the community to optimize the crap out of it.
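
To make the rust-gpu suggestion above a little more concrete, here is a minimal sketch of what "a kernel written in Rust" might look like. It is plain CPU Rust, not rust-gpu code: it assumes that a rust-gpu port would wrap a per-element function like this in a compute-shader entry point and take `row` from the invocation ID. The names `matvec_row` and `matvec` are hypothetical and are not taken from rllama or any other project listed here.

```rust
/// Kernel body (hypothetical): computes one output element of a
/// matrix-vector product, with `matrix` stored row-major.
/// On a GPU, each invocation would handle one `row`.
fn matvec_row(matrix: &[f32], vector: &[f32], cols: usize, row: usize) -> f32 {
    let start = row * cols;
    matrix[start..start + cols]
        .iter()
        .zip(vector)
        .map(|(m, v)| m * v)
        .sum()
}

/// CPU stand-in for a GPU dispatch: runs the kernel body once per output row.
fn matvec(matrix: &[f32], vector: &[f32], rows: usize, cols: usize) -> Vec<f32> {
    assert_eq!(matrix.len(), rows * cols, "matrix must be rows * cols long");
    assert_eq!(vector.len(), cols, "vector must have one entry per column");
    (0..rows)
        .map(|row| matvec_row(matrix, vector, cols, row))
        .collect()
}
```

Calling `matvec(&[1.0, 2.0, 3.0, 4.0], &[1.0, 1.0], 2, 2)` returns a two-element vector, one dot product per row. Keeping the per-row body separate from the dispatch loop is the design point: only the loop would need replacing when moving from the CPU to a GPU backend.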

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • What's the coolest Rust project you've seen that made you go, 'Wow, I didn't know Rust could do that!'?

    5 projects | /r/rust | 3 Jul 2023
  • [rust-gpu] How do I run/build my own shaders locally?

    3 projects | /r/rust | 6 Jun 2023
  • Posh: Type-Safe Graphics Programming in Rust

    1 project | news.ycombinator.com | 6 Jun 2023
  • Introducing posh: Type-Safe Graphics Programming in Rust

    6 projects | /r/rust | 4 Jun 2023
  • Looking for high level GPU computing crate

    3 projects | /r/rust | 25 May 2023