A fast llama2 decoder in pure Rust.
Why do you think that https://github.com/turboderp/exllama is a good alternative to llama2.rs
A fast llama2 decoder in pure Rust.
Why do you think that https://github.com/turboderp/exllama is a good alternative to llama2.rs