Tiny inference-only implementation of LLaMA
Why do you think that https://github.com/ColinRyan/Chat-Markup-Language is a good alternative to cria?