Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
smartcat
ggml
- Run LLMs at home, BitTorrent‑style
https://github.com/philpax/ggml/blob/gguf-spec/docs/gguf.md#...
It is (IMO) a necessary and good change.
I only specified gguf because my 3090 cannot host a 70B model without offloading, except with exLlama's very new ~2-bit quantization.
- GGUF File Format Specification
- Meta: Code Llama, an AI Tool for Coding
While we're at it, the GGML file format has been deprecated in favor of GGUF.
https://github.com/philpax/ggml/blob/gguf-spec/docs/gguf.md
https://github.com/ggerganov/llama.cpp/pull/2398
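For anyone unsure which format a downloaded model file uses, here is a minimal sketch of a check. It relies on one detail of the GGUF spec linked above: a GGUF file starts with the four ASCII bytes `GGUF`, followed by a 32-bit version number. The function name `read_gguf_version` is hypothetical, and the sketch assumes a little-endian file; it is not part of ggml or llama.cpp.

```python
import struct

# Per the GGUF spec, a file begins with the 4-byte magic "GGUF",
# followed by an unsigned 32-bit version (little-endian assumed here).
GGUF_MAGIC = b"GGUF"


def read_gguf_version(path):
    """Return the GGUF version, or None if the file lacks the GGUF magic.

    Hypothetical helper for illustration only.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

A return value of `None` only means the file does not start with the GGUF magic. The sketch does not try to identify which older format it might be.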
What are some alternatives?
lmdeploy - LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
ollama-ui - Simple HTML UI for Ollama
chat.petals.dev - 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
codellama - Inference code for CodeLlama models
LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
llama-cpp-python - Python bindings for llama.cpp
godot-dodo - Finetuning large language models for GDScript generation.
ArchGPT - 🐕 ArchGPT is a source-code-management framework to enable a new meta-programming paradigm specially designed for Language-Model-Driven-Development (LMDD) i.e. the utilization of Large Language Models for automated software development.
aider - aider is AI pair programming in your terminal