Our great sponsors
-
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
git clone https://github.com/oobabooga/text-generation-webui
git clone https://github.com/oobabooga/GPTQ-for-LLaMa
Have You tried https://mlc.ai/mlc-llm/ ? It uses Vulkan instead of CUDA, so running it is much easier, but only models compiled for MLC will work with it.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.