Also, probably the easiest way to get started would be to install oobabooga's web UI (there are one-click installers for various operating systems), then pair it with a GPTQ-quantized (not GGML) model. You'll also want the smaller 4-bit file (i.e., the one without groupsize 128) where applicable, to avoid running into issues at longer context lengths. Here are the appropriate files for GPT4-X-Alpaca-30b and WizardLM-30B, which are both good choices.
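To see why the file without groupsize 128 is the safer pick, here is a rough back-of-the-envelope VRAM estimate for a 30B model. The per-group overhead figures (an fp16 scale plus a 4-bit zero point per group) are an assumption for illustration, not exact numbers from any loader:

```python
def model_vram_gb(n_params, bits_per_weight=4.0, groupsize=None):
    """Rough VRAM estimate for a quantized model's weights.

    Assumption: groupsize quantization adds one fp16 scale (16 bits)
    and one 4-bit zero point per group of weights.
    """
    bits = bits_per_weight
    if groupsize:
        bits += (16 + 4) / groupsize  # per-group scale + zero-point overhead
    return n_params * bits / 8 / 1e9

# 30B model: plain 4-bit vs. 4-bit with groupsize 128
print(round(model_vram_gb(30e9), 1))                 # ~15.0 GB
print(round(model_vram_gb(30e9, groupsize=128), 1))  # ~15.6 GB
```

That extra half-gigabyte or so matters on a 16 GB card, since the KV cache for the context window has to fit in whatever headroom remains.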
It's worth noting that you'll need a recent release of llama.cpp to run GGML models with GPU acceleration (here is the latest build for CUDA 12.1), and you'll need to install a recent CUDA version if you haven't already (here is the CUDA 12.1 toolkit installer -- mind, it's over 3 GB).
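Once you have a CUDA-enabled build, GPU acceleration is controlled by how many layers you offload. As a minimal sketch, here is how such a command line is assembled; the binary name and model filename are illustrative, but `--n-gpu-layers` and `-c` are the actual llama.cpp flags:

```python
import shlex

def llama_cpp_cmd(model_path, n_gpu_layers=35, ctx=2048, prompt="Hello"):
    """Assemble a llama.cpp command line with GPU offloading.

    --n-gpu-layers controls how many transformer layers are offloaded
    to the GPU; -c sets the context length. The ./main binary name and
    model path here are placeholders.
    """
    args = [
        "./main",
        "-m", model_path,
        "--n-gpu-layers", str(n_gpu_layers),
        "-c", str(ctx),
        "-p", prompt,
    ]
    return shlex.join(args)

print(llama_cpp_cmd("models/wizardlm-30b.ggmlv3.q4_0.bin"))
```

Start with a modest layer count and raise it until VRAM is nearly full; offloading more layers is faster, but overshooting causes out-of-memory errors.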
Here are the codebase and dataset for WizardLM: https://github.com/nlpxucan/WizardLM https://github.com/AetherCortex/Llama-X https://huggingface.co/datasets/victor123/evol_instruct_70k
Here are the codebase and dataset for WizardVicuna: https://github.com/melodysdreamj/WizardVicunaLM https://github.com/lm-sys/FastChat https://huggingface.co/datasets/RyokoAI/ShareGPT52K
There was a special release of Koboldcpp that features GPU offloading; it's a 418 MB file due to all the libraries needed to support CUDA. There are hints that it might be a one-off thing, but it'll at least work until the model formats change again.
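A rough heuristic for picking the layer count to offload, whether in Koboldcpp or llama.cpp, is to divide your spare VRAM by the approximate per-layer size of the model. This assumes layers are roughly equal in size and reserves some VRAM for the scratch buffers and KV cache; both are simplifications, so treat the result as a starting point:

```python
def layers_to_offload(vram_gb, n_layers, model_gb, reserve_gb=1.5):
    """Estimate how many transformer layers fit on the GPU.

    Heuristic only: assumes all layers are roughly the same size and
    reserves reserve_gb of VRAM for scratch buffers and the KV cache.
    """
    per_layer_gb = model_gb / n_layers
    usable = max(vram_gb - reserve_gb, 0)
    return min(n_layers, int(usable / per_layer_gb))

# e.g. a 30B 4-bit model (~15 GB, 60 layers) on an 8 GB card
print(layers_to_offload(8, 60, 15.0))   # about 26 layers
```

Anything that doesn't fit stays on the CPU, so partial offloading still helps, it just scales speed with the fraction of layers on the GPU.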