-
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
-
text-generation-webui
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, GPT-Neo, and Pygmalion. (by TheTerrasque)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
For some background, running a GTX 1080 with 8GB of vram on Windows. Installed using a combination of the one-click installer, the How to guide by /u/Technical_Leather949, and using the pre-compiled wheel by Brawlence (to avoid having to install visual studio). I've downloaded the latest 4bit LLaMa 7b 4bit model, and the tokenizer/config files.
git clone https://github.com/TheTerrasque/text-generation-webui.git
One alternative you could try.. I've set up a docker environment to build things and set it up. It would require you to install some tools if you don't have: Git and Docker Desktop for Windows.
Related posts
-
Github Sponsor Sebastián RamÃrez Python programmer
-
Sequoia: Serving exact Llama2-70B on an RTX4090 with 1/2 s per token
-
Ask HN: Have you coded any productivity software just for yourself?
-
LFG is a CLI tool using llama3 to help you find terminal commands
-
OpenAdapt: AI-First Process Automation with Large Multimodal Models