How to run Pygmalion on 4.5GB of VRAM with full context size.

text-generation-webui

877 37,723 9.9 Python

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Get rid of everything you have and just use the installer I linked HERE. It'll get everything for you, up to date.

GPTQ-for-LLaMa

19 130 7.7 Python

4 bits quantization of LLaMa using GPTQ (by oobabooga)

I'm just throwing it out there, but I had to use oobabooga's fork of GPTQ-for-LLaMa and make sure I was on the cuda branch (https://github.com/oobabooga/GPTQ-for-LLaMa.git).
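As a rough illustration of the comment above, the sketch below builds the `git clone` command for that fork on its `cuda` branch. The URL and branch name come from the comment; the destination directory `repositories/GPTQ-for-LLaMa` and the helper names are assumptions made for this example, not something the original post specifies.

```python
import shlex
import subprocess

# Fork URL and branch name are taken from the comment above.
REPO_URL = "https://github.com/oobabooga/GPTQ-for-LLaMa.git"
BRANCH = "cuda"


def clone_command(dest: str) -> list[str]:
    """Build a git command that clones only the chosen branch into dest."""
    return ["git", "clone", "--branch", BRANCH, "--single-branch", REPO_URL, dest]


def clone(dest: str) -> None:
    """Run the clone; raises CalledProcessError if git exits with an error."""
    subprocess.run(clone_command(dest), check=True)


# Hypothetical destination path, chosen for illustration only.
print(shlex.join(clone_command("repositories/GPTQ-for-LLaMa")))
```

Calling `clone(...)` runs the command; printing it first makes it easy to check the branch before anything is downloaded.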

GPTQ-for-LLaMa

1 0 8.2 Python

4 bits quantization of LLMs using GPTQ (by YellowRoseCx)

I can load it with GPTQ on a 6700 XT using a fork (https://github.com/YellowRoseCx/GPTQ-for-LLaMa). You can update the post for AMD users if you want.

bitsandbytes-rocm

4 38 8.8 Python

There are a lot of ROCm versions of bitsandbytes, for example this one: https://github.com/broncotc/bitsandbytes-rocm
The problem is compatibility with most of the requirements. Kobold does a better job than ooba in offering a more streamlined approach for AMD users.

bitsandbytes

61 5,649 9.4 Python

Accessible large language models via k-bit quantization for PyTorch.

Welcome to bitsandbytes. For bug reports, please submit your error trace to: https://github.com/TimDettmers/bitsandbytes/issues
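To make the link between k-bit quantization and the VRAM figure in the title concrete, here is a back-of-the-envelope sketch of how much memory the weights alone need at different bit widths. The 6-billion-parameter count is an assumption for illustration (this page does not state the model size), and the estimate ignores quantization metadata, activations, and the context cache, so real usage will be higher.

```python
def weight_memory_gib(n_params: int, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


# Assumption: a 6-billion-parameter model; the page does not state the size.
N_PARAMS = 6_000_000_000

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_gib(N_PARAMS, bits):.2f} GiB")
```

Under that assumption, going from 16-bit to 4-bit weights cuts the weight footprint to a quarter, which is what leaves room for the context within a few GB of VRAM.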

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

This page summarizes the projects mentioned and recommended in the original post on /r/PygmalionAI.
Post date: 2 Apr 2023
