How to run this service with a local GPU?

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

Basic-UI-for-GPT-J-6B-with-low-vram

4 113 0.0 Jupyter Notebook

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.

You need a lot of VRAM to run the AI models, scaling somewhat with the amount of parameters a model uses. The most advanced model Pygmalion has is 6 billion parameters, which requires a minimum of 16GB of VRAM to run locally at decent speeds. There are methods of running 6b locally on low VRAM machines as listed here: https://github.com/arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram but even then, the generations would be excruciatingly slow, and the lowest VRAM card used with this method has 6GB of VRAM.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Tesla M40 and GPT-J-6B

1 project | /r/KoboldAI | 8 Aug 2021
How is any of this even possible?

1 project | /r/GPT3 | 21 Jul 2021
GPT-J 6B locally on my computer

1 project | /r/KoboldAI | 25 Jun 2021
WebGPT: GPT Model on the Browser with WebGPU

1 project | news.ycombinator.com | 1 Apr 2024
WebGPT: Run GPT model on the browser with WebGPU

1 project | news.ycombinator.com | 12 Aug 2023

How to run this service with a local GPU?

This page summarizes the projects mentioned and recommended in the original post on /r/PygmalionAI
gpt-neo Gpt Transformers
Post date: 27 Jan 2023

Basic-UI-for-GPT-J-6B-with-low-vram

InfluxDB

Related posts

Tesla M40 and GPT-J-6B

How is any of this even possible?

GPT-J 6B locally on my computer

WebGPT: GPT Model on the Browser with WebGPU

WebGPT: Run GPT model on the browser with WebGPU

How to run this service with a local GPU?

This page summarizes the projects mentioned and recommended in the original post on /r/PygmalionAI gpt-neo Gpt Transformers Post date: 27 Jan 2023

Basic-UI-for-GPT-J-6B-with-low-vram

InfluxDB

Related posts

Tesla M40 and GPT-J-6B

How is any of this even possible?

GPT-J 6B locally on my computer

WebGPT: GPT Model on the Browser with WebGPU

WebGPT: Run GPT model on the browser with WebGPU

This page summarizes the projects mentioned and recommended in the original post on /r/PygmalionAI
gpt-neo Gpt Transformers
Post date: 27 Jan 2023