Multi-GPU questions

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

exllama

64 2,631 9.0 Python

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Exllama for example uses buffers on each card that reduce the amount of VRAM available for model and context, see here. https://github.com/turboderp/exllama/issues/121

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

HuggingFace hacked – Space secrets leak disclosure

1 project | news.ycombinator.com | 1 Jun 2024
Shellgpt: Chat with LLM in your terminal, be it shell generator, story teller

1 project | news.ycombinator.com | 1 Jun 2024
Omost: A project to convert LLM's coding capability to image generation

1 project | news.ycombinator.com | 31 May 2024
Take control! Run ChatGPT and Github Copilot yourself!

3 projects | dev.to | 31 May 2024
The DevRel Digest May 2024: Documentation and the Developer Journey

1 project | dev.to | 31 May 2024

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA Post date: 9 Jul 2023

exllama

Scout Monitoring

Related posts

HuggingFace hacked – Space secrets leak disclosure

Shellgpt: Chat with LLM in your terminal, be it shell generator, story teller

Omost: A project to convert LLM's coding capability to image generation

Take control! Run ChatGPT and Github Copilot yourself!

The DevRel Digest May 2024: Documentation and the Developer Journey