Damn, I was so satisfied with my 3080 with 10GB of VRAM until I found this subreddit.

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

  • For exllama (https://github.com/turboderp/exllama) the instructions are on the post itself.

  • KoboldAI

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. (by 0cc4m)

  • koboldcpp

    A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

  • Just about any llama-based model can be run purely on your CPU, or split between your CPU and GPU. Download KoboldCPP, assign as many layers to your GPU as it can handle, and let the CPU and system RAM handle the rest.

  • gpt4all

    gpt4all: run open-source LLMs anywhere

  • Specifically with this project: https://github.com/nomic-ai/gpt4all

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Simple text to short video script using moviepy, Python

    1 project | news.ycombinator.com | 20 May 2024
  • Groqbook: Generate entire books in seconds using Groq and Llama3

    1 project | news.ycombinator.com | 20 May 2024
  • Fewer if statements with Nothing instead of None

    1 project | news.ycombinator.com | 20 May 2024
  • Using Google Cloud Firestore with Django's ORM

    3 projects | dev.to | 20 May 2024
  • FLaNK-AIM: 20 May 2024 Weekly

    28 projects | dev.to | 20 May 2024