Gibberish with LLaMa 7B 4bit

This page summarizes the projects mentioned and recommended in the original post on /r/Oobabooga.

  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  • For some background: I'm running a GTX 1080 with 8 GB of VRAM on Windows. I installed using a combination of the one-click installer, the how-to guide by /u/Technical_Leather949, and the pre-compiled wheel by Brawlence (to avoid having to install Visual Studio). I've downloaded the latest 4-bit LLaMA 7B model and the tokenizer/config files.

  • text-generation-webui

    A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, GPT-Neo, and Pygmalion. (by TheTerrasque)

  • git clone https://github.com/TheTerrasque/text-generation-webui.git

  • docker

    Docker - the open-source application container engine (by microsoft)

  • One alternative you could try: I've set up a Docker environment that builds and sets everything up. It requires installing a couple of tools if you don't already have them: Git and Docker Desktop for Windows.
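
Before trying the Docker route, it may help to confirm that both prerequisites are actually on your PATH. The following is only a minimal sketch, not part of the original post: the function name `missing_tools` is hypothetical, and it assumes the tools are exposed under the executable names `git` and `docker`.

```python
import shutil

# Assumption: Git and Docker Desktop expose these executable names on PATH.
REQUIRED_TOOLS = ("git", "docker")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names in `tools` that cannot be found on PATH.

    `which` is injectable so the check can be exercised without
    the tools being installed.
    """
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("Git and Docker were found on PATH.")
```

If nothing is reported missing, the `git clone` command listed above should be runnable from the same terminal.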

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Github Sponsor Sebastián Ramírez Python programmer

    2 projects | dev.to | 5 May 2024
  • Sequoia: Serving exact Llama2-70B on an RTX4090 with 1/2 s per token

    1 project | news.ycombinator.com | 5 May 2024
  • Ask HN: Have you coded any productivity software just for yourself?

    1 project | news.ycombinator.com | 5 May 2024
  • LFG is a CLI tool using llama3 to help you find terminal commands

    1 project | news.ycombinator.com | 5 May 2024
  • OpenAdapt: AI-First Process Automation with Large Multimodal Models

    1 project | news.ycombinator.com | 5 May 2024