Tiny models for contextually coherent conversations?

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

  • Don't think so, there's an issue created around this https://github.com/lm-sys/FastChat/issues/925

  • RWKV-Runner

    A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • rwkv.cpp

    INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

  • mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

  • MLC-LLM Vicuna is a 7B model that fits into about 2gb of Vram and can make small talk with context. MLC LLM | Home

  • koboldcpp

    A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

  • As a bit of a fun fact, KoboldCpp also has support for non-LLaMA GGML models, which includes the RWKV conversion they just linked

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [Project] MLC LLM: Universal LLM Deployment with GPU Acceleration

    6 projects | /r/LocalLLaMA | 29 Apr 2023
  • Eagle 7B: Soaring past Transformers

    2 projects | news.ycombinator.com | 28 Jan 2024
  • Mixtral in Colab

    1 project | news.ycombinator.com | 7 Jan 2024
  • People who've used RWKV, whats your wishlist for it?

    9 projects | /r/LocalLLaMA | 9 Dec 2023
  • Ai on a android phone?

    2 projects | /r/LocalLLaMA | 8 Dec 2023