Ask HN: Who is using small OS LLMs in production?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • newhope

    NewHope: Harnessing 99% of GPT-4's Programming Capabilities

  • Interestingly, they have withdrawn their model because they discovered that test data leaked into the training data. Quantised versions are still available on huggingface from others though.

    https://github.com/SLAM-group/newhope

  • can-ai-code

    Self-evaluating interview for AI coders

  • Yeah it seemed suspiciously high for HumanEval and it only ranks 14th for JS and 7th for Python on other benchmarks now: https://huggingface.co/spaces/mike-ravkine/can-ai-code-resul...

    WizardCoder is a bit of a problem since it's not llama 1/2 based but is its own 15B model and as such the support for it in anything practical is near nonexistent. WizardLM v1.2 looks like it may be worth checking out.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: FileKitty – Combine and label text files for LLM prompt contexts

    5 projects | news.ycombinator.com | 1 May 2024
  • Ask HN: Freelancer? Seeking freelancer? (May 2024)

    1 project | news.ycombinator.com | 2 May 2024
  • More Low-Bit LLMs

    2 projects | news.ycombinator.com | 2 May 2024
  • Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments

    1 project | news.ycombinator.com | 2 May 2024
  • Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping

    4 projects | dev.to | 2 May 2024