GPT-4 Says an Open-Source Chatbot Vicuna Reaches 90% ChatGPT Quality

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • gpt4all

    gpt4all: run open-source LLMs anywhere

  • This is great, but similar to GPT4All, it will likely be deemed unusable for any commercial or otherwise "legitimate" use cases since it's trained on OpenAI completions from sharegpt.com.

    https://github.com/nomic-ai/gpt4all

  • FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

  • As another tidbit of self-reflection, I asked it what license Vicuna was released under and it claimed MIT (it's actually Apache 2: https://github.com/lm-sys/FastChat/blob/main/LICENSE ) and when I asked it to tell me why it believed the project was MIT licensed it told me what the MIT license was and why a project would choose it :-/

    I can appreciate there may be some nuance since the repo is actually named FastChat but the chatbot seems to be named Vicuna so ... tomato, tomahtoe?

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • picoGPT

    An unnecessarily tiny implementation of GPT-2 in NumPy.

  • Take a look at https://github.com/jaymody/picoGPT/blob/a750c145ba4d09d57648...

    Yes, this is GPT-2 not 4 and it‘s not the Chat, only the model and it‘s basically only the inference part, not the training loop and it‘s somewhat simplified.

    Still, take a good look.

    That‘s essentially what it is and a single sheet of paper.

    There is nothing specifically about language in „language model“, we just call it that. Better to call it just LLM.

    Nobody knows exactly what it learns, although there would be ways to poke around given some research programs. But it seems like the interest in that is limited currently, everyone is busy with improving it or with applications.

    Perhaps the answer is that we overestimated what a mind is. It‘s like we used to ask what life is and it turned out that there is nothing special about life, not even the DNA is controlling anything. It‘s merely a chemical process, even though a complex process.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts