Our great sponsors
-
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
This is great, but similar to GPT4All, it will likely be deemed unusable for any commercial or otherwise "legitimate" use cases since it's trained on OpenAI completions from sharegpt.com.
https://github.com/nomic-ai/gpt4all
As another tidbit of self-reflection, I asked it what license Vicuna was released under and it claimed MIT (it's actually Apache 2: https://github.com/lm-sys/FastChat/blob/main/LICENSE ) and when I asked it to tell me why it believed the project was MIT licensed it told me what the MIT license was and why a project would choose it :-/
I can appreciate there may be some nuance since the repo is actually named FastChat but the chatbot seems to be named Vicuna so ... tomato, tomahtoe?
Take a look at https://github.com/jaymody/picoGPT/blob/a750c145ba4d09d57648...
Yes, this is GPT-2 not 4 and it‘s not the Chat, only the model and it‘s basically only the inference part, not the training loop and it‘s somewhat simplified.
Still, take a good look.
That‘s essentially what it is and a single sheet of paper.
There is nothing specifically about language in „language model“, we just call it that. Better to call it just LLM.
Nobody knows exactly what it learns, although there would be ways to poke around given some research programs. But it seems like the interest in that is limited currently, everyone is busy with improving it or with applications.
Perhaps the answer is that we overestimated what a mind is. It‘s like we used to ask what life is and it turned out that there is nothing special about life, not even the DNA is controlling anything. It‘s merely a chemical process, even though a complex process.
Related posts
- Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean
- Observations on MLOps–A Fragmented Mosaic of Mismatched Expectations
- Python Day 9: Building Interactive Web Apps without HTML/CSS and JavaScript
- The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol
- This Week In Python