picoGPT
the-algorithm-ml
picoGPT | the-algorithm-ml | |
---|---|---|
7 | 36 | |
3,081 | 9,912 | |
- | 0.5% | |
1.9 | 10.0 | |
about 1 year ago | 8 months ago | |
Python | Python | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
picoGPT
-
Understanding Automatic Differentiation in 30 lines of Python
In that case, you might also enjoy https://jaykmody.com/blog/gpt-from-scratch/
(here's the raw code: https://github.com/jaymody/picoGPT/blob/main/gpt2.py)
-
Transformers from Scratch
I wrote a minimal implementation in NumPy here (the forward pass code is only 40 lines): https://github.com/jaymody/picoGPT
Although this is for a decoder-only transformer (aka GPT) and doesnt include the encoder part.
- FLaNK Stack Weekly 3 April 2023
-
GPT-4 Says an Open-Source Chatbot Vicuna Reaches 90% ChatGPT Quality
Take a look at https://github.com/jaymody/picoGPT/blob/a750c145ba4d09d57648...
Yes, this is GPT-2 not 4 and it‘s not the Chat, only the model and it‘s basically only the inference part, not the training loop and it‘s somewhat simplified.
Still, take a good look.
That‘s essentially what it is and a single sheet of paper.
There is nothing specifically about language in „language model“, we just call it that. Better to call it just LLM.
Nobody knows exactly what it learns, although there would be ways to poke around given some research programs. But it seems like the interest in that is limited currently, everyone is busy with improving it or with applications.
Perhaps the answer is that we overestimated what a mind is. It‘s like we used to ask what life is and it turned out that there is nothing special about life, not even the DNA is controlling anything. It‘s merely a chemical process, even though a complex process.
-
u/functor7 explains why AIs like ChatGPT do not "understand" their subject
(The hardest part was just designing a math function that has the capability of getting good at this game, but when all is said and done, it need not be a whole lot of code).
- PicoGPT: An unnecessarily tiny implementation of GPT-2 in NumPy
- picoGPT: An unnecessarily tiny implementation of GPT-2 in NumPy
the-algorithm-ml
-
Scammers posing as customer service agents on X as companies leave platform
I said “parts of the recommender system code.”
This is the kind of highly emotional reaction that’s not helpful.
Yes, I am quite familiar with building ML models, both training and building my own for which I’ve been paid large sums of money, and I’m here to tell you that you don’t know what you’re taking about.
There’s so much more information about an ML system than just the trained model that is important for understanding the effects of the system on a society, and its legal, ethical, and social ramifications.
Just seeing the type of RS being used, the ranking approach, and the information on SimClusters is enough for RAI folks to start to understand the ecosystem effects and how that can show up downstream in social effects.
https://blog.twitter.com/engineering/en_us/topics/open-sourc...
- Twitter's Recommendation Algorithm
-
AOC said Elon Musk put his 'finger on the scale' during Turkey's presidential election and is 'concerned' it will set a precedent for the 2024 US election
Blog summarising the change: https://blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm
-
Discussion Thread
People who don't share your interests (or at least what Twitter thinks your interests are). This blog post explains it in detail.
-
Twitter's For You Recommendation Algorithm
Twitter's announcement | Main GitHub Repo | ML GitHub Repo | Engineering Blog Post
- FLaNK Stack Weekly 3 April 2023
-
New York Times says it won't pay for Twitter verified check mark
where? I searched through the repo and couldn't find it.
- Analysis of Twitter algorithm code reveals social medium down-ranks tweets about Ukraine
What are some alternatives?
gpt4all - gpt4all: run open-source LLMs anywhere
the-algorithm
glances - Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Finagle - A fault tolerant, protocol-agnostic RPC system
taskwarrior - Taskwarrior - Command line Task Management
cointop - A fast and lightweight interactive terminal based UI application for tracking cryptocurrencies 🚀
ctop - Top-like interface for container metrics
Tensor-Puzzles - Solve puzzles. Improve your pytorch.
Apollo-11 - Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.
exiftool - ExifTool meta information reader/writer
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.