reweight-gpt
Reweight GPT - a simple neural network using transformer architecture for next character prediction (by hunar4321)
ML-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science (by jonkrohn)
reweight-gpt | ML-foundations | |
---|---|---|
1 | 1 | |
51 | 3,959 | |
- | 2.7% | |
6.3 | 6.2 | |
over 1 year ago | 5 months ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
reweight-gpt
Posts with mentions or reviews of reweight-gpt.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-05-01.
-
[Research] An alternative to self-attention mechanism in GPT
Instead of self attention, I tried to generate the self-attention matrix directly using lateral connections among the inputs. The method is like LSTM but it gates all the past inputs using separate gates for each input (it can be parallelized). It's very easy to implement the method into the current GPT architectures. You just remove the attention part and replace it with learnable weights. Her is a working implementation (around100 lines!): Code: https://github.com/hunar4321/reweight-gpt In my experience, it learns very well and it can super-pass the self-attention mechanism if the number of the parameters are matched. (I tested it on small datasets for next character prediction. I haven't systematically compared these two methods yet).
ML-foundations
Posts with mentions or reviews of ML-foundations.
We have used some of these posts to build our list of alternatives
and similar projects.
-
Worried about Calculus
As others have said, you won't need calculus immediately, but it's important that you make a good attempt at learning up to Calc3. I also didn't have a math heavy undergrad so it took a lot of self-study for me, but it's possible. Simulation has a great math boot camp at the beginning to review everything but you'll want to be prepped with Calc before that because that class is all calculus based probability. Some other good resources are the 3Blue1Brown videos on YouTube. They have a great series for both calc & linear algebra to talk through all the intuition with visuals. I also really like John Krohns series because you code through the math which is very applicable for us in this program. I only did his linear Algebra, but he has a whole series with Calc and probability, too. https://github.com/jonkrohn/ML-foundations
What are some alternatives?
When comparing reweight-gpt and ML-foundations you can also consider the following projects:
repeng - A library for making RepE control vectors
Mathematics-for-Machine-Learning-and-Data-Science-Specialization-Coursera - Mathematics for Machine Learning and Data Science Specialization - Coursera - deeplearning.ai - solutions and notes
ai_story_scale - The AI story scale (AISS): A human rating scale for texts written with generative language models.
the-elements-of-statistical-learning - My notes and codes (jupyter notebooks) for the "The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani and Jerome Friedman
numerical-linear-algebra - Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
ITC - Computer Science coursework and projects at Tec de Monterrey 👨🎓