reweight-gpt
Reweight GPT - a simple neural network using transformer architecture for next character prediction (by hunar4321)
numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course (by fastai)
reweight-gpt | numerical-linear-algebra | |
---|---|---|
1 | 6 | |
51 | 10,465 | |
- | 0.4% | |
6.3 | 0.0 | |
over 1 year ago | about 1 year ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | - |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
reweight-gpt
Posts with mentions or reviews of reweight-gpt.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-05-01.
-
[Research] An alternative to self-attention mechanism in GPT
Instead of self attention, I tried to generate the self-attention matrix directly using lateral connections among the inputs. The method is like LSTM but it gates all the past inputs using separate gates for each input (it can be parallelized). It's very easy to implement the method into the current GPT architectures. You just remove the attention part and replace it with learnable weights. Her is a working implementation (around100 lines!): Code: https://github.com/hunar4321/reweight-gpt In my experience, it learns very well and it can super-pass the self-attention mechanism if the number of the parameters are matched. (I tested it on small datasets for next character prediction. I haven't systematically compared these two methods yet).
numerical-linear-algebra
Posts with mentions or reviews of numerical-linear-algebra.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-12-29.
-
I'm a 42-years-old librarian whithout any math background and I'm willing to learn
If you really like to dig into math, I liked the Udacity course on Intro to Deeplearning with Pytorch. Also, the Stanford course CS231n Convolutional Neural Networks for Visual Recognition is a good place to understand some basics. Other two courses to get you jumpstarted are Practical Deep Learning for Coders and Linear Algebra Course by FastAI
- Hi, what are the advanced courses/books in machine learning and neural nets? And where do I find them?
-
Math for Machine Learning!
I've also bookmarked Fast.Ai Computational Linear Algebra for Coders. https://github.com/fastai/numerical-linear-algebra/blob/master/README.md
-
Textbook for computer algebra using Python?
In that case, I would probably be temped to teach a numerical methods of linear algebra course using NumPy / Numba. Something like https://github.com/fastai/numerical-linear-algebra or https://pythonnumericalmethods.berkeley.edu/notebooks/Index.html
- Interactive Linear Algebra Text Book
-
OPEN SOURCE COMPUTER SCIENCE CURRICULUM
Computation Linear Algebra Lectures Study Material To do after completing curricula.
What are some alternatives?
When comparing reweight-gpt and numerical-linear-algebra you can also consider the following projects:
ai_story_scale - The AI story scale (AISS): A human rating scale for texts written with generative language models.
cornell-cs5785-2020-applied-ml - Teaching materials for the applied machine learning course at Cornell Tech (online edition)
repeng - A library for making RepE control vectors
social-perception - Studying sociopolitical attitudes and moving the human perspective using psychographic and sociodemographic data from the European Social Survey.
ML-foundations - Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
stacks-project - Repository for the Stacks Project