llm.c
LLM training in simple, raw C/CUDA (by karpathy)
Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models" (by KimMeen)
llm.c | Time-LLM | |
---|---|---|
12 | 1 | |
20,599 | 924 | |
- | - | |
9.8 | 7.4 | |
5 days ago | 12 days ago | |
Cuda | Python | |
MIT License | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llm.c
Posts with mentions or reviews of llm.c.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-06-10.
-
NanoGPT: The simplest, fastest repository for training medium-sized GPTs
See also (from the same author) https://github.com/karpathy/llm.c — "LLMs in simple, pure C/CUDA with no need for 245MB of PyTorch or 107MB of cPython."
-
Grokfast: Accelerated Grokking by Amplifying Slow Gradients
> They are trying to innovate in the idea space and are probably quite compute constrained.
Training a GPT-2 sized model costs ~$20 nowadays in respect to compute: https://github.com/karpathy/llm.c/discussions/481
- Reproduce GPT-2 (124M) in llm.c in 90 minutes for $20
- Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
- Llm.c State of the Union
-
Ask HN: Yo Nephew, in E. Africa, wants to train an LLM with on disk Wikipedia
----------------------------------
https://github.com/karpathy/llm.c
It is only 1,000 lines of easy to read C code.
-
llm.c is now down to 26.2ms/iteration, matching PyTorch
I'm not grasping the significance of this. I see a repo called llm.c, https://github.com/karpathy/llm.c, is the main selling point that PyTorch is a large download and this isn't... or are there other problems with PyTorch? I thought the appeal of PyTorch is its accessibility and documentation.
- Layernorm
- karpathy/llm.c
Time-LLM
Posts with mentions or reviews of Time-LLM.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-04-08.
-
karpathy/llm.c
Yes general LLM models can be used for time series forecasting:
https://github.com/KimMeen/Time-LLM
What are some alternatives?
When comparing llm.c and Time-LLM you can also consider the following projects:
richard - Richard is gaining power