Metric | goslide | SLIDE
---|---|---
Mentions | 1 | 3
Stars | 39 | 475
Growth | - | -0.4%
Activity | 0.0 | 0.0
Last commit | about 4 years ago | over 2 years ago
Language | Go | -
License | BSD 2-clause "Simplified" License | -
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
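The definitions above can be sketched in code. The exact formulas behind these numbers are not given here, so everything below is an assumption for illustration: the function names are hypothetical, the recency weighting is modeled as exponential decay with a 30-day half-life, and the 0-10 rating is modeled as a percentile rank across all tracked projects.

```python
def month_over_month_growth(prev_stars, curr_stars):
    """Percent change in stars between two monthly snapshots.

    Returns None when there is no previous count to compare against.
    """
    if prev_stars == 0:
        return None
    return (curr_stars - prev_stars) / prev_stars * 100.0


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights, where recent commits count more.

    Assumption: a commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rating(score, all_scores):
    """Map a raw score to a 0-10 rating by percentile rank.

    Assumption: the rating is ten times the fraction of tracked
    projects whose raw score is strictly lower.
    """
    if not all_scores:
        return 0.0
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


if __name__ == "__main__":
    # A project going from 200 to 210 stars in a month.
    print(month_over_month_growth(200, 210))
    # One commit today and one commit 30 days ago.
    print(weighted_commit_score([0, 30]))
    # A project whose raw score beats 90 of 100 tracked projects.
    print(activity_rating(90, list(range(100))))
```

Under this model, a project with no recent commits gets a raw score near zero and lands at the bottom of the ranking, which is consistent with an activity of 0.0 for repositories last touched years ago.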
- Yandex opensources 100B parameter GPT-like model
That's pretty much what SLIDE [0] does. The driver was achieving performance parity with GPUs for CPU training, but presumably the same could apply to running inference on models too large to load into consumer GPU memory.
[0] https://github.com/RUSH-LAB/SLIDE
- [R] CPU algorithm trains deep neural nets up to 15 times faster than top GPU trainers
- CPU-based algorithm trains deep neural nets up to 15 times faster than top GPU
What are some alternatives?
Gorgonia - Gorgonia is a library that helps facilitate machine learning in Go.
YaLM-100B - Pretrained language model with 100B parameters
lc0 - The rewritten engine, originally for tensorflow. Now all other backends have been ported here.
olivia - 💁‍♀️ Your new best friend powered by an artificial neural network
gpt-neox - An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Stockfish - A free and strong UCI chess engine
HashingDeepLearning - Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
mesh-transformer-jax - Model parallel transformers in JAX and Haiku