PaLM-colossalai
PaLM-pytorch
PaLM-colossalai | PaLM-pytorch | |
---|---|---|
2 | 3 | |
192 | 821 | |
- | - | |
0.0 | 0.0 | |
about 2 years ago | about 2 years ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PaLM-colossalai
-
Implementing the Gargantuan Pathways with Colossal-AI, easy and efficient!
If you feel interested, do check out our GitHub repo: https://github.com/hpcaitech/PaLM-colossalai
- [P] Scalable PaLM implementation of PyTorch
PaLM-pytorch
-
Implementing the Gargantuan Pathways with Colossal-AI, easy and efficient!
We firstly reproduced the PaLM model architecture on one GPU according to the PaLM paper’s description. Here we have referred following repo for the reproduction: https://github.com/lucidrains/PaLM-pytorch
- [R] Google's 540B (Dense) model Pathways LLM, "Unlocks" new tasks proportional to scale
- Pathways Language Model (Palm): 540B Parameters for Breakthrough Perf
What are some alternatives?
ColossalAI - Making large AI models cheaper, faster and more accessible
nuwa-pytorch - Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
CoCa-pytorch - Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
x-transformers - A concise but complete full-attention transformer with a set of promising experimental features from various papers
PaLM-flax - Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
TimeSformer-pytorch - Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification