|5 days ago||4 days ago|
|BSD 3-clause "New" or "Revised" License||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
We haven't tracked posts mentioning kss yet.
Tracking mentions began in Dec 2020.
BetterTransformer: PyTorch-native free-lunch speedups for Transformer-based models
3 projects | reddit.com/r/MachineLearning | 22 Nov 2022
In order to support BetterTransformer with the canonical Transformer models from Transformers library, an integration was done with the open-source library Optimum as a one-liner:
Semantic Search with SQLite
2 projects | reddit.com/r/Python | 22 Nov 2022
Sounds good. In terms of the theory behind the models, transformers and sentence-transformers are best projects to take a look at.
[R] RWKV-4 7B release: an attention-free RNN language model matching GPT-J performance (14B training in progress)
2 projects | reddit.com/r/MachineLearning | 17 Nov 2022
1 week of Stable Diffusion
8 projects | news.ycombinator.com | 30 Aug 2022
Basically stuff a 32 bit value into an 8 bit value (and lose precision).
Apparently it doesn't affect the results significantly.
FauxPilot – an open-source GitHub Copilot server
4 projects | news.ycombinator.com | 2 Aug 2022
Thank you for sharing the command for finetuning! Is it possible to share your ds_config.json? I tried to finetune the 2B model on A100 (40GB) using your command, but got a CUDA out of memory error. The ds_config I used was the one from huggingface (https://github.com/huggingface/transformers/blob/main/tests/...).
[P] BART denoising language modeling in JAX/Flax
3 projects | reddit.com/r/MachineLearning | 1 Aug 2022
Due to the high demand in implementation for pretraining BART. I created an pretraining script for BART in JAX/Flax. Got approvals to merge into huggingface/transformers. I will archive this repo once it is merged.
Colossal-AI Seamlessly Accelerates Large Models at Low Costs with Hugging Face
2 projects | reddit.com/r/artificial | 14 Jul 2022
Portal Project address: https://github.com/hpcaitech/ColossalAI Reference https://arxiv.org/abs/2202.05924v2 https://arxiv.org/abs/2205.11487 https://github.com/features/copilot https://github.com/huggingface/transformers https://www.forbes.com/sites/forbestechcouncil/2022/03/25/six-ai-trends-to-watch-in-2022/?sh=4dc51f82be15 https://www.infoq.com/news/2022/06/meta-opt-175b/
[P] A Simpler @PyTorch Annotated Implementation of EleutherAI's 20B Language Model GPT-NeoX.
3 projects | reddit.com/r/MachineLearning | 23 Apr 2022
There is also a pull request on huggingface transformers for GPT-NeoX-20B if anyone is interested: https://github.com/huggingface/transformers/pull/16659. It has worked for me
[D] NLP has HuggingFace, what does Computer Vision have?
7 projects | reddit.com/r/MachineLearning | 19 Apr 2022
image classification: ViT, DeiT, BEiT, Swin Transformer, PoolFormer, ResNet, RegNet, ConvNeXT, Perceiver, ImageGPT, VAN. Check out the official example scripts, example notebooks.7 projects | reddit.com/r/MachineLearning | 19 Apr 2022
What are some alternatives?
fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
transformer-pytorch - PyTorch Implementation of "Attention Is All You Need"
Swin-Transformer-Tensorflow - Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)
OpenNMT-py - Open Source Neural Machine Translation in PyTorch
huggingface_hub - All the open source things related to the Hugging Face Hub.
faiss - A library for efficient similarity search and clustering of dense vectors.
gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
lm-scorer - 📃Language Model based sentences scoring library
sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.
kogpt - KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
aitextgen - A robust Python tool for text-based AI training and generation using GPT-2.