uda
squirrel-core
uda | squirrel-core | |
---|---|---|
2 | 1 | |
2,153 | 279 | |
0.0% | 0.7% | |
0.0 | 5.6 | |
over 2 years ago | 5 days ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
uda
-
BERT models: how resilient are they to typos?
Another thought is to do some data augmentation using back-translation, a la https://arxiv.org/abs/1904.12848
-
A Visual Survey of Data Augmentation in NLP
The words that replaces the original word are chosen by calculating TF-IDF scores of words over the whole document and taking the lowest ones. You can refer to the code implementation for this in the original paper here.
squirrel-core
-
[P] Squirrel: A new OS library for fast & flexible large-scale data loading
Today we open-sourced Squirrel, a data infrastructure library that my colleagues and I have been working on over the past 1.5 years: https://github.com/merantix-momentum/squirrel-core
What are some alternatives?
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
squirrel-datasets-core - Squirrel dataset hub
SSL4MIS - Semi Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.
talking-head-anime-3-demo - Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
nlpaug - Data augmentation for NLP
ML-YouTube-Courses - 📺 Discover the latest machine learning / AI courses on YouTube.
clip-as-service - 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
PDEBench - PDEBench: An Extensive Benchmark for Scientific Machine Learning
contractions - Fixes contractions such as `you're` to `you are`
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
speaking_with_plato - Exploring Plato's philosophy with AI - A Data Spiral blog article