AugLy
textaugment
Our great sponsors
AugLy | textaugment | |
---|---|---|
14 | 2 | |
4,900 | 370 | |
0.6% | 2.2% | |
6.0 | 4.6 | |
about 1 month ago | 2 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AugLy
-
Meta's A.I. exodus: Top talent quits as lab tries to keep pace with rivals
Their recent effort to generate training data for spotting stuff that includes unsanctioned narratives comes to mind. https://github.com/facebookresearch/AugLy
-
Next steps for after classification
Data augmentation is usually helpful: https://github.com/facebookresearch/AugLy
-
The hand-picked selection of the best Python libraries released in 2021
AugLy.
- Prefer volume or quality for BERT-based Text classification model
- Augly - An augmentation library for audio, image, video, and text from facebook
- [D] What's the best method to generate synthetic data for an image with text? Small dataset
- AugLy is opensourse now.
- Facebook is open-sourcing AugLy, a library that uses data augmentations to evaluate and improve ML models
-
Integration test: Complexity of privacy-preserving bird call bio-sensor for distributed ecological monitoring?
Some of the technologies which could be integrated include differential privacy, distributed online machine learning, misinformation resilience and multi-party computation, all within the context of smart contracts and bioinformatics.
-
[N] Facebook AI Open Sources AugLy: A New Python Library For Data Augmentation To Develop Robust Machine Learning Models
Facebook Blog: https://ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models/
textaugment
-
NLP augmentation models
I just came across this Python library. It has a bunch of dictionary-, backtranslation- and knowledge-based heuristics that should work most of the time:
- Prefer volume or quality for BERT-based Text classification model
What are some alternatives?
imgaug - Image augmentation for machine learning experiments.
word_forms - Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
speechbrain - A PyTorch-based Speech Toolkit
wordnet - Stand-alone WordNet API
PySyft - Perform data science on data that remains in someone else's server
scattertext - Beautiful visualizations of how language differs among document types.
BlenderProc - A procedural Blender pipeline for photorealistic training image generation
tfops-aug - TFOps-Aug: Implementation of policy-based image augmentation techniques based on TF2 Operations. All augmentations as efficient Tensorflow 2.11.0 operations. Easy integration into a tf.data API pipeline.
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]
magnitude - A fast, efficient universal vector embedding utility package.
evidently - Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
river - 🌊 Online machine learning in Python