Top 5 Python training-data Projects
-
Project mention: Harnessing Weak Supervision to Isolate Sign Language in Crowded News Videos | news.ycombinator.com | 2024-08-15
Hello everyone, we are trying to make a large dataset for Sign Language translation, inspired by BSL-1K [1]. As part of cleaning our collected videos, we use a nice technique for aggregating heuristic labels [2]. We thought it was interesting enough to share with people on here.
[1] https://www.robots.ox.ac.uk/~vgg/research/bsl1k/
[2] https://github.com/snorkel-team/snorkel
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
diffgram
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
-
-
compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning. (by alteryx)
-
BingImageAITrainer
A tool for generating diverse synthetic training images using Bing Image Creator to facilitate the training of AI/ML image models.
Python training-data discussion
Index
What are some of the best open-source training-data projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | snorkel | 5,792 |
2 | diffgram | 1,835 |
3 | skweak | 917 |
4 | compose | 495 |
5 | BingImageAITrainer | 3 |