SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 active-learning Open-Source Projects
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
argilla
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
-
refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
awesome-open-data-centric-ai
Curated list of open source tooling for data-centric AI on unstructured data.
-
Encord Active
Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.
-
Active-Learning-as-a-Service
A scalable & efficient active learning/data selection system for everyone.
-
Bamboo
Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning. (by ZhangYuanhan-AI)
-
internet-explorer
Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset.
-
active_learning
Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training with torch's DDP.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: [Research] Detecting Annotation Errors in Semantic Segmentation Data | /r/MachineLearning | 2023-11-05We have feely open-sourced our new method for improving segmentation data, published a paper on the research behind it, and released a 5-min code tutorial. You can also read more in the blog if you'd like.
Project mention: Open-Source Data Collection Platform for LLM Fine-Tuning and RLHF | news.ycombinator.com | 2023-06-05I'm Dani, CEO and co-founder of Argilla.
Happy to answer any questions you might have and excited to hear your thoughts!
More about Argilla
GitHub: https://github.com/argilla-io/argilla
Project mention: Small-Text: Looking for Contributors (Active Learning, Text Classification, NLP) | /r/LanguageTechnology | 2023-05-21
Project mention: Launch HN: Encord (YC W21) – Unit testing for computer vision models | news.ycombinator.com | 2024-01-31We base our pricing on your user and consumption scale and would be happy to discuss this with you directly. Please feel free to explore the OS version of Active at https://github.com/encord-team/encord-active. Note that some features, such as natural language search using GPU accelerated APIs, are not included in the cloud version.
Hey HN! I'm super excited to share Markup with you, which is a totally free & open-source annotation tool that helps you transform unstructured text (e.g. news articles) into structured data that you can use for building, training, or fine-tuning ML models!
Check it out: https://github.com/samueldobbie/markup
Project mention: BayBE – A Bayesian Back End for Design of Experiments | news.ycombinator.com | 2023-12-06
Project mention: Internet Explorer: Targeted Representation Learning on the Open Web | news.ycombinator.com | 2023-10-09
Project mention: [P] EGSIS: Exploratory Graph-based Semi-supervised Image Segmentation | /r/MachineLearning | 2023-11-27
Some of these plugins were simpler than others. On one end, the Twilio automation plugin consists of a single Python file without bells and whistles. On the opposite extreme, plugins like Active Learning, which required multiple operators, caching, and special handling for many different scenarios. Plugins like Reverse Image Search and Concept Space Traversal were challenging in a different way, mostly because I am new to JavaScript. But that is for another day.
active-learning related posts
-
[P] EGSIS: Exploratory Graph-based Semi-supervised Image Segmentation
-
[P] EGSIS: Exploratory Graph-based Semi-supervised Image Segmentation
-
Internet Explorer: Targeted Representation Learning on the Open Web
-
Small-Text: Looking for Contributors (Active Learning, Text Classification, NLP)
-
Show HN: An annotation tool for ML and NLP
-
Internet Explorer: Targeted Representation Learning on the Open Web
-
Internet Explorer: Targeted Representation Learning on the Open Web
-
A note from our sponsor - SaaSHub
www.saashub.com | 11 May 2024
Index
What are some of the best open-source active-learning projects? This list will help you:
Project | Stars | |
---|---|---|
1 | cleanlab | 8,719 |
2 | vowpal_wabbit | 8,409 |
3 | argilla | 3,132 |
4 | modAL | 2,143 |
5 | refinery | 1,366 |
6 | adaptive | 1,114 |
7 | deep-active-learning | 758 |
8 | awesome-open-data-centric-ai | 678 |
9 | awesome-active-learning | 674 |
10 | 3d-bat | 580 |
11 | asreview | 561 |
12 | MONAILabel | 542 |
13 | small-text | 520 |
14 | Encord Active | 420 |
15 | markup | 232 |
16 | Active-Learning-as-a-Service | 210 |
17 | baybe | 182 |
18 | Bamboo | 161 |
19 | internet-explorer | 160 |
20 | active_learning | 52 |
21 | QDrant-NLP | 11 |
22 | egsis | 9 |
23 | active-learning-plugin | 6 |
Sponsored