SaaSHub helps you find the best software and product alternatives Learn more →
Top 11 Python data-labeling Projects
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Project mention: Ask HN: Not a webdev, why are these sites so good? | news.ycombinator.com | 2024-06-18https://cleanlab.ai/
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Tools: Platforms like LangChain, Kern AI Refinery, and Langtail simplify testing, debugging, and optimizing prompts.
-
compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning. (by alteryx)
-
-
-
edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Project mention: Python Library for Structured Data Extraction via LLM | news.ycombinator.com | 2024-08-14Hey thanks for noticing - here's the MIT licensed library it's based on: https://github.com/expectedparrot/edsl
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
modzy-labelstudio-sample
Create training data labels from a production model with Modzy, Dropbox, and Label Studio
-
Python data-labeling discussion
Python data-labeling related posts
-
Ultimate guide to prompt engineering
-
Python Library for Structured Data Extraction via LLM
-
You Can't Have a Free Software AI Stack
-
How we used AI to automate stock sentiment classification
-
German's NLP startup Kern AI has raised €2.7M in seed funding to accelerate its recent growth
-
Why and how we started Kern AI (our seed funding announcement)
-
GPT and BERT: A Comparison of Transformer Architectures
-
A note from our sponsor - SaaSHub
www.saashub.com | 21 May 2025
Index
What are some of the best open-source data-labeling projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | cleanlab | 10,526 |
2 | doccano | 9,986 |
3 | refinery | 1,438 |
4 | compose | 505 |
5 | bbox-visualizer | 396 |
6 | hover | 326 |
7 | edsl | 240 |
8 | mutate | 151 |
9 | superpipe | 110 |
10 | modzy-labelstudio-sample | 18 |
11 | bunny-party | 10 |