Our great sponsors
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I've been busy building a standard open-source library for data-centric AI: https://github.com/cleanlab/cleanlab/
In one-line of python, cleanlab can automatically: 1) find mislabeled data + train robust models 2) detect outliers 3) estimate consensus + annotator-quality for datasets labeled by multiple annotators 4) suggest which data is best to label or re-label next (active learning)
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- [Research] Detecting Errors in Numerical Data via any Regression Model
- Detecting Errors in Numerical Data via Any Regression Model
- cleanlab v2.5 now supports all major ML tasks (adds regression, object detection, and image segmentation)
- Enhancing Product Analytics and E-commerce Business
- Databricks users can now automatically correct data and improve ML models