Our great sponsors
-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
There's an entire GitHub package being actively developed for this reason. If your data has labels and you can train a classifier on it or get embeddings, cleanlab has a bunch of useful features to analyze the noise in your dataset. Taken from the repo readme:
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- [Research] Detecting Errors in Numerical Data via any Regression Model
- Detecting Errors in Numerical Data via Any Regression Model
- cleanlab v2.5 now supports all major ML tasks (adds regression, object detection, and image segmentation)
- Enhancing Product Analytics and E-commerce Business
- Databricks users can now automatically correct data and improve ML models