-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
I just published a paper detailing this non-IID check and open-sourced its code in the cleanlab package. A single line of code will check for this and many other types of issues in your dataset.
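To make the idea of a non-IID check concrete, here is a minimal, standard-library sketch. It is not cleanlab's implementation and does not use its API. It assumes a simplified stand-in test: if points that are close in feature space also tend to be close in dataset order, the data were probably not collected independently. The function names (`mean_neighbor_index_gap`, `non_iid_p_value`) and the parameters `k` and `n_permutations` are hypothetical, chosen only for illustration.

```python
import random


def mean_neighbor_index_gap(features, k=3):
    """Average |i - j| over every point i and its k nearest neighbors j."""
    n = len(features)
    total, count = 0, 0
    for i in range(n):
        # Squared Euclidean distance from point i to every other point.
        dists = sorted(
            (sum((a - b) ** 2 for a, b in zip(features[i], features[j])), j)
            for j in range(n)
            if j != i
        )
        for _, j in dists[:k]:
            total += abs(i - j)
            count += 1
    return total / count


def non_iid_p_value(features, k=3, n_permutations=200, seed=0):
    """Permutation test (illustrative assumption, not cleanlab's statistic).

    A small p-value means feature-space neighbors sit unusually close
    together in dataset order, which hints at drift or clustering over time.
    """
    rng = random.Random(seed)
    observed = mean_neighbor_index_gap(features, k)
    shuffled = list(features)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # break any link between order and features
        if mean_neighbor_index_gap(shuffled, k) <= observed:
            hits += 1
    return (hits + 1) / (n_permutations + 1)


if __name__ == "__main__":
    # Hypothetical toy data: a feature that drifts steadily with row order.
    drifting = [(float(i),) for i in range(40)]
    print("p-value for drifting data:", non_iid_p_value(drifting, k=2))
```

The permutation test keeps the sketch dependency-free. The brute-force neighbor search is quadratic in the number of rows, so this form is only suitable for small examples.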
NOTE:
The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
[Research] Detecting Errors in Numerical Data via any Regression Model
-
Detecting Errors in Numerical Data via Any Regression Model
-
cleanlab v2.5 now supports all major ML tasks (adds regression, object detection, and image segmentation)
-
Enhancing Product Analytics and E-commerce Business
-
Databricks users can now automatically correct data and improve ML models