-
cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
-
ydata-profiling
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
-
OpenRefine
OpenRefine is a free, open-source power tool for working with messy data and improving it.
NOTE:
The number of mentions on this list counts mentions in common posts plus user-suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
[D] Major bug in Scikit-Learn's implementation of F-1 score
-
Contraction Clustering (RASTER): A fast clustering algorithm
-
Transformers as Support Vector Machines
-
How to Build and Deploy a Machine Learning model using Docker
-
Planning to get a laptop for ML/DL, is this good enough at the price point or are there better options at/below this price point?