-
label-errors
🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I found these issues to be pretty interesting, yet I wasn't surprised. It's pretty well known that many common ML datasets exhibit thousands of errors.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
"I'm gonna make him a Neural Network he can't refuse" - Godfather of AI
-
How do we best practice preprocessing and data cleaning?
-
Show HN: 78% MNIST accuracy using GZIP in under 10 lines of code
-
Automated Data Quality at Scale
-
[N] Fine-Tuning OpenAI Language Models with Noisily Labeled Data (37% error reduction)