noisy-labels

Open-source projects categorized as noisy-labels

Top 6 noisy-label Open-Source Projects

  • cleanlab

    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

  • Project mention: [Research] Detecting Annotation Errors in Semantic Segmentation Data | /r/MachineLearning | 2023-11-05

    We have freely open-sourced our new method for improving segmentation data, published a paper on the research behind it, and released a 5-min code tutorial. You can also read more in the blog if you'd like.

  • Awesome-Learning-with-Label-Noise

    A curated list of resources for Learning with Noisy Labels

  • awesome-open-data-centric-ai

    Curated list of open source tooling for data-centric AI on unstructured data.

  • Encord Active

    Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.

  • Project mention: Launch HN: Encord (YC W21) – Unit testing for computer vision models | news.ycombinator.com | 2024-01-31

    We base our pricing on your user and consumption scale and would be happy to discuss this with you directly. Please feel free to explore the OS version of Active at https://github.com/encord-team/encord-active. Note that some features, such as natural language search using GPU accelerated APIs, are not included in the cloud version.

  • ProSelfLC-AT

    noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

  • scikit-clean

    A collection of algorithms for detecting and handling label noise

  • Project mention: Ask HN: What side projects landed you a job? | news.ycombinator.com | 2023-12-03

    Among all these feel-good stories, how about one with a bit different ending?

    During my master's, I created an ML library that dealt with noise in datasets. I implemented a bunch of papers, but unlike the usual research code, I spent a long time obsessing over its API and performance, and created documentation, CI, the whole shebang [1]. But then, as with the average research code, I moved on and promptly forgot about it.

    One day about a year ago, the cofounder of a very new, small startup working on something similar messaged me on LinkedIn about the project. We chatted for a bit, but as a guy who thinks he's too cool for LinkedIn, I next logged in and saw his last message, about wanting to collaborate, three or four months after the fact.

    Well, they raised $25 million a few months ago :(

    [1] https://github.com/Shihab-Shahriar/scikit-clean

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

noisy-labels related posts

  • [R] ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State

    1 project | /r/MachineLearning | 26 Jul 2022
  • [D] Should expert opinion be a bigger part of the Machine Learning world?

    2 projects | /r/MachineLearning | 25 Mar 2022

Index

What are some of the best open-source noisy-label projects? This list will help you:

#  Project                             Stars
1  cleanlab                            8,719
2  Awesome-Learning-with-Label-Noise   2,534
3  awesome-open-data-centric-ai          678
4  Encord Active                         420
5  ProSelfLC-AT                           58
6  scikit-clean                           13
