Python weak-supervision

Open-source Python projects categorized as weak-supervision

Top 7 Python weak-supervision Projects

  • cleanlab

    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

    Project mention: [P] Datalab: A Linter for ML Datasets | 2023-05-16

    I recently published a blog introducing Datalab and an open-source Python implementation that is easy-to-use for all data types (image, text, tabular, audio, etc). For data scientists, I’ve made a quick Jupyter tutorial to run Datalab on your own data.

  • snorkel

    A system for quickly generating training data with weak supervision

    Project mention: [P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions. | 2023-03-03

    The paid product came out of an open source tool:

  • argilla

    ✨Argilla: the open-source data curation platform for LLMs

    Project mention: Meet Argilla: An Open-Source Data Curation Platform for Large Language Models (LLMs) and MLOps for Natural Language Processing | 2023-05-19

    Github link:

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

    Project mention: Entity Extraction with Predefined List | 2023-01-07

    Thanks for pointing me in the right direction. Seems like there’s a few other approaches with weak supervision:

  • wrench

    WRENCH: Weak supeRvision bENCHmark

  • weasel

    Weakly Supervised End-to-End Learning (NeurIPS 2021) (by autonlab)

  • zeroshot_topics

    Topic Inference with Zeroshot models


NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-19.

What are some of the best open-source weak-supervision projects in Python? This list will help you:

#  Project          Stars
1  cleanlab         5,935
2  snorkel          5,495
3  argilla          1,971
4  skweak             877
5  wrench             191
6  weasel             142
7  zeroshot_topics     59