Python Annotation

Open-source Python projects categorized as Annotation

Top 20 Python Annotation Projects

  1. cleanlab

    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

    Project mention: Ask HN: Not a webdev, why are these sites so good? | news.ycombinator.com | 2024-06-18

    https://cleanlab.ai/

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. h

    Annotate with anyone, anywhere.

    Project mention: Ann, the Small Annotation Server | news.ycombinator.com | 2025-05-20

    > Does Hypothes.is have a self-hosting option? https://web.hypothes.is/sales/

    The code for both the client and server are open source (https://github.com/hypothesis/h) so this is possible. The server is designed to support the needs of large scale deployments, so this does come with some complexity compared to a system you would design for smaller scale usage.

    The text on https://web.hypothes.is/ mostly targets schools and universities, because Hypothesis pays for itself by selling integrations with online learning platforms (Canvas, D2L, Blackboard etc.) and associated support.

  4. diffgram

    The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

  5. labelCloud

    A lightweight tool for labeling 3D bounding boxes in point clouds.

  6. bakta

    Rapid & standardized annotation of bacterial genomes, MAGs & plasmids

  7. errant

    ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.

  8. bbox-visualizer

    Make drawing and labeling bounding boxes easy as cake

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. atlas

    ATLAS - Three commands to start analyzing your metagenome data (by metagenome-atlas)

  11. grasp

    A reliable org-capture browser extension for Chrome/Firefox

  12. DLTA-AI

    Data Labeling, Tracking and Annotation with AI

  13. spectree

    API spec validator and OpenAPI document generator for Python web frameworks.

  14. globox

    A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COCO and PascalVOC metrics.

  15. turkle

    Django-based clone of Amazon's Mechanical Turk service running in your local environment.

  16. labelbox-python

    The data factory for next gen AI

  17. image-sorter2

    One-click image sorting/labelling script

  18. ansible-docgen

    Generate documentation from annotated Ansible Playbooks and Roles

  19. labelformat

    A tool for converting computer vision label formats.

  20. reconstruction-error-ratios

    Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!

    Project mention: How difficult is this dataset REALLY? | dev.to | 2024-12-10
  21. vogon-web

    Building the epistemic web

  22. active-learning-plugin

    Label your dataset with active learning in FiftyOne!

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Annotation discussion

Log in or Post with

Python Annotation related posts

Index

What are some of the best open-source Annotation projects in Python? This list will help you:

# Project Stars
1 cleanlab 10,588
2 h 3,041
3 diffgram 1,867
4 labelCloud 690
5 bakta 519
6 errant 444
7 bbox-visualizer 401
8 atlas 392
9 grasp 361
10 DLTA-AI 344
11 spectree 336
12 globox 200
13 turkle 152
14 labelbox-python 138
15 image-sorter2 91
16 ansible-docgen 75
17 labelformat 62
18 reconstruction-error-ratios 25
19 vogon-web 13
20 active-learning-plugin 12

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?