Top 8 Python labeling-tool Projects
-
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
I’m fairly new to deep learning and learning as I go, so sorry if this is very basic, but I’m working on a model for detecting invasive coconut rhinoceros beetles destroying palm trees using drone photography. The 1080p photos I’m given were taken at 250 ft AGL and were cropped into equal-size smaller images, some containing one or more palm trees and some containing none. I’m using Label Studio to generate the XML files that point to the paths of their JPG counterparts.
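For readers in a similar situation, here is a minimal sketch of reading such an annotation file with Python's standard library. It assumes the XML follows the Pascal VOC layout (an `annotation` root, a `filename` or `path` element, and one `object` per bounding box), which is one of Label Studio's export options. The sample document, the `palm_tree` label, and the `read_voc_annotation` helper are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical Pascal VOC-style annotation for one cropped tile.
SAMPLE_XML = """
<annotation>
  <folder>images</folder>
  <filename>tile_0001.jpg</filename>
  <size><width>640</width><height>360</height><depth>3</depth></size>
  <object>
    <name>palm_tree</name>
    <bndbox><xmin>12</xmin><ymin>30</ymin><xmax>140</xmax><ymax>200</ymax></bndbox>
  </object>
</annotation>
"""


def read_voc_annotation(xml_text):
    """Return (image reference, [(label, (xmin, ymin, xmax, ymax)), ...])."""
    root = ET.fromstring(xml_text.strip())
    # Prefer an explicit <path> if the export has one, else fall back to <filename>.
    image_ref = root.findtext("path") or root.findtext("filename")
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"), coords))
    return image_ref, boxes


image_ref, boxes = read_voc_annotation(SAMPLE_XML)
print(image_ref, boxes)
```

A tile with no palm trees would simply have no `object` elements, so `boxes` comes back as an empty list.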
-
DeepLabCut
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Project mention: Landmark tracking / Pose estimation model training in TensorFlow : | reddit.com/r/tensorflow | 2022-09-05
Use DeepLabCut. I also strongly suggest that you fund their work: https://github.com/DeepLabCut/DeepLabCut
-
refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Project mention: [P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions. | reddit.com/r/MachineLearning | 2023-03-03
You definitely forgot https://www.kern.ai/ :)
-
compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning. (by alteryx)
Compose targets the labeling of raw data: you define labeling functions for your data in Python to make the labeling process easier.
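To make the idea of a labeling function concrete, here is a plain-Python sketch. It does not use Compose's own API; the `transactions` records, the `total_spent` function, and the `label_windows` helper are hypothetical and only illustrate computing one label per entity per fixed-size time window.

```python
from collections import defaultdict

# Hypothetical raw data: (customer, minute offset, amount spent).
transactions = [
    ("alice", 0, 10.0), ("alice", 30, 5.0), ("alice", 70, 20.0),
    ("bob", 10, 7.5), ("bob", 65, 2.5),
]


def total_spent(rows):
    """Labeling function: the label for a window is the total amount spent in it."""
    return sum(amount for _, _, amount in rows)


def label_windows(records, labeling_function, window_size):
    """Group records by (customer, window start) and label each group."""
    buckets = defaultdict(list)
    for record in records:
        customer, minute, _ = record
        window_start = (minute // window_size) * window_size
        buckets[(customer, window_start)].append(record)
    return {key: labeling_function(rows) for key, rows in sorted(buckets.items())}


labels = label_windows(transactions, total_spent, window_size=60)
for (customer, window_start), label in labels.items():
    print(customer, window_start, label)
```

Swapping `total_spent` for another function changes the prediction problem without touching the windowing logic, which is the separation the description above points at.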
-
I come from a similar application area, where I try to tag (annotate/label) a taxonomy of products iteratively. You are trying something slightly different, AFAIU: labeling a flat set of songs, each song with a set of tags from an ontology (a directed graph).
From an application point of view, this is what taxonomists often do when migrating products from one catalog to another: mapping one taxonomy to another. There has been quite active research on matching ontologies, so there are tools in both industry and research that help in that process, although I have never researched whether they do it iteratively and using sampling.
Another related area is labeling data to train machine learning models (in your case it sounds a bit like multilabel classification; in my case, it is multiclass classification). This is often done iteratively, and tools like Explosion's Prodigy sample for manual annotation only those items that the ML model is still not confident about.
This might be off-topic, but I looked at your library, and your notation for defining relations between tags reminded me of the RDF and OWL languages for defining ontologies. They are quite well-defined and have tools for making inferences (reasoners).
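The confidence-based sampling mentioned above can be sketched in a few lines. This is a generic least-confidence selection written for illustration, not Prodigy's implementation; the `predictions` data, the `select_for_annotation` helper, and the 0.8 threshold are assumptions.

```python
def select_for_annotation(predictions, threshold=0.8):
    """Return ids of items whose top predicted probability is below the threshold,
    least confident first."""
    uncertain = []
    for item_id, probabilities in predictions.items():
        confidence = max(probabilities.values())
        if confidence < threshold:
            uncertain.append((confidence, item_id))
    return [item_id for _, item_id in sorted(uncertain)]


# Hypothetical model output: per-item class probabilities.
predictions = {
    "song-1": {"rock": 0.95, "jazz": 0.05},
    "song-2": {"rock": 0.55, "jazz": 0.45},
    "song-3": {"rock": 0.30, "jazz": 0.70},
}

queue = select_for_annotation(predictions)
print(queue)
```

In an iterative loop, the items in `queue` would be labeled by hand, the model retrained, and the selection repeated until few items fall below the threshold.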
Index
What are some of the best open-source labeling-tool projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | label-studio | 12,420 |
2 | DeepLabCut | 3,567 |
3 | refinery | 1,145 |
4 | compose | 411 |
5 | hover | 296 |
6 | image-sorter2 | 76 |
7 | tapen | 11 |
8 | tree-labeller | 3 |