Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression. Learn more →
Top 8 Python data-labeling Projects
-
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
I’m fairly new to deep learning and learning as I got so sorry if this is very basic, but I’m working on a model for detecting invasive coconut rhinoceros beetles destroying palm trees using drone photography. The 1080p photos I’m given were taken 250ft AGL and were cropped into equal size smaller images with some having one or more palm trees and some having none. Im using I’m using labelStudio to generate the XML files that point to their jpg counterparts path.
-
Project mention: How do I connect application running in a notebook server to my local machine. | reddit.com/r/Kubeflow | 2022-12-11
I followed the guide to have doccano https://github.com/doccano/doccano setup in a notebook server in Kubeflow. It is running and the django connection is established, but it fails to connect with my localmachine so when I try to open the link, it does not respond. Is there a way to connect apps running on remove notebook servers to local in Kubeflow?
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Project mention: [P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions. | reddit.com/r/MachineLearning | 2023-03-03You definitely forgot https://www.kern.ai/ :)
-
compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning. (by alteryx)
Compose Compose targets labeling raw data, allowing you to set labeling functions for your data in Python in order to make the labeling process easier.
-
-
-
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
modzy-labelstudio-sample
Create training data labels from a production model with Modzy, Dropbox, and Label Studio
Project mention: Data Labeling for ML Model Retraining with Label Studio | reddit.com/r/artificial | 2022-09-26Link to Github repo.
Python data-labeling related posts
- How we used AI to automate stock sentiment classification
- German's NLP startup Kern AI has raised €2.7M in seed funding to accelerate its recent growth
- Why and how we started Kern AI (our seed funding announcement)
- GPT and BERT: A Comparison of Transformer Architectures
- Open-source tool to label, assess and maintain natural language data. Treat training data like a software artifact!
- Introducing bricks, an open-source content-library for NLP
- How to fine-tune your embeddings for better similarity search
-
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Mar 2023
Index
What are some of the best open-source data-labeling projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | label-studio | 12,336 |
2 | doccano | 7,505 |
3 | refinery | 1,142 |
4 | compose | 410 |
5 | bbox-visualizer | 327 |
6 | hover | 296 |
7 | mutate | 142 |
8 | modzy-labelstudio-sample | 11 |