Encord Active
awesome-open-data-centric-ai
Encord Active | awesome-open-data-centric-ai | |
---|---|---|
6 | 1 | |
420 | 680 | |
0.5% | - | |
8.8 | 5.8 | |
15 days ago | 6 months ago | |
Python | ||
Apache License 2.0 | Creative Commons Attribution 4.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Encord Active
-
Launch HN: Encord (YC W21) – Unit testing for computer vision models
We base our pricing on your user and consumption scale and would be happy to discuss this with you directly. Please feel free to explore the OS version of Active at https://github.com/encord-team/encord-active. Note that some features, such as natural language search using GPU accelerated APIs, are not included in the cloud version.
- We tried injecting hallucinogenics into vision models
-
How to Fine-Tune Foundation Models to Auto-Label Training Data
Webinar from last week on how to fine-tune VFMs, specifically Meta's Segment Anything Model (SAM).
What you'll need to follow along the fine-tuning walkthrough:
Images, ground-truth masks, and optionally, prompts from the Stamp Verification (StaVer) Dataset on Kaggle (https://www.kaggle.com/datasets/rtatman/stamp-verification-s...)
Download the model weights for SAM the official GitHub repo (https://github.com/facebookresearch/segment-anything)
Good understanding of the model architecture Segment Anything paper (https://ai.meta.com/research/publications/segment-anything/)
GPU infra the NVIDIA A100 should do for this fine-tuning.
Data curation and model evaluation tool Encord Active (https://github.com/encord-team/encord-active)
Colab walkthrough for fine-tuning: https://colab.research.google.com/github/encord-team/encord-...
I'd love to get your thoughts and feedback. Thank you.
- Show HN: Open-source toolkit for ML model evaluation and active learning
-
modAL VS encord-active - a user suggested alternative
2 projects | 12 Apr 2023
An active learning toolkit I use to find failure modes in my vision datasets, prioritize which data to label next using the different acquisition functions.
awesome-open-data-centric-ai
-
[P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions.
Here is the link to the Github repo: https://github.com/Renumics/awesome-open-data-centric-ai Do you think there are tools missing? Please let me know or feel free to submit a pull request.
What are some alternatives?
tsuki-wscp - Web scraper for AI/ML training
internet-explorer - Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset.
cleanlab - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
spotlight - Interactively explore unstructured datasets from your dataframe.
modAL - A modular active learning framework for Python
WhereIsAI - AI company, product, and tool collection.
panda_patrol
awesome-synthetic-data - 📖 A curated list of resources dedicated to synthetic data
Awesome-Learning-with-Label-Noise - A curated list of resources for Learning with Noisy Labels
refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.