awesome-open-data-centric-ai
internet-explorer
awesome-open-data-centric-ai | internet-explorer | |
---|---|---|
1 | 7 | |
678 | 160 | |
- | 0.0% | |
5.8 | 2.4 | |
6 months ago | about 1 year ago | |
Creative Commons Attribution 4.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-open-data-centric-ai
-
[P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions.
Here is the link to the Github repo: https://github.com/Renumics/awesome-open-data-centric-ai Do you think there are tools missing? Please let me know or feel free to submit a pull request.
internet-explorer
- Internet Explorer: Targeted Representation Learning on the Open Web
-
[R] Internet Explorer: Targeted Representation Learning on the Open Web - Carnegie Mellon University Alexander C. Li et al 2023 - Trained on a single GPU for 40 hours and outperforms CLIP ResNet-50 that was trained on 4000 GPU hours!
Found relevant code at https://internet-explorer-ssl.github.io/ + all code implementations here
-
Internet Explorer: Targeted Representation Learning on the Open Web - Carnegie Mellon University Alexander C. Li et al 2023 - Trained on a single GPU for 40 hours and outperforms CLIP ResNet-50 that was trained on 4000 GPU hours!
Blog: https://internet-explorer-ssl.github.io/
-
CMU Researchers Introduce Internet Explorer: An AI Approach with Targeted Representation Learning on the Open Web
Quick Read: https://www.marktechpost.com/2023/03/08/cmu-researchers-introduce-internet-explorer-an-ai-approach-with-targeted-representation-learning-on-the-open-web/ Paper: https://arxiv.org/pdf/2302.14051.pdf Github: https://github.com/internet-explorer-ssl/internet-explorer
What are some alternatives?
spotlight - Interactively explore unstructured datasets from your dataframe.
start-machine-learning-in-2020 - A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2021 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques! [Moved to: https://github.com/louisfb01/start-machine-learning]
Encord Active - Open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.
awesome-ai-awesomeness - A curated list of awesome awesomeness about artificial intelligence
WhereIsAI - AI company, product, and tool collection.
awesome-artificial-intelligence-research - A curated list of Artificial Intelligence (AI) Research, tracks the cutting edge trending of AI research, including recommender systems, computer vision, machine learning, etc.
awesome-synthetic-data - 📖 A curated list of resources dedicated to synthetic data
DataAug4Code - Source Code Data Augmentation for Deep Learning: A Survey.
Awesome-Learning-with-Label-Noise - A curated list of resources for Learning with Noisy Labels
cleanlab - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.