Metric | small-text | QDrant-NLP
---|---|---
Mentions | 4 | 1
Stars | 521 | 11
Growth | 0.6% | -
Activity | 7.6 | 10.0
Last commit | 10 days ago | over 1 year ago
Language | Python | Python
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
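The exact formulas behind these numbers are not given here. As a minimal sketch of how such metrics could be computed, the code below assumes an exponential recency weight per commit (a hypothetical 30-day half-life), a percentile rank scaled to 0-10 for activity, and a simple percentage change for growth. All function names and parameters are illustrative assumptions, not the site's actual implementation.

```python
from bisect import bisect_left
from typing import Sequence


def recency_weighted_score(commit_ages_days: Sequence[float],
                           half_life_days: float = 30.0) -> float:
    """Sum per-commit weights; a commit's weight halves every half_life_days.

    The half-life is an assumed parameter chosen for illustration.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(score: float, all_scores: Sequence[float]) -> float:
    """Share of tracked projects with a strictly lower score, scaled to 0-10."""
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # count of scores strictly below
    return round(10.0 * below / len(ranked), 1)


def monthly_growth(stars_now: int, stars_month_ago: int) -> float:
    """Month-over-month star growth as a percentage."""
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago
```

Under these assumptions, a project whose score beats 90% of the tracked projects gets an activity of 9.0, which matches the "top 10%" reading described above.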
small-text
- Small-Text: Looking for Contributors (Active Learning, Text Classification, NLP)
- 🤗 Active learning from scratch | Using Hugging Face Transformers, Rubrix, and small-text
  If you are new to small-text: https://github.com/webis-de/small-text
- [P] Small-Text: Active Learning for Text Classification in Python
QDrant-NLP
- Vector Databases for Data-Centric AI (Part 2)
  Just clone the QDrant-NLP repo and run `docker-compose up`. I would like to increase the number of datasets this can be tried on, either with GPU-backed lambda functions or by saving many example datasets to S3. So far I've only made a 6K subset of ag_news available (ag_news · Datasets at Hugging Face). This is the code snippet used to generate the embeddings via Hugging Face:
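The snippet the post refers to is not included in this excerpt. As a rough illustration only, and not the author's code, one common way to turn texts into embeddings with Hugging Face Transformers is to mean-pool the last hidden state over non-padding tokens. The model name, the function names, and the pooling choice below are all assumptions.

```python
from typing import List, Sequence


def mean_pool(token_vectors: Sequence[Sequence[float]],
              attention_mask: Sequence[int]) -> List[float]:
    """Average the token vectors whose attention-mask entry is non-zero."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask excludes every token")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def embed_texts(texts: List[str],
                model_name: str = "sentence-transformers/all-MiniLM-L6-v2"
                ) -> List[List[float]]:
    """Embed texts with a Hugging Face model (assumed choice, see lead-in).

    Imports are local so the pooling helper works without torch installed.
    """
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    # Pad to a common length so the batch forms a rectangular tensor.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        output = model(**batch)

    hidden = output.last_hidden_state.tolist()  # [batch, tokens, dim]
    masks = batch["attention_mask"].tolist()    # [batch, tokens]
    return [mean_pool(h, m) for h, m in zip(hidden, masks)]
```

Calling `embed_texts(["some headline", "another headline"])` needs `transformers` and `torch` installed and downloads the model on first use. The resulting vectors could then be loaded into a vector database for similarity search.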
What are some alternatives?
transformers-interpret - Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
happy-transformer - Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
NeoGPT - Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction while ensuring ultimate privacy
NewsMTSC - Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
fiftyone - The open-source tool for building high-quality datasets and computer vision models
argilla - Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
automl-docker - CLI-based tool to automatically build ML models from training data into a servable Docker container
Resume-Matcher - Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
DECIMER-Image_Transformer - DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
MONAILabel - MONAI Label is an intelligent open source image labeling and learning tool.