Top 6 zero-shot-classification Open-Source Projects
-
notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
text-to-image-eval
Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.
Yeah, inference[1] is our open source package for running locally (either directly in Python or via a Docker container). It works with all the models on Universe, models you train yourself (assuming we support the architecture; we have a bunch of notebooks available[2]), or train in our platform, plus several more general foundation models[3] (for things like embeddings, zero-shot detection, question answering, OCR, etc).
We also have a hosted API[4] you can hit for most models we support (except some of the large vision models that are really GPU-heavy) if you prefer.
[1] https://github.com/roboflow/inference
[2] https://github.com/roboflow/notebooks
[3] https://inference.roboflow.com/foundation/about/
[4] https://docs.roboflow.com/deploy/hosted-api
Project mention: We Built an Open-Source Text-to-Image Evaluation Library for Clip Models | news.ycombinator.com | 2024-05-07Hi everyone,
We recently released TTI Eval `text-to-image-eval`, an open-source library for evaluating zero-shot classification models like CLIP and domain-specific ones like BioCLIP against your (or HF) datasets to estimate how well the model will perform.
You can evaluate custom and HuggingFace text-to-image/zero-shot image classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. The evaluation metrics include Zero-shot accuracy, linear probe, image retrieval, and KNN accuracy.
We built this for ML engineers and developers using CLIP models.
Here's the installation guide if you want to get started: https://github.com/encord-team/text-to-image-eval?tab=readme...
I'd love to hear your thoughts on this. I'm open to contributions and feedback from the community.
Index
What are some of the best open-source zero-shot-classification projects? This list will help you:
Project | Stars | |
---|---|---|
1 | open_clip | 8,622 |
2 | notebooks | 4,250 |
3 | hcaptcha-challenger | 1,426 |
4 | InternVideo | 994 |
5 | cybertron | 258 |
6 | text-to-image-eval | 19 |
Sponsored