Top 6 zero-shot-classification Open-Source Projects

open_clip

29 8,622 8.2 Jupyter Notebook

An open source implementation of CLIP.

Project mention: FLaNK-AIM: 20 May 2024 Weekly | dev.to | 2024-05-20

notebooks

19 4,250 8.3 Jupyter Notebook

Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.

Project mention: Supervision: Reusable Computer Vision | news.ycombinator.com | 2024-03-24

Yeah, inference[1] is our open source package for running locally (either directly in Python or via a Docker container). It works with all the models on Universe, models you train yourself (assuming we support the architecture; we have a bunch of notebooks available[2]), or train in our platform, plus several more general foundation models[3] (for things like embeddings, zero-shot detection, question answering, OCR, etc).
We also have a hosted API[4] you can hit for most models we support (except some of the large vision models that are really GPU-heavy) if you prefer.
[1] https://github.com/roboflow/inference
[2] https://github.com/roboflow/notebooks
[3] https://inference.roboflow.com/foundation/about/
[4] https://docs.roboflow.com/deploy/hosted-api

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
hcaptcha-challenger

1 1,426 9.7 Python

🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
InternVideo

3 994 8.4 Python

Video Foundation Models & Data for Multimodal Understanding
cybertron

1 258 7.2 Go

Cybertron: the home planet of the Transformers in Go (by nlpodyssey)
text-to-image-eval

1 19 8.8 Jupyter Notebook

Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.

Project mention: We Built an Open-Source Text-to-Image Evaluation Library for Clip Models | news.ycombinator.com | 2024-05-07

Hi everyone,
We recently released TTI Eval `text-to-image-eval`, an open-source library for evaluating zero-shot classification models like CLIP and domain-specific ones like BioCLIP against your (or HF) datasets to estimate how well the model will perform.
You can evaluate custom and HuggingFace text-to-image/zero-shot image classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. The evaluation metrics include Zero-shot accuracy, linear probe, image retrieval, and KNN accuracy.
We built this for ML engineers and developers using CLIP models.
Here's the installation guide if you want to get started: https://github.com/encord-team/text-to-image-eval?tab=readme...
I'd love to hear your thoughts on this. I'm open to contributions and feedback from the community.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).