Top 23 clip Open-Source Projects

marqo

114 4,124 9.3 Python

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Project mention: Are we at peak vector database? | news.ycombinator.com | 2024-01-25

We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo

Chinese-CLIP

1 3,642 7.6 Python

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
mmpretrain

2 3,171 7.8 Python

OpenMMLab Pre-training Toolbox and Benchmark
clip-interrogator

27 2,484 4.8 Python

Image to prompt with BLIP and CLIP

Project mention: AI Horde’s AGPL3 hordelib receives DMCA take-down from hlky | news.ycombinator.com | 2023-05-31

It's image -> words, the inverse of stable diffusion.
see: https://github.com/pharmapsychotic/clip-interrogator

Queryable

5 2,424 7.9 Swift

Run OpenAI's CLIP model on iOS to search photos.

Project mention: I accidentally built a meme search engine | news.ycombinator.com | 2024-04-13

You might be interested in this, https://github.com/mazzzystar/Queryable, https://queryable.app/
I run it on my iPhone.
Native app. Doesn't require a network connection (great for privacy).

X-AnyLabeling

1 2,477 9.5 Python

Effortless data labeling with AI support from Segment Anything and other awesome models.

Project mention: X-AnyLabeling: Effortless Data Labeling with AI, Segment Anything and Others | news.ycombinator.com | 2024-03-19

clip-retrieval

11 2,139 7.7 Jupyter Notebook

Easily compute clip embeddings and build a clip retrieval system with them

Project mention: FLaNK AI for 11 March 2024 | dev.to | 2024-03-11

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Awesome-CLIP

2 1,019 0.0

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Stable-Diffusion-NCNN

8 935 4.9 C++

Stable Diffusion in NCNN with c++, supported txt2img and img2img

Project mention: Stable Diffusion implemented by ncnn framework based on C++, supported txt2img and img2img! | /r/StableDiffusion | 2023-06-08

natural-language-image-search

9 927 0.0 Jupyter Notebook

Search photos on Unsplash using natural language
natural-language-youtube-search

6 895 0.0 Jupyter Notebook

Search inside YouTube videos using natural language
uform

8 885 9.2 Python

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Project mention: CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data | news.ycombinator.com | 2024-04-25

question: any good on-device size image embedding models?
tried https://github.com/unum-cloud/uform which i do like, especially they also support languages other than English. Any recommendations on other alternatives?

Text2LIVE

2 849 0.0 Python

Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
aphantasia

21 769 3.9 Python

CLIP + FFT/DWT/RGB = text to image/video
Transformer-MM-Explainability

3 704 0.0 Jupyter Notebook

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
openscene

3 548 4.9 Python

[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

Project mention: OPENSCENE can identify objects, materials, affordances, activities, and room types in complex 3D scenes, all using a single model trained without any labeled 3D data | /r/AR_MR_XR | 2023-06-17

Project website: github.io/openscene

awesome-foundation-and-multimodal-models

1 510 7.5 Python

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Project mention: Foundation Multimodal Models | news.ycombinator.com | 2024-03-01

clip.cpp

3 388 8.1 C

CLIP inference in plain C/C++ with no extra dependencies

Project mention: Ask HN: What are some unpopular technologies you wish people knew more about? | news.ycombinator.com | 2023-12-02

Disco_Diffusion_Local

7 312 1.8 Jupyter Notebook

Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on Windows, despite some Linux only dependencies ;)
CLIPstyler

1 286 0.0 Python

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition" (CVPR 2022)
Instruct2Act

1 257 3.8 Python

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Project mention: [R]Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model | /r/MachineLearning | 2023-05-20

Code: https://github.com/OpenGVLab/Instruct2Act

MAGIC

2 245 0.0 Python

Language Models Can See: Plugging Visual Controls in Text Generation (by yxuansu)
TargetCLIP

3 228 0.0 Jupyter Notebook

[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

clip related posts

Open source – Unsupervised captioning getting closer to supervised captioning

1 project | news.ycombinator.com | 20 Apr 2024
I accidentally built a meme search engine

6 projects | news.ycombinator.com | 13 Apr 2024
How to Build a Semantic Search Engine for Emojis

1 project | dev.to | 7 Feb 2024
New Multimodal Model Coin-CLIP for Coin Identification/Recognition

1 project | /r/Multimodal | 8 Dec 2023
MetaCLIP – Meta AI Research

6 projects | news.ycombinator.com | 26 Oct 2023
Meta's Segment Anything written with C++ / GGML

4 projects | news.ycombinator.com | 5 Sep 2023
Shortcuts ?

1 project | /r/Queryable | 10 Jul 2023
A note from our sponsor - SaaSHub
www.saashub.com | 3 May 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source clip projects? This list will help you:

	Project	Stars
1	marqo	4,124
2	Chinese-CLIP	3,642
3	mmpretrain	3,171
4	clip-interrogator	2,484
5	Queryable	2,424
6	X-AnyLabeling	2,477
7	clip-retrieval	2,139
8	Awesome-CLIP	1,019
9	Stable-Diffusion-NCNN	935
10	natural-language-image-search	927
11	natural-language-youtube-search	895
12	uform	885
13	Text2LIVE	849
14	aphantasia	769
15	Transformer-MM-Explainability	704
16	openscene	548
17	awesome-foundation-and-multimodal-models	510
18	clip.cpp	388
19	Disco_Diffusion_Local	312
20	CLIPstyler	286
21	Instruct2Act	257
22	MAGIC	245
23	TargetCLIP	228