Top 23 clip Open-Source Projects
-
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
-
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
-
uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and video, up to 5x faster than OpenAI CLIP and LLaVA
-
Text2LIVE
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
-
Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
-
awesome-foundation-and-multimodal-models
Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
-
Disco_Diffusion_Local
Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on Windows, despite some Linux only dependencies ;)
-
CLIPstyler
Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition" (CVPR 2022)
-
Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
-
TargetCLIP
[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.
We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo
Project mention: AI Horde's AGPL3 hordelib receives DMCA take-down from hlky | news.ycombinator.com | 2023-05-31
It's image -> words, the inverse of stable diffusion.
see: https://github.com/pharmapsychotic/clip-interrogator
You might be interested in this, https://github.com/mazzzystar/Queryable, https://queryable.app/
I run it on my iPhone.
Native app. Doesn't require a network connection (great for privacy).
Project mention: X-AnyLabeling: Effortless Data Labeling with AI, Segment Anything and Others | news.ycombinator.com | 2024-03-19
Project mention: Stable Diffusion implemented by ncnn framework based on C++, supported txt2img and img2img! | /r/StableDiffusion | 2023-06-08
Project mention: CatLIP: Clip Vision Accuracy with 2.7x Faster Pre-Training on Web-Scale Data | news.ycombinator.com | 2024-04-25
Question: any good on-device size image embedding models?
I tried https://github.com/unum-cloud/uform, which I do like, especially since they also support languages other than English. Any recommendations on other alternatives?
Project mention: OPENSCENE can identify objects, materials, affordances, activities, and room types in complex 3D scenes, all using a single model trained without any labeled 3D data | /r/AR_MR_XR | 2023-06-17
Project website: github.io/openscene
Project mention: Ask HN: What are some unpopular technologies you wish people knew more about? | news.ycombinator.com | 2023-12-02
Project mention: [R]Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model | /r/MachineLearning | 2023-05-20
Code: https://github.com/OpenGVLab/Instruct2Act
clip related posts
-
Open source - Unsupervised captioning getting closer to supervised captioning
-
I accidentally built a meme search engine
-
How to Build a Semantic Search Engine for Emojis
-
New Multimodal Model Coin-CLIP for Coin Identification/Recognition
-
MetaCLIP - Meta AI Research
-
Meta's Segment Anything written with C++ / GGML
-
Shortcuts ?
Index
What are some of the best open-source clip projects? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | marqo | 4,124 |
2 | Chinese-CLIP | 3,642 |
3 | mmpretrain | 3,171 |
4 | clip-interrogator | 2,484 |
5 | Queryable | 2,424 |
6 | X-AnyLabeling | 2,477 |
7 | clip-retrieval | 2,139 |
8 | Awesome-CLIP | 1,019 |
9 | Stable-Diffusion-NCNN | 935 |
10 | natural-language-image-search | 927 |
11 | natural-language-youtube-search | 895 |
12 | uform | 885 |
13 | Text2LIVE | 849 |
14 | aphantasia | 769 |
15 | Transformer-MM-Explainability | 704 |
16 | openscene | 548 |
17 | awesome-foundation-and-multimodal-models | 510 |
18 | clip.cpp | 388 |
19 | Disco_Diffusion_Local | 312 |
20 | CLIPstyler | 286 |
21 | Instruct2Act | 257 |
22 | MAGIC | 245 |
23 | TargetCLIP | 228 |