SaaSHub helps you find the best software and product alternatives Learn more →
Top 16 Python image-captioning Projects
-
InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
-
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
-
awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
-
UPop
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
-
perturb-predict-paraphrase
Implementation of Perturb, Predict & Paraphrase: Semi-supervised Learning using Noisy Student for Image Captioning
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
You can also create an issue and ask the developers for help.
Project mention: Show HN: I scraped 200M Shopify products to build a search engine | news.ycombinator.com | 2024-02-22I found some things on Github you could use, I'm not a dev myself and I'm not sure how scalable these are, but have a look, maybe there's something useful. https://github.com/jhc13/taggui
The category filtering is what I wanted to get at, I think the search would improve a lot.
Project mention: Show HN: Compress vision-language and unimodal AI models by structured pruning | news.ycombinator.com | 2023-07-31
Concept Modeling Techniques: the built-in concept modeling technique in this walkthrough uses GPT-4V and some light prompting to identify each cluster's core concept. This is but one way to approach an open-ended problem. Try using image captioning and topic modeling, or create your own technique!
Python image-captioning related posts
- Need help for a colab notebook running Lavis blip2_instruct_vicuna13b?
- most sane web3 job listing
- I work at a non-tech company and have been asked to make software that is impossible. How do I explain it to my boss?
- Two-minute Daily AI Update (Date: 5/15/2023)
- InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
- [D] Tracking Dancing People
- Meet Prismer: An Open Source Vision-Language Model with An Ensemble of Experts
-
A note from our sponsor - SaaSHub
www.saashub.com | 24 Apr 2024
Index
What are some of the best open-source image-captioning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | InternGPT | 3,121 |
2 | a-PyTorch-Tutorial-to-Image-Captioning | 2,591 |
3 | OFA | 2,323 |
4 | prismer | 1,287 |
5 | Oscar | 1,025 |
6 | virtex | 556 |
7 | awesome-foundation-and-multimodal-models | 502 |
8 | meshed-memory-transformer | 497 |
9 | taggui | 298 |
10 | MAGIC | 245 |
11 | catr | 242 |
12 | CLIP-Caption-Reward | 220 |
13 | UPop | 83 |
14 | image-captioning | 29 |
15 | fiftyone-image-captioning-plugin | 5 |
16 | perturb-predict-paraphrase | 5 |
Sponsored