SaaSHub helps you find the best software and product alternatives Learn more →
Top 16 Python image-captioning Projects
-
InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
-
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
-
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
-
-
-
awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
-
-
-
-
-
CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
-
CapRL
[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
Project mention: CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning | news.ycombinator.com | 2025-11-04 -
-
fiftyone-image-captioning-plugin
Caption images across your datasets with state of the art models from Hugging Face and Replicate!
-
perturb-predict-paraphrase
Implementation of Perturb, Predict & Paraphrase: Semi-supervised Learning using Noisy Student for Image Captioning
Python image-captioning discussion
Python image-captioning related posts
-
Need help for a colab notebook running Lavis blip2_instruct_vicuna13b?
-
most sane web3 job listing
-
I work at a non-tech company and have been asked to make software that is impossible. How do I explain it to my boss?
-
Two-minute Daily AI Update (Date: 5/15/2023)
-
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
-
[D] Tracking Dancing People
-
Meet Prismer: An Open Source Vision-Language Model with An Ensemble of Experts
-
A note from our sponsor - SaaSHub
www.saashub.com | 15 Jun 2026
Index
What are some of the best open-source image-captioning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | InternGPT | 3,203 |
| 2 | a-PyTorch-Tutorial-to-Image-Captioning | 2,892 |
| 3 | OFA | 2,557 |
| 4 | Caption-Anything | 1,774 |
| 5 | taggui | 1,321 |
| 6 | prismer | 1,312 |
| 7 | awesome-foundation-and-multimodal-models | 639 |
| 8 | virtex | 563 |
| 9 | meshed-memory-transformer | 546 |
| 10 | catr | 271 |
| 11 | MAGIC | 261 |
| 12 | CLIP-Caption-Reward | 246 |
| 13 | CapRL | 215 |
| 14 | image-captioning | 50 |
| 15 | fiftyone-image-captioning-plugin | 11 |
| 16 | perturb-predict-paraphrase | 6 |