SaaSHub helps you find the best software and product alternatives Learn more →
Top 3 Jupyter Notebook vision-language Projects
-
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
-
pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion) (by google-research)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I suggest trying BLIP for this. I've had really good results from that.
https://github.com/salesforce/BLIP
I built a tiny Python CLI wrapper for it to make it easier to try: https://github.com/simonw/blip-caption
Project mention: CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss | dev.to | 2024-04-15GitHub
Jupyter Notebook vision-language related posts
- Is there a website where you can upload a photo and get the description in a paragraph?
- Is there a way to do segmentation of a person's clothing?
- Stable Diffusion v2-1-unCLIP model released
- GPT-4 shows emergent Theory of Mind on par with an adult. It scored in the 85+ percentile for a lot of major college exams. It can also do taxes and create functional websites from a simple drawing
- meme
- Object Recognition for Photo Metadata
- Stable-diffusion in Nix
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 Apr 2024
Index
What are some of the best open-source vision-language projects in Jupyter Notebook? This list will help you:
Project | Stars | |
---|---|---|
1 | BLIP | 4,222 |
2 | pix2seq | 806 |
3 | AlphaCLIP | 478 |