Top 9 Python multimodal-learning Projects
-
Multimodal-Toolkit
Multimodal model for text and tabular data, with HuggingFace transformers as the building block for text data
-
pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the PyTorch ecosystem. Star to support our work!
-
LViT
[IEEE Transactions on Medical Imaging/TMI] This repo is the official implementation of "LViT: Language meets Vision Transformer in Medical Image Segmentation"
-
UPop
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
-
valhalla-nmt
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
-
Coin-CLIP
Coin-CLIP: a CLIP model fine-tuned on a vast collection of coin images using contrastive learning. It enhances feature extraction for coins, boosting image search accuracy. The model combines a Vision Transformer (ViT) with CLIP's multimodal learning and is optimized for numismatic applications.
Python multimodal-learning discussion
Python multimodal-learning related posts
-
New Multimodal Model Coin-CLIP for Coin Identification/Recognition
-
Are there any multimodal AI models I can use to provide a paired text *and* image input, to then generate an expanded descriptive text output? [D]
-
[D] Multi modal for visual qna based on a given image. Need suggestions.
-
Open Flamingo: An open-source framework for training large multimodal models
-
[D]Are there any good solutions for multimodal classification? Libraries, AutoML tool?
-
Classification problem with text and numerical features
Index
What are some of the best open-source multimodal-learning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | open_flamingo | 3,897 |
| 2 | Multimodal-Toolkit | 603 |
| 3 | XPretrain | 491 |
| 4 | pykale | 457 |
| 5 | LViT | 338 |
| 6 | ViT-Lens | 175 |
| 7 | UPop | 101 |
| 8 | valhalla-nmt | 28 |
| 9 | Coin-CLIP | 19 |