Python multimodal-deep-learning

Open-source Python projects categorized as multimodal-deep-learning

Top 9 Python multimodal-deep-learning Projects

multimodal-deep-learning
  1. Time-LLM

    [ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. pytorch-widedeep

    A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

  4. VQASynth

    Compose multimodal datasets 🎹

  5. CLoT

    CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation". (by sail-sg)

  6. DeepViewAgg

    [CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

  7. CapDec

    CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

  8. 3DCoMPaT-v2

    3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

  9. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  10. multimind-sdk

    Your SDK solves all of this. One interface. Unified logic. Local + hosted models. Fine-tuning. Agent tools. Enterprise-ready. Hybrid RAG.Star 🌟 if you like it!

    Project mention: One Input, Multiple AI Minds: Meet the New MultiMindSDK LLM Router | dev.to | 2025-07-11

    GitHub: github.com/multimindlab/multimind-sdk

  11. LIMoE-pytorch

    PyTorch implementation of LIMoE

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python multimodal-deep-learning discussion

Log in or Post with

Python multimodal-deep-learning related posts

  • Open source – Unsupervised captioning getting closer to supervised captioning

    1 project | news.ycombinator.com | 20 Apr 2024
  • [D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool

    1 project | /r/MachineLearning | 10 May 2023
  • Reverse engineer Stable Diffusion images

    2 projects | news.ycombinator.com | 8 Feb 2023
  • [R] [CVPR 2022 Oral] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation

    2 projects | /r/MachineLearning | 11 May 2022

Index

What are some of the best open-source multimodal-deep-learning projects in Python? This list will help you:

# Project Stars
1 Time-LLM 2,203
2 pytorch-widedeep 1,367
3 VQASynth 466
4 CLoT 317
5 DeepViewAgg 233
6 CapDec 198
7 3DCoMPaT-v2 84
8 multimind-sdk 58
9 LIMoE-pytorch 53

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?