Python multimodal-deep-learning

Open-source Python projects categorized as multimodal-deep-learning

Top 8 Python multimodal-deep-learning Projects

  • BentoML

    The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!

  • Project mention: Who's hiring developer advocates? (December 2023) | dev.to | 2023-12-04

  • pytorch-widedeep

    A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in PyTorch

  • Time-LLM

    [ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

  • Project mention: karpathy/llm.c | news.ycombinator.com | 2024-04-08

    Yes, general LLMs can be used for time series forecasting:

    https://github.com/KimMeen/Time-LLM

  • DeepViewAgg

    [CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

  • CLoT

    Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation" (CVPR 2024) (by sail-sg)

  • Project mention: CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss | dev.to | 2024-04-15

  • CapDec

    CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

  • Project mention: Open source – Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20
  • VQASynth

    Compose multimodal datasets 🎹

  • Project mention: Show HN: VQASynth – pipelines to synthesize VQA datasets | news.ycombinator.com | 2024-02-23
  • 3DCoMPaT-v2

    3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

  • Project mention: [D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool | /r/MachineLearning | 2023-05-10
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python multimodal-deep-learning related posts

  • Open source – Unsupervised captioning getting closer to supervised captioning

    1 project | news.ycombinator.com | 20 Apr 2024
  • [D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool

    1 project | /r/MachineLearning | 10 May 2023
  • Reverse engineer Stable Diffusion images

    2 projects | news.ycombinator.com | 8 Feb 2023
  • [R] [CVPR 2022 Oral] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation

    2 projects | /r/MachineLearning | 11 May 2022

Index

What are some of the best open-source multimodal-deep-learning projects in Python? This list will help you:

#  Project           Stars
1  BentoML           6,537
2  pytorch-widedeep  1,238
3  Time-LLM            742
4  DeepViewAgg         215
5  CLoT                219
6  CapDec              169
7  VQASynth             71
8  3DCoMPaT-v2          68
