Top 13 multimodal-deep-learning Open-Source Projects
-
BentoML
The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
pytorch-widedeep
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
-
Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
CLoT
Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation" (CVPR 2024) (by sail-sg)
-
DeepViewAgg
[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"
Link to GitHub -->
Yes general LLM models can be used for time series forecasting:
https://github.com/KimMeen/Time-LLM
Project mention: CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss | dev.to | 2024-04-15GitHub
Project mention: Open source – Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20
Project mention: Show HN: VQASynth – pipelines to synthesize VQA datasets | news.ycombinator.com | 2024-02-23
Project mention: [D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool | /r/MachineLearning | 2023-05-10
multimodal-deep-learning related posts
-
Open source – Unsupervised captioning getting closer to supervised captioning
-
[D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool
-
Reverse engineer Stable Diffusion images
-
[R] [CVPR 2022 Oral] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
Index
What are some of the best open-source multimodal-deep-learning projects? This list will help you:
Project | Stars | |
---|---|---|
1 | LAVIS | 8,738 |
2 | BentoML | 6,558 |
3 | Awesome-Text-to-Image | 1,878 |
4 | pytorch-widedeep | 1,238 |
5 | Time-LLM | 742 |
6 | blended-latent-diffusion | 509 |
7 | scarches | 310 |
8 | CLoT | 219 |
9 | DeepViewAgg | 215 |
10 | CapDec | 169 |
11 | VQASynth | 74 |
12 | 3DCoMPaT-v2 | 69 |
13 | Multimodal | 8 |
Sponsored