- Awesome-Multimodal-Large-Language-Models VS alpaca_farm
- Awesome-Multimodal-Large-Language-Models VS Chain-of-ThoughtsPapers
- Awesome-Multimodal-Large-Language-Models VS MindVideo
- Awesome-Multimodal-Large-Language-Models VS Otter
- Awesome-Multimodal-Large-Language-Models VS instructblip-pipeline
- Awesome-Multimodal-Large-Language-Models VS Awesome-LLM-Reasoning
- Awesome-Multimodal-Large-Language-Models VS Awesome-Multimodal-LLM
- Awesome-Multimodal-Large-Language-Models VS unilm
Awesome-Multimodal-Large-Language-Models Alternatives
Similar projects and alternatives to Awesome-Multimodal-Large-Language-Models
-
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Chain-of-ThoughtsPapers
Discontinued A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
-
instructblip-pipeline
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
-
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
-
Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Awesome-Multimodal-Large-Language-Models reviews and mentions
-
Don't we need a leaderboard for visual models?
There is this one: https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Evaluation As well as a leaderboard from OpenCompass (probably outdated): https://mmbench.opencompass.org.cn/leaderboard
-
Recommended open LLMs with image input modality?
https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Evaluation this is pretty comprehensive. tldr; blip is probably the best, though i've heard it does need a lot of vram. In my experience its the most responsive to prompt engineering.
Stats
Popular Comparisons
Sponsored