multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥 (by roboflow)
multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large language models. (by sshh12)
multimodal-maestro | multi_token | |
---|---|---|
1 | 1 | |
955 | 144 | |
2.6% | - | |
8.6 | 8.5 | |
3 months ago | about 2 months ago | |
Python | Python | |
MIT License | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
multimodal-maestro
Posts with mentions or reviews of multimodal-maestro.
We have used some of these posts to build our list of alternatives
and similar projects.
multi_token
Posts with mentions or reviews of multi_token.
We have used some of these posts to build our list of alternatives
and similar projects.
What are some alternatives?
When comparing multimodal-maestro and multi_token you can also consider the following projects:
Mask_RCNN - Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
segment-anything-video - MetaSeg: Packaged version of the Segment Anything repository
InternGPT - InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.