ONE-PEACE
Multimodal-GPT
ONE-PEACE | Multimodal-GPT | |
---|---|---|
2 | 4 | |
850 | 1,411 | |
4.4% | 2.1% | |
8.6 | 5.4 | |
5 months ago | 11 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ONE-PEACE
- A general representation modal across vision, audio, language modalities
-
Meet ONE-PEACE: A General Representation Model Towards Unlimited Modalities Across Different Modalities
Github: https://github.com/OFA-Sys/ONE-PEACE
Multimodal-GPT
- Meet MultiModal-GPT: A Vision and Language Model for Multi-Round Dialogue with Humans
-
Breaking: OpenAI plans to release an own open-source chatbot AI as it comes under competitive pressure. My analysis on what this means for ChatGPT and LLMs.
A number of them have popped up as training methods to introduce multimodality have proliferated. Here's one: https://mmgpt.openmmlab.org.cn/
- MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
- Train a multi-modal chatbot with visual and language instructions
What are some alternatives?
OFA - Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
torchscale - Foundation Architecture for (M)LLMs
ALPRO - Align and Prompt: Video-and-Language Pre-training with Entity Prompts
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
EVA - EVA Series: Visual Representation Fantasies from BAAI
mPLUG-Owl - mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
unilm - Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
InternGPT - InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)