Multimodal-GPT
InternGPT
Multimodal-GPT | InternGPT | |
---|---|---|
4 | 5 | |
1,420 | 3,144 | |
2.7% | 1.8% | |
5.4 | 8.8 | |
12 months ago | 6 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Multimodal-GPT
- Meet MultiModal-GPT: A Vision and Language Model for Multi-Round Dialogue with Humans
-
Breaking: OpenAI plans to release an own open-source chatbot AI as it comes under competitive pressure. My analysis on what this means for ChatGPT and LLMs.
A number of them have popped up as training methods to introduce multimodality have proliferated. Here's one: https://mmgpt.openmmlab.org.cn/
- MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
- Train a multi-modal chatbot with visual and language instructions
InternGPT
-
How do I use the programs on Github?
You can also create an issue and ask the developers for help.
- InternGPT
- DragGAN demo is now live!! Best AI Tool For Editing Images
- Web based multimodal ChatGPT - InternGPT
What are some alternatives?
torchscale - Foundation Architecture for (M)LLMs
langchain-chatbot - Chatbot using LLM chat model and Langchain, LangSmith.
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
NExT-GPT - Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
ONE-PEACE - A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
MiniGPT-4-discord-bot - A true multimodal LLaMA derivative -- on Discord!
mPLUG-Owl - mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
xllm - 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
codeinterpreter-api - 👾 Open source implementation of the ChatGPT Code Interpreter
Multi-Modality-Arena - Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
agentchain - Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
benchllm - Continuous Integration for LLM powered applications