mPLUG-Owl
Video-LLaMA
Our great sponsors
mPLUG-Owl | Video-LLaMA | |
---|---|---|
2 | 8 | |
1,945 | 2,423 | |
8.5% | 10.5% | |
7.6 | 8.4 | |
24 days ago | 6 months ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mPLUG-Owl
-
Unleash the Power of Video-LLaMA: Revolutionizing Language Models with Video and Audio Understanding!
We extend our deepest gratitude to the extraordinary projects that have influenced and contributed to the development of Video-LLaMA. We're indebted to MiniGPT-4, FastChat, BLIP-2, EVA-CLIP, ImageBind, LLaMA, VideoChat, LLaVA, WebVid, and mPLUG-Owl for their invaluable contributions. Special thanks to Midjourney for creating the stunning Video-LLaMA logo, encapsulating the essence of our groundbreaking project.
- [P]mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Video-LLaMA
- Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
-
OpenAI vs Google, Detect ChatGPT Content with 99% accuracy, Navigating AI compute costs
👀 Video-LLaMA - Empower large language models with video and audio understanding capability. (link) 🦦 Otter - Multi-modal model with improved instruction-following and in-context learning ability. 🔗 Linkly.AI - AI-powered lead analytics and management platform that helps you track, analyze, and streamline your leads in one place. 🎬 Jet Cut Ready - AI plugin for Adobe Premiere Pro that automatically removes silent parts in videos. (link) 💬 HeyGen's ChatGPT Plugin - Convert text into high-quality videos using AI text and video generation.
- Video-LLaMA: Instruction-Tuned Audio-Visual Lang Model for Video Understanding
-
Unleash the Power of Video-LLaMA: Revolutionizing Language Models with Video and Audio Understanding!
Prepare to be blown away by the cutting-edge Video-LLaMA project! We're pushing the boundaries of language models by equipping them with the remarkable ability to comprehend video and audio. Get ready for an extraordinary adventure! 🌟
-
Video-LLaMA An Instruction-tuned Audio-Visual Language Model for Video Understanding
Source Code: The codebase for pre-training and fine-tuning the Video-LLaMA model as well as the model weights are available on GitHub: https://github.com/DAMO-NLP-SG/Video-LLaMA
-
Video-ChatGPT: Redefining Interactions with Visual Data
Tons of cool stuff happening in the space, also recently saw the LLaMa-Video version of this - https://github.com/DAMO-NLP-SG/Video-LLaMA
-
Meet Video-LLaMA: A Multi-Modal Framework that Empowers Large Language Models (LLMs) with the Capability of Understanding both Visual and Auditory Content in the Video
Code: https://github.com/DAMO-NLP-SG/Video-LLaMA
What are some alternatives?
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
NExT-GPT - Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
ExpertLLaMA - An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
Otter - 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
LLMSurvey - The official GitHub page for the survey paper "A Survey of Large Language Models".
Chinese-LLaMA-Alpaca - ä¸æ–‡LLaMA&Alpaca大è¯è¨€æ¨¡åž‹+本地CPU/GPUè®ç»ƒéƒ¨ç½² (Chinese LLaMA & Alpaca LLMs)
ChatPDF - Chat with any PDF. Easily upload the PDF documents you'd like to chat with. Instant answers. Ask questions, extract information, and summarize documents with AI. Sources included.
MiniGPT-4-discord-bot - A true multimodal LLaMA derivative -- on Discord!
CodeCapybara - Open-source Self-Instruction Tuning Code LLM
chat-llama-discord-bot - A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other Large Language Model (LLM) supported by text-generation-webui or llama.cpp.