mPLUG-Owl
AliceMind
Our great sponsors
mPLUG-Owl | AliceMind | |
---|---|---|
2 | 1 | |
1,945 | 1,936 | |
8.5% | 1.8% | |
7.6 | 5.7 | |
24 days ago | about 1 month ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mPLUG-Owl
-
Unleash the Power of Video-LLaMA: Revolutionizing Language Models with Video and Audio Understanding!
We extend our deepest gratitude to the extraordinary projects that have influenced and contributed to the development of Video-LLaMA. We're indebted to MiniGPT-4, FastChat, BLIP-2, EVA-CLIP, ImageBind, LLaMA, VideoChat, LLaVA, WebVid, and mPLUG-Owl for their invaluable contributions. Special thanks to Midjourney for creating the stunning Video-LLaMA logo, encapsulating the essence of our groundbreaking project.
- [P]mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
AliceMind
-
[P]mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Found relevant code at https://github.com/alibaba/AliceMind + all code implementations here
What are some alternatives?
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
RATransformers - RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
Video-LLaMA - [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
jina-financial-qa-search
ExpertLLaMA - An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
LLMSurvey - The official GitHub page for the survey paper "A Survey of Large Language Models".
extreme-bert - ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
NExT-GPT - Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
happy-transformer - Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
ChatPDF - Chat with any PDF. Easily upload the PDF documents you'd like to chat with. Instant answers. Ask questions, extract information, and summarize documents with AI. Sources included.
CodeCapybara - Open-source Self-Instruction Tuning Code LLM