VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking (by OpenGVLab)
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS. (by OpenGVLab)
VideoMAEv2 | Ask-Anything | |
---|---|---|
1 | 3 | |
405 | 2,703 | |
8.6% | 4.8% | |
4.1 | 8.1 | |
2 months ago | about 1 month ago | |
Python | Python | |
MIT License | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
VideoMAEv2
Posts with mentions or reviews of VideoMAEv2.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-04-19.
-
[Demo] Watch Videos with ChatGPT
Thanks for your interest! If you had any ideas to make the given demo more user-friendly, please do not hesitate to share them with us. We are open to discussing relevant ideas about video foundation models or other topics. We made some progress in these areas (InternVideo, VideoMAE v2, UMT, and more). We believe that user-level intelligent video understanding is on the horizon with the current LLM, computing power, and video data.
Ask-Anything
Posts with mentions or reviews of Ask-Anything.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-04-19.
- ChatGPT with Video Understanding
-
Ask-Anything, tool for chatting about video with chatGPT, miniGPT4 and StableLM
GitHub link: https://github.com/OpenGVLab/Ask-Anything
-
[Demo] Watch Videos with ChatGPT
The project currently only has a basic framework and includes two main subprojects by leveraging existing APIs and open-sourced solutions:
What are some alternatives?
When comparing VideoMAEv2 and Ask-Anything you can also consider the following projects:
InternVideo - Video Foundation Models & Data for Multimodal Understanding
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
unmasked_teacher - [ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
openscene - [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
mmaction2 - OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark