Video Foundation Models & Data for Multimodal Understanding
Why do you think that https://github.com/OpenGVLab/VideoMAEv2 is a good alternative to InternVideo?