Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Why do you think that https://github.com/OpenGVLab/InternVideo is a good alternative to ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Why do you think that https://github.com/OpenGVLab/InternVideo is a good alternative to ALPRO