Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Why do you think that https://github.com/j-min/DallEval is a good alternative to ALPRO?