[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding