InternVideo Alternatives

Similar projects and alternatives to InternVideo

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better InternVideo alternative or higher similarity.

InternVideo reviews and mentions

Posts with mentions or reviews of InternVideo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-19.
  • [Demo] Watch Videos with ChatGPT
    7 projects | /r/ChatGPT | 19 Apr 2023
    Thanks for your interest! If you had any ideas to make the given demo more user-friendly, please do not hesitate to share them with us. We are open to discussing relevant ideas about video foundation models or other topics. We made some progress in these areas (InternVideo, VideoMAE v2, UMT, and more). We believe that user-level intelligent video understanding is on the horizon with the current LLM, computing power, and video data.
  • [R] InternVideo: General Video Foundation Models via Generative and Discriminative Learning
    1 project | /r/MachineLearning | 10 Apr 2023
    Found relevant code at https://github.com/OpenGVLab/InternVideo + all code implementations here
    2 projects | /r/u_noise_3 | 10 Apr 2023
    The foundation models have recently shown excellent performance on a variety of downstream tasks in computer vision. However, most existing vision foundation models simply focus on image-level pretraining and adaption, which are limited for dynamic and complex video-level understanding tasks. To fill the gap, we present general video foundation models, InternVideo, by taking advantage of both generative and discriminative self-supervised video learning. Specifically, InternVideo efficiently explores masked video modeling and video-language contrastive learning as the pretraining objectives, and selectively coordinates video representations of these two complementary frameworks in a learnable manner to boost various video applications. Without bells and whistles, InternVideo achieves state-of-the-art performance on 39 video datasets from extensive tasks including video action recognition/detection, video-language alignment, and open-world video applications. Especially, our methods can obtain 91.1% and 77.2% top-1 accuracy on the challenging Kinetics-400 and Something-Something V2 benchmarks, respectively. All of these results effectively show the generality of our InternVideo for video understanding. The code will be released at https://github.com/OpenGVLab/InternVideo.
  • A note from our sponsor - SaaSHub
    www.saashub.com | 28 Apr 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic InternVideo repo stats
3
909
8.0
7 days ago

OpenGVLab/InternVideo is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of InternVideo is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com