video-pretrained-transformer

Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scratch on YouTube (YT-1B dataset). (by KastanDay)

Video-pretrained-transformer Alternatives

Similar projects and alternatives to video-pretrained-transformer

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better video-pretrained-transformer alternative or higher similarity.

video-pretrained-transformer reviews and mentions

Posts with mentions or reviews of video-pretrained-transformer. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-01.

Stats

Basic video-pretrained-transformer repo stats
1
42
6.5
about 1 year ago

KastanDay/video-pretrained-transformer is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of video-pretrained-transformer is Jupyter Notebook.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com