Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Video-LLaVA Alternatives
Similar projects and alternatives to Video-LLaVA
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
-
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
FLaNK-Halifax
Community over Code, Apache NiFi, Apache Kafka, Apache Flink, Python, GTFS, Transit, Open Source, Open Data
-
krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
-
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
-
vectorflow
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice. (by dgarnitz)
-
CoC2023
Community over Code, Apache NiFi, Apache Kafka, Apache Flink, Python, GTFS, Transit, Open Source, Open Data
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Video-LLaVA reviews and mentions
- FLaNK Stack Weekly for 27 November 2023
-
Google Bard AI Now Has the Ability to Understand YouTube Videos
Bard can read images and, being the same company as YouTube, probably has access to high quality video embeddings they use for YouTube search, probably the most sophisticated video search engine on the planet. It could definitely be using the video content directly without a text representation.
For an open source project that actually “sees” videos you can check out https://github.com/PKU-YuanGroup/Video-LLaVA
-
Video-LLaVA
The related paper is here: https://arxiv.org/pdf/2311.10122.pdf
I think the TL;DR is "it can tell what's in the video and 'reason' about it"
-
A note from our sponsor - InfluxDB
www.influxdata.com | 5 May 2024
Stats
PKU-YuanGroup/Video-LLaVA is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of Video-LLaVA is Python.
Sponsored