"Taming Visually Guided Sound Generation". Quickly generate audio matching a given video. Code includes a Google Colab.

This page summarizes the projects mentioned and recommended in the original post on /r/MediaSynthesis

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • SpecVQGAN

    Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

  • pollinations

    Generate Art

  • I have already added it to the site pollinations.ai (a site I'm working on with friends to make ml art more approachable),

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Netflix Queen Elizabeth generated by Chat GPT

    1 project | /r/HolUp | 11 Dec 2023
  • IT CAN MAKE IMAGES

    1 project | /r/ChatGPT | 1 Jul 2023
  • I run a free Stable Diffusion bot. I have fun trying to prevent people from overloading it with porn. This time I added (hairy gorilla:1.2) to the prompt when a mature word is detected.

    4 projects | /r/StableDiffusion | 20 Jun 2023
  • Immersive text based adventure prompt to explore the imaginary internet of an alternate universe

    1 project | /r/ChatGPT | 10 May 2023
  • Text-to-Audio Generation Using Instruction Tuned LLM and Latent Diffusion Model

    1 project | news.ycombinator.com | 28 Apr 2023