-
tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I've been working on an open source implementation for the past month. The link for it can be found here. But have since been struggling with a load of mode collapse - or the model outputting "blurry" spectrograms not quite capturing the same initial structure as they should.
You also set biases to false? At least in CoGAN there are biases. https://github.com/mingyuliutw/CoGAN/blob/master/cogan/models/celeba/celeba.train.ptt
Hi just an update to whom ever faces a similar issue. I've fixed it.The key change was redefining the encoder loss - where kld is defined without logvar (only mu) and the reconstruction loss is defined with L1 rather than MSE. This was a helpful resource on the matter https://github.com/eriklindernoren/PyTorch-GAN/blob/master/implementations/unit/unit.py
Related posts
-
Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping
-
Show HN: FileKitty – Combine and label text files for LLM prompt contexts
-
Effortlessly Create an AI Dungeon Master Bot Using Julep and Chainlit
-
An Exploration of Software-defined networks in video streaming, Part Three: Performance of a streaming system over a SDN
-
Clasificador de imágenes con una red neuronal convolucional (CNN)