View on GitHub

VAELoopDemo

audio samples generated by VAE-Loop

Paper Info

Demo Description:

Audio samples for each z are generated by the same latent variable sampled from normal distribution. Note that we did not use any speaker or speaking style label for both training and generating.

VCTK

z_1

0.5z_1 + 0.5z_2

0.3z_1 + 0.7z_2

0.2z_1 + 0.8z_2

z_2

Blizzard2012

z_1

z_2

0.5z_1 + 0.5z_2

z_3