Generating Sound with Neural Networks
Learn how to generate sound from audio files 🎧 🎧 using Variational Autoencoders.
Sound Generation with Neural Networks - INTRO
Sound Generation with Deep Learning - Approaches and Challenges
0:00 Intro
0:33 Defining the sound generation task
1:17 Classification of sound generation systems
2:14 Types of generated sounds
3:41 Sound representations
4:07 Generation from raw audio
7:40 Challenges of raw audio generation
10:21 Generation from spectrograms
16:12 Advantages of generation from spectrograms
18:07 Challenges of generation from spectrograms
20:26 Can we generate sound with MFCCs?
21:26 DL architectures for sound generation
22:13 Inputs for generation
24:03 Details about the sound generative system we'll build
24:44 What's next?
Mentioned papers:
- WaveNet: A Generative Model for Raw Audio
https://arxiv.org/pdf/1609.03499.pdf
- Jukebox: A Generative Model for Music
https://arxiv.org/pdf/2005.00341
- DrumGAN: Synthesis of Drum Sounds with Timbral Feature Conditioning Using Generative Adversarial Networks
https://arxiv.org/pdf/2008.12073
- MelNet: A Generative Model for Audio in the Frequency Domain
https://arxiv.org/pdf/1906.01083.pdf
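The video above contrasts raw audio, spectrograms, and MFCCs as candidate sound representations. As a rough companion illustration, here is a minimal sketch of how each is computed with librosa; the file name, sample rate, FFT size, and hop length are placeholder assumptions, not values taken from the video.

import librosa
import numpy as np

# Placeholder path; replace with a real audio file.
signal, sr = librosa.load("example.wav", sr=22050)         # raw audio: 1-D array of samples

# Spectrogram: STFT magnitudes, usually inspected on a log (dB) scale.
stft = librosa.stft(signal, n_fft=512, hop_length=256)     # complex matrix (freq bins x frames)
log_spectrogram = librosa.amplitude_to_db(np.abs(stft))    # what a spectrogram-based model would see

# MFCCs: a far more compressed, lossy representation.
mfccs = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=13)

print(signal.shape, log_spectrogram.shape, mfccs.shape)

Raw audio keeps every sample (tens of thousands of values per second), the log spectrogram is a 2-D time-frequency image, and MFCCs compress each frame down to a handful of coefficients, which is why they are hard to invert back to sound.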
Autoencoders Explained Easily
How to Implement Autoencoders in Python and Keras || The Encoder
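As a companion to the encoder video, here is a minimal functional-API sketch of a convolutional encoder that compresses an image-like input down to a small latent vector. The input shape, filter counts, and latent dimension are illustrative assumptions, not the exact values used in the video.

from tensorflow import keras
from tensorflow.keras import layers

LATENT_DIM = 2  # size of the bottleneck (assumed)

def build_encoder(input_shape=(28, 28, 1), latent_dim=LATENT_DIM):
    # Convolutional stack that compresses the input down to a small latent vector.
    inputs = keras.Input(shape=input_shape, name="encoder_input")
    x = layers.Conv2D(32, 3, strides=1, padding="same", activation="relu")(inputs)
    x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)   # 28 -> 14
    x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)   # 14 -> 7
    x = layers.Flatten()(x)
    bottleneck = layers.Dense(latent_dim, name="encoder_output")(x)
    return keras.Model(inputs, bottleneck, name="encoder")

encoder = build_encoder()
encoder.summary()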
How to Implement Autoencoders in Python and Keras - The Decoder
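The decoder mirrors the encoder: a dense layer re-expands the latent vector, and transposed convolutions upsample back to the original shape. Again a sketch, with dimensions assumed to match the encoder sketch above.

from tensorflow import keras
from tensorflow.keras import layers

LATENT_DIM = 2

def build_decoder(latent_dim=LATENT_DIM, channels=1):
    # Mirror of the encoder: Dense -> Reshape -> transposed convolutions.
    inputs = keras.Input(shape=(latent_dim,), name="decoder_input")
    x = layers.Dense(7 * 7 * 64, activation="relu")(inputs)
    x = layers.Reshape((7, 7, 64))(x)
    x = layers.Conv2DTranspose(64, 3, strides=2, padding="same", activation="relu")(x)  # 7 -> 14
    x = layers.Conv2DTranspose(32, 3, strides=2, padding="same", activation="relu")(x)  # 14 -> 28
    outputs = layers.Conv2DTranspose(channels, 3, padding="same",
                                     activation="sigmoid", name="decoder_output")(x)
    return keras.Model(inputs, outputs, name="decoder")

decoder = build_decoder()
decoder.summary()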
Building and Training an Autoencoder in Keras + TensorFlow + Python
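Chaining the two halves gives the full autoencoder, which is trained to reconstruct its own input. The sketch below reuses the build_encoder() and build_decoder() helpers from the sketches above, and MNIST is used purely as a convenient stand-in dataset; the optimizer, batch size, and epoch count are assumptions.

import numpy as np
from tensorflow import keras

# Reuses build_encoder() and build_decoder() from the previous sketches.
encoder = build_encoder()
decoder = build_decoder()

inputs = keras.Input(shape=(28, 28, 1))
autoencoder = keras.Model(inputs, decoder(encoder(inputs)), name="autoencoder")
autoencoder.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-4), loss="mse")

# Stand-in dataset: pixels scaled to [0, 1], channel axis added.
(x_train, _), _ = keras.datasets.mnist.load_data()
x_train = x_train.astype("float32")[..., np.newaxis] / 255.0

# Input and target are the same array: the model learns to reproduce its input.
autoencoder.fit(x_train, x_train, batch_size=32, epochs=10, shuffle=True)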
Saving the Autoencoder in Keras
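The exact save format used in the video is not reproduced here; the sketch below simply applies Keras' standard whole-model and weights-only saving to the autoencoder variable from the previous sketch.

from tensorflow import keras

# `autoencoder` is the trained model from the previous sketch.
autoencoder.save("autoencoder.keras")                   # architecture + weights + optimizer state
restored = keras.models.load_model("autoencoder.keras")
restored.summary()

# Alternatively, persist only the weights (the architecture must then be rebuilt in code).
autoencoder.save_weights("autoencoder.weights.h5")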
Generation with Autoencoders: Results and Limitations
From Autoencoders to Variational Autoencoders: Improving the Encoder
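The change on the encoder side is that it no longer outputs a single latent point but the mean and log-variance of a Gaussian, from which the latent vector is drawn with the reparameterization trick. A sketch of that modification, with dimensions again assumed:

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

LATENT_DIM = 2

class Sampling(layers.Layer):
    # Reparameterization trick: z = mu + sigma * epsilon, with epsilon ~ N(0, I).
    def call(self, inputs):
        mu, log_var = inputs
        epsilon = tf.random.normal(shape=tf.shape(mu))
        return mu + tf.exp(0.5 * log_var) * epsilon

def build_vae_encoder(input_shape=(28, 28, 1), latent_dim=LATENT_DIM):
    inputs = keras.Input(shape=input_shape)
    x = layers.Conv2D(32, 3, strides=2, padding="same", activation="relu")(inputs)
    x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)
    x = layers.Flatten()(x)
    mu = layers.Dense(latent_dim, name="mu")(x)
    log_var = layers.Dense(latent_dim, name="log_var")(x)
    z = Sampling()([mu, log_var])
    return keras.Model(inputs, [mu, log_var, z], name="vae_encoder")

vae_encoder = build_vae_encoder()
vae_encoder.summary()

Sampling through mu + sigma * epsilon rather than drawing directly from the distribution keeps the operation differentiable, so gradients can flow back into the layers that produce mu and log_var.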
From Autoencoders to Variational Autoencoders: Improving the Loss Function
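On the loss side, the reconstruction error is combined with a KL-divergence term that pulls the learned latent distributions towards a standard normal. A sketch of such a combined loss; the reconstruction weight alpha is an assumed hyperparameter that would need tuning in practice.

import tensorflow as tf

def vae_loss(x, x_reconstructed, mu, log_var, alpha=1000.0):
    # Per-example reconstruction error (MSE over height, width, channels).
    reconstruction = tf.reduce_mean(tf.square(x - x_reconstructed), axis=[1, 2, 3])
    # KL divergence between N(mu, sigma^2) and the standard normal prior N(0, I).
    kl = -0.5 * tf.reduce_sum(1.0 + log_var - tf.square(mu) - tf.exp(log_var), axis=1)
    return tf.reduce_mean(alpha * reconstruction + kl)

# Quick check with dummy tensors shaped (batch, height, width, channels) and (batch, latent_dim).
x = tf.random.uniform((8, 28, 28, 1))
x_hat = tf.random.uniform((8, 28, 28, 1))
mu = tf.zeros((8, 2))
log_var = tf.zeros((8, 2))
print(float(vae_loss(x, x_hat, mu, log_var)))

Weighting the two terms matters: too little reconstruction weight and the outputs blur, too little KL weight and the latent space stops being well organized for sampling.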
How to Implement a Variational Autoencoder in Python and Keras
Preprocessing Audio Datasets for Machine Learning
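A preprocessing pipeline along these lines loads each audio file, pads or truncates it to a fixed duration, converts it to a log spectrogram, min-max normalizes it, and saves the result as an array. The directory layout, duration, and STFT settings below are assumptions, and the helper names are hypothetical.

import os
import numpy as np
import librosa

SAMPLE_RATE = 22050
DURATION = 0.74                      # seconds kept per file (assumed)
N_FFT, HOP_LENGTH = 512, 256         # gives roughly a 256 x 64 log spectrogram per clip

def preprocess_file(path, n_samples=int(SAMPLE_RATE * DURATION)):
    # Audio file -> fixed-size, [0, 1]-normalized log spectrogram.
    signal, _ = librosa.load(path, sr=SAMPLE_RATE)
    signal = librosa.util.fix_length(signal, size=n_samples)               # pad or truncate
    spectrogram = np.abs(librosa.stft(signal, n_fft=N_FFT, hop_length=HOP_LENGTH))
    log_spec = librosa.amplitude_to_db(spectrogram)[:-1]                   # drop the Nyquist bin -> 256 rows
    # Note: to invert generated spectrograms back to audio later, the per-file
    # min and max used below would also need to be stored.
    return ((log_spec - log_spec.min()) / (log_spec.max() - log_spec.min())).astype("float32")

def preprocess_dataset(audio_dir, out_dir):
    os.makedirs(out_dir, exist_ok=True)
    for name in os.listdir(audio_dir):
        if name.endswith(".wav"):
            features = preprocess_file(os.path.join(audio_dir, name))
            np.save(os.path.join(out_dir, name.replace(".wav", ".npy")), features)

# preprocess_dataset("recordings/", "spectrograms/")   # hypothetical paths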
Training a VAE with Speech Data in Keras
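Putting the pieces together: a compact VAE in Keras trained to reconstruct the preprocessed spectrograms. The shapes, latent size, KL weight, and training hyperparameters below are assumptions, and random arrays stand in for real spectrogram files so the sketch runs as written.

import numpy as np
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

LATENT_DIM = 16
INPUT_SHAPE = (256, 64, 1)   # (frequency bins, frames, channels) of the preprocessed spectrograms - assumed

class Sampling(layers.Layer):
    # Reparameterization trick; also registers a weighted KL term via add_loss.
    def __init__(self, kl_weight=1e-3, **kwargs):
        super().__init__(**kwargs)
        self.kl_weight = kl_weight

    def call(self, inputs):
        mu, log_var = inputs
        kl = -0.5 * tf.reduce_mean(
            tf.reduce_sum(1.0 + log_var - tf.square(mu) - tf.exp(log_var), axis=1))
        self.add_loss(self.kl_weight * kl)          # balanced against the MSE reconstruction loss
        epsilon = tf.random.normal(shape=tf.shape(mu))
        return mu + tf.exp(0.5 * log_var) * epsilon

# Encoder
inputs = keras.Input(shape=INPUT_SHAPE)
x = layers.Conv2D(32, 3, strides=2, padding="same", activation="relu")(inputs)
x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)
x = layers.Flatten()(x)
mu = layers.Dense(LATENT_DIM)(x)
log_var = layers.Dense(LATENT_DIM)(x)
z = Sampling()([mu, log_var])

# Decoder (mirror of the encoder)
x = layers.Dense(64 * 16 * 64, activation="relu")(z)
x = layers.Reshape((64, 16, 64))(x)
x = layers.Conv2DTranspose(64, 3, strides=2, padding="same", activation="relu")(x)
x = layers.Conv2DTranspose(32, 3, strides=2, padding="same", activation="relu")(x)
outputs = layers.Conv2DTranspose(1, 3, padding="same", activation="sigmoid")(x)

vae = keras.Model(inputs, outputs, name="vae")
vae.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-4), loss="mse")

# Random arrays stand in for the .npy spectrograms produced by the preprocessing sketch;
# in practice: x_train = np.stack([np.load(p)[..., np.newaxis] for p in spectrogram_paths])
x_train = np.random.rand(128, *INPUT_SHAPE).astype("float32")
vae.fit(x_train, x_train, batch_size=16, epochs=5, shuffle=True)

Once trained, new sounds come from sampling latent vectors (or interpolating between encodings of real clips), decoding them into spectrograms, undoing the normalization, and inverting the spectrograms back to audio.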