Drysdale, Jake, Tomczak, Maciej and Hockman, Jason (2021) Style-based drum synthesis with GAN inversion. In: 22nd International Society of Music Information Retrieval Conference, 07 November 2021 - 12 November 2021, Online.
|
Published Version
Available under License Creative Commons Attribution. Download (1MB) | Preview |
Abstract
Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that generate audio from hand-designed components such as oscillators and wavetables. For a neural audio synthesizer to be applicable to music creation, meaningful control over the output is essential. This paper provides an overview of an unsupervised approach to deriving useful feature controls learned by a generative model. A system for generation and transformation of drum samples using a style-based generative adversarial network (GAN) is proposed. The system provides functional control of audio style features, based on principal component analysis (PCA) applied to the intermediate latent space. Additionally, we propose the use of an encoder trained to invert input drums back to the latent space of the pre-trained GAN. We experiment with three modes of control and provide audio results on a supporting website.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.