Learning to Fuse Music Genres with Generative Adversarial Dual Learning

doi:10.48550/arXiv.1712.01456

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

FusionGAN is a novel genre fusion framework for music generation that integrates the strengths of generative adversarial networks and dual learning. In particular, the proposed method offers a dual learning extension that can effectively integrate the styles of the given domains. To efficiently quantify the difference among diverse domains and avoid the vanishing gradient issue, FusionGAN provides a Wasserstein based metric to approximate the distance between the target domain and the existing domains. Adopting the Wasserstein distance, a new domain is created by combining the patterns of the existing domains using adversarial learning. Experimental results on public music datasets demonstrated that our approach could effectively merge two genres.

Publication:

arXiv e-prints

Pub Date:

December 2017

DOI:

10.48550/arXiv.1712.01456

arXiv:

arXiv:1712.01456

Bibcode:

2017arXiv171201456C

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Multimedia;
Computer Science - Sound;
Electrical Engineering and Systems Science - Audio and Speech Processing

E-Print:

International Conference on Data Mining - New Orleans, 2017

NASA/ADS

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

Abstract