Score and Lyrics-Free Singing Voice Generation

doi:10.48550/arXiv.1912.11747

Generative models for singing voice have mostly addressed the task of "singing voice synthesis," i.e., producing singing voice waveforms given musical scores and text lyrics. In this work, we explore a novel yet challenging alternative: singing voice generation without pre-assigned scores and lyrics, at both training and inference time. In particular, we outline three such generation schemes and propose a pipeline to tackle these new tasks. Moreover, we implement such models using generative adversarial networks and evaluate them both objectively and subjectively.

Publication: arXiv e-prints

Pub Date: December 2019

DOI: 10.48550/arXiv.1912.11747

arXiv: arXiv:1912.11747

Bibcode: 2019arXiv191211747L

Keywords: Computer Science - Sound; Computer Science - Machine Learning; Electrical Engineering and Systems Science - Audio and Speech Processing

E-Print: Accepted by International Conference on Computational Creativity (ICCC) 2020
