Towards Cover Song Detection with Siamese Convolutional Neural Networks

doi:10.48550/arXiv.2005.10294

Towards Cover Song Detection with Siamese Convolutional Neural Networks

Stamenovic, Marko

A cover song, by definition, is a new performance or recording of a previously recorded, commercially released song. It may be by the original artist themselves or a different artist altogether and can vary from the original in unpredictable ways including key, arrangement, instrumentation, timbre and more. In this work we propose a novel approach to learning audio representations for the task of cover song detection. We train a neural architecture on tens of thousands of cover-song audio clips and test it on a held out set. We obtain a mean precision@1 of 65% over mini-batches, ten times better than random guessing. Our results indicate that Siamese network configurations show promise for approaching the cover song identification problem.

Publication:

arXiv e-prints

Pub Date:

May 2020

DOI:

10.48550/arXiv.2005.10294

arXiv:

arXiv:2005.10294

Bibcode:

2020arXiv200510294S

Keywords:

Electrical Engineering and Systems Science - Audio and Speech Processing;
Computer Science - Machine Learning;
Computer Science - Sound;
Statistics - Machine Learning

E-Print:

Code available at https://github.com/markostam/coversongs-dual-convnet

NASA/ADS

Towards Cover Song Detection with Siamese Convolutional Neural Networks

Abstract