The Effects of Noisy Labels on Deep Convolutional Neural Networks for Music Tagging

doi:10.48550/arXiv.1706.02361

The Effects of Noisy Labels on Deep Convolutional Neural Networks for Music Tagging

Deep neural networks (DNN) have been successfully applied to music classification including music tagging. However, there are several open questions regarding the training, evaluation, and analysis of DNNs. In this article, we investigate specific aspects of neural networks, the effects of noisy labels, to deepen our understanding of their properties. We analyse and (re-)validate a large music tagging dataset to investigate the reliability of training and evaluation. Using a trained network, we compute label vector similarities which is compared to groundtruth similarity. The results highlight several important aspects of music tagging and neural networks. We show that networks can be effective despite relatively large error rates in groundtruth datasets, while conjecturing that label noise can be the cause of varying tag-wise performance differences. Lastly, the analysis of our trained network provides valuable insight into the relationships between music tags. These results highlight the benefit of using data-driven methods to address automatic music tagging.

Publication:

arXiv e-prints

Pub Date:

June 2017

DOI:

10.48550/arXiv.1706.02361

arXiv:

arXiv:1706.02361

Bibcode:

2017arXiv170602361C

Keywords:

Computer Science - Information Retrieval;
Computer Science - Machine Learning;
Computer Science - Multimedia;
Computer Science - Sound

E-Print:

The section that overlapped with arXiv:1709.01922 is completely removed since the earlier version. This is the camera-ready version

NASA/ADS

The Effects of Noisy Labels on Deep Convolutional Neural Networks for Music Tagging

Abstract