Convolutional Recurrent Neural Networks for Bird Audio Detection
Abstract
Bird sounds possess distinctive spectral structure which may exhibit small shifts in spectrum depending on the bird species and environmental conditions. In this paper, we propose using convolutional recurrent neural networks on the task of automated bird audio detection in real-life environments. In the proposed method, convolutional layers extract high dimensional, local frequency shift invariant features, while recurrent layers capture longer term dependencies between the features extracted from short time frames. This method achieves 88.5% Area Under ROC Curve (AUC) score on the unseen evaluation data and obtains the second place in the Bird Audio Detection challenge.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2017
- DOI:
- 10.48550/arXiv.1703.02317
- arXiv:
- arXiv:1703.02317
- Bibcode:
- 2017arXiv170302317E
- Keywords:
-
- Computer Science - Sound;
- Computer Science - Machine Learning;
- Statistics - Machine Learning
- E-Print:
- Submitted to EUSIPCO 2017 Special Session on Bird Audio Signal Processing