Lungmix: A Mixup-Based Strategy for Generalization in Respiratory Sound Classification

doi:10.48550/arXiv.2501.00064

Respiratory sound classification plays a pivotal role in diagnosing respiratory diseases. While deep learning models have shown success with various respiratory sound datasets, our experiments indicate that models trained on one dataset often fail to generalize effectively to others, mainly due to data collection and annotation inconsistencies. To address this limitation, we introduce Lungmix, a novel data augmentation technique inspired by Mixup. Lungmix generates augmented data by blending waveforms using loudness and random masks while interpolating labels based on their semantic meaning, helping the model learn more generalized representations. Comprehensive evaluations across three datasets, namely ICBHI, SPR, and HF, demonstrate that Lungmix significantly enhances model generalization to unseen data. In particular, Lungmix boosts the 4-class classification score by up to 3.55%, achieving performance comparable to models trained directly on the target dataset.
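
The abstract does not spell out how Lungmix is computed, so the following is only a rough sketch of the general idea, not the authors' implementation. The first function is standard Mixup (a Beta-distributed weight applied to both inputs and one-hot labels). The second is a hypothetical variant that assumes a random per-sample binary mask and treats the four respiratory classes (normal, crackle, wheeze, both) as two flags combined by logical OR, which is one possible reading of "interpolating labels based on their semantic meaning". The function names, the `alpha` and `keep_prob` parameters, and the label rule are all assumptions, and the loudness weighting mentioned in the abstract is omitted because its form is not given here.

```python
import numpy as np

# 4-class labels encoded as (crackle, wheeze) flags. Assumed encoding.
CLASS_FLAGS = {"normal": (0, 0), "crackle": (1, 0), "wheeze": (0, 1), "both": (1, 1)}
FLAGS_CLASS = {flags: name for name, flags in CLASS_FLAGS.items()}


def mixup(x_a, x_b, y_a, y_b, alpha=0.4, rng=None):
    """Standard Mixup: one Beta-sampled weight blends inputs and one-hot labels."""
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)
    x = lam * x_a + (1.0 - lam) * x_b
    y = lam * y_a + (1.0 - lam) * y_b
    return x, y, lam


def masked_mix(x_a, x_b, label_a, label_b, keep_prob=0.5, rng=None):
    """Hypothetical mask-based mix (NOT confirmed by the abstract).

    A random binary mask takes each sample from x_a or x_b, and the new
    label is the union of the abnormal sounds present in either input.
    """
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(x_a.shape[0]) < keep_prob  # True -> keep the x_a sample
    mixed = np.where(mask, x_a, x_b)
    flags = tuple(max(a, b) for a, b in zip(CLASS_FLAGS[label_a], CLASS_FLAGS[label_b]))
    return mixed, FLAGS_CLASS[flags]
```

Under this assumed label rule, mixing a crackle recording with a wheeze recording yields the label "both", whereas standard Mixup would produce a soft label split between the two classes.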

Publication:

arXiv e-prints

Pub Date:

December 2024

DOI:

10.48550/arXiv.2501.00064

arXiv:

arXiv:2501.00064

Bibcode:

2025arXiv250100064G

Keywords:

Computer Science - Sound;
Computer Science - Machine Learning;
Electrical Engineering and Systems Science - Audio and Speech Processing

E-Print:

4 pages, 3 figures, conference paper
