Phase-Aware Single-Channel Speech Enhancement with Modulation-Domain Kalman Filtering

doi:10.48550/arXiv.1708.02171

We present a single-channel phase-sensitive speech enhancement algorithm based on modulation-domain Kalman filtering and on tracking the speech phase using circular statistics. Exploiting the fact that speech and noise are additive in the complex STFT domain, the Kalman filter tracks the speech log-spectrum, the noise log-spectrum and the speech phase, so that the speech amplitude and phase are estimated jointly. Conventional algorithms reconstruct the signal with the noisy phase, that is, they approximate the speech phase by the noisy phase. In the proposed algorithm, the speech phase posterior is instead used to create an enhanced speech phase spectrum for signal reconstruction. The Kalman filter prediction step models the temporal (inter-frame) correlation of the speech and noise log-spectra and of the speech phase, while the update step models their nonlinear relations. Speech is thus tracked and estimated in both the log-spectral and spectral phase domains. The algorithm is evaluated in terms of speech quality, and different algorithm configurations, which depend on the signal model, are compared in different noise types. Experimental results show that the proposed algorithm outperforms traditional enhancement algorithms over a range of SNRs for various noise types.
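The additive model and the noisy-phase approximation mentioned in the abstract can be illustrated for a single STFT bin. The sketch below is not the paper's algorithm and contains no Kalman filter. It only assumes that the noisy coefficient is the complex sum of a speech and a noise coefficient, and it shows (a) the nonlinear relation between the three amplitudes and the phase difference that follows from that sum, and (b) the residual error left when the true speech amplitude is combined with the noisy phase. All function names and the example amplitudes (1.0 for speech, 0.5 for noise) are hypothetical choices for illustration.

```python
import cmath
import math
import random


def noisy_phase_reconstruction(speech, noise):
    """Combine the true speech amplitude with the noisy phase for one STFT bin."""
    noisy = speech + noise  # additivity in the complex STFT domain
    return abs(speech) * cmath.exp(1j * cmath.phase(noisy))


def amplitude_from_law_of_cosines(speech, noise):
    """Noisy amplitude implied by the speech/noise amplitudes and phase difference."""
    a, n = abs(speech), abs(noise)
    delta = cmath.phase(speech) - cmath.phase(noise)
    return math.sqrt(a * a + n * n + 2.0 * a * n * math.cos(delta))


# Hypothetical single-bin example: unit speech amplitude, noise amplitude 0.5,
# both with phases drawn uniformly at random (seeded for repeatability).
rng = random.Random(0)
speech = cmath.rect(1.0, rng.uniform(-math.pi, math.pi))
noise = cmath.rect(0.5, rng.uniform(-math.pi, math.pi))
noisy = speech + noise

# The nonlinear amplitude relation agrees with the additive complex model.
assert math.isclose(abs(noisy), amplitude_from_law_of_cosines(speech, noise))

# Even with the exact speech amplitude, the noisy phase leaves a residual.
estimate = noisy_phase_reconstruction(speech, noise)
phase_error = abs(cmath.phase(estimate / speech))  # wrapped phase difference
residual = abs(estimate - speech)
```

Because the estimate and the speech coefficient have the same amplitude here, the residual equals 2|S| sin(phase_error / 2), so it vanishes only when the noisy phase happens to coincide with the speech phase. This is the gap that a phase estimate better than the noisy phase is meant to reduce.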

Publication:

arXiv e-prints

Pub Date:

August 2017

DOI:

10.48550/arXiv.1708.02171

arXiv:

arXiv:1708.02171

Bibcode:

2017arXiv170802171D

Keywords:

Computer Science - Sound

E-Print:

13 pages, 17 figures, Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing
