Article details

Research area
Speech enhancement

12. ITG Symposium on Speech Communication, Paderborn


Simon Graf, Tobias Herbig, Markus Buck, Gerhard Schmidt

Voice Activity Detection Based on Modulation-Phase Differences


Many speech processing algorithms rely on voice activity detection (VAD) that separates speech from noise. For this task, several features have been introduced that employ different characteristic properties of speech. In this contribution, we introduce a new feature that is robust against various types of noise. By considering an alternating excitation structure of low and high frequencies, speech is detected with a high confidence. The computationally low complex feature can cope even with the limited spectral resolution that is typical for in-car-communication systems. By combining the feature with a conventional modulation feature, the performance can be improved. Our simulations confirm the robustness of the feature and show the increasing performance compared to established VAD features.

Read/download now