Article details

Research area
Speech enhancement

ESSV 2018


Jonas Sautter, Friedrich Faubel, Markus Buck, Gerhard Schmidt

Evaluation of Different Excitation Generation Algorithms for Artificial Bandwidth Extension


Artificial bandwidth extension (ABWE) for speech signals remains an important topic in mobile telephony, especially when a 16 kHz wideband (WB) call suddenly falls back to an 8 kHz GSM connection. The aim of ABWE is to bridge the resulting gap in voice quality by reconstructing the WB signal. To achieve this, the speech signal is typically decomposed into a spectral envelope and an excitation signal, which are then extended separately. While algorithms for envelope extension are becoming increasingly sophisticated, excitation generation is still often performed with rudimentary methods such as spectral folding (SF) or spectral shifting (SS). These methods can introduce audible artifacts, especially for speech signals in which the pitch frequency varies strongly. To reduce these artifacts, we introduce an algorithm that shifts parts of the spectrum multiple times by a smaller frequency shift. Additionally, we investigate whether the speech quality can be further improved by interpolating the extended excitation signal with white noise. This is motivated by the fact that, for real WB signals, the SNR of the harmonic excitation decreases towards higher frequencies. The performance of the proposed algorithm is evaluated and compared to that of spectral folding and spectral shifting.
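The two baseline methods named in the abstract, and the interpolation with white noise, can be illustrated with a minimal sketch. This is not the authors' implementation and does not reproduce the proposed multiple-shift algorithm, whose details the abstract does not give. The function names, the mixing weight `alpha`, and the test tones in the usage note are assumptions made for illustration. Folding is realized here as zero-insertion upsampling and shifting as cosine modulation; a practical system would typically filter the result afterwards so that only the extension band is kept.

```python
import cmath
import math
import random


def spectral_folding(nb_excitation):
    """Upsample by 2 via zero insertion.

    Inserting a zero after every sample doubles the sampling rate and
    mirrors the narrowband spectrum around the old Nyquist frequency
    (4 kHz when going from 8 kHz to 16 kHz).
    """
    wb = [0.0] * (2 * len(nb_excitation))
    wb[::2] = nb_excitation
    return wb


def spectral_shift(excitation, shift_hz, fs):
    """Modulate with a cosine of frequency shift_hz.

    The product contains copies of the input spectrum moved up and down
    by shift_hz. The factor 2 keeps the amplitude of each copy equal to
    the amplitude of the original component.
    """
    return [
        2.0 * x * math.cos(2.0 * math.pi * shift_hz * n / fs)
        for n, x in enumerate(excitation)
    ]


def mix_with_noise(excitation, alpha, seed=0):
    """Linearly interpolate between the excitation and white Gaussian noise.

    alpha = 0 returns the excitation unchanged, alpha = 1 returns pure
    noise. A single broadband weight is an assumption of this sketch.
    """
    rng = random.Random(seed)
    return [(1.0 - alpha) * x + alpha * rng.gauss(0.0, 1.0) for x in excitation]


def dft_magnitude(signal, freq_hz, fs):
    """Normalized DFT magnitude of the signal at a single frequency."""
    acc = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * n / fs)
        for n, x in enumerate(signal)
    )
    return abs(acc) / len(signal)


def tone(freq_hz, fs, num_samples):
    """Unit-amplitude cosine test signal (hypothetical stand-in for an excitation)."""
    return [math.cos(2.0 * math.pi * freq_hz * n / fs) for n in range(num_samples)]
```

As a usage example, folding a 1 kHz tone sampled at 8 kHz with `spectral_folding(tone(1000, 8000, 160))` yields a 16 kHz signal with components at 1 kHz and at the mirrored frequency of 7 kHz, while `spectral_shift(tone(1000, 16000, 320), 4000, 16000)` moves the tone to 3 kHz and 5 kHz. Because folding mirrors and shifting translates the harmonic structure, neither is guaranteed to place the generated harmonics at multiples of the pitch frequency, which is one source of the artifacts mentioned above.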
