Voice source estimation for artificial bandwidth extension of telephone speech

Thomas, Mark R. P.; Gudnason, Jon; Naylor, Patrick A.; Geiser, Bernd; Vary, Peter

Published in

2010 IEEE International Conference on Acoustics, Speech and Signal Processing

DOI: 10.1109/icassp.2010.5495149

Tools

Export citation

Search in Google Scholar

Voice source estimation for artificial bandwidth extension of telephone speech

Proceedings article published in 2010 by Mark R. P. Thomas, Jon Gudnason

, Patrick A. Naylor, Bernd Geiser, Peter Vary

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Artificial bandwidth extension (ABWE) of speech signals aims to estimate wideband speech (50 Hz - 7 kHz) from narrowband signals (300 Hz - 3.4 kHz). Applying the source-filter model of speech, many existing algorithms estimate vocal tract filter parameters independently of the source signal. However, many current methods for extending the narrowband voice source signal are limited to straightforward signal processing techniques which are only effective for high-band estimation. This paper presents a method for ABWE that employs novel data-driven modelling and an existing spectral mirroring technique to estimate the wideband source signal in both the high and low extension bands. A state-of-the-art Hidden Markov Model-based estimator evaluates the temporal and spectral envelopes in the missing frequency bands, with which the ABWE speech signal is synthesized. Informal listening tests comparing two existing source estimation techniques and two permutations of the proposed approach show an improvement in the perceived bandwidth of speech signals, in particular towards low frequencies. Subjective tests on the same data show a preference for the proposed techniques over the existing methods under test.

Published in

Links

Tools

Voice source estimation for artificial bandwidth extension of telephone speech

Abstract