Analyzing features for automatic age estimation on cross-sectional data.

Spiegl, Werner; Stemmer, Georg; Lasarcyk, Eva; Kolhatkar, Varada; Cassidy, Andrew; Potard, Blaise; Shum, Stephen; Song, Young Chol; Xu, Puyang; Beyerlein, Peter; Harnsberger, James D.; Nöth, Elmar

Published in

Interspeech 2009, 2009

DOI: 10.21437/interspeech.2009-740

Tools

Export citation

Search in Google Scholar

Analyzing features for automatic age estimation on cross-sectional data.

Proceedings article published in 2009 by Werner Spiegl, Georg Stemmer, Eva Lasarcyk, Varada Kolhatkar, Andrew Cassidy, Blaise Potard, Stephen Shum, Young Chol Song, Puyang Xu, Peter Beyerlein, James D. Harnsberger, Elmar Nöth

This paper is available in a repository.

Full text: Download

Preprint: policy unknown

Upload

Postprint: policy unknown

Upload

Published version: policy unknown

Upload

Abstract

We develop an acoustic feature set for the estimation of a per- son's age from a recorded speech signal. The baseline features are Mel-frequency cepstral coefficients (MFCCs) which are ex- tended by various prosodic features, pitch and formant frequen- cies. From experiments on the University of Florida Vocal Ag- ing Database we can draw different conclusions. On the one hand, adding prosodic, pitch and formant features to the MFCC baseline leads to relative reductions of the mean absolute error between 4-20%. Improvements are even larger when percep- tual age labels are taken as a reference. On the other hand, reasonable results with a mean absolute error in age estimation of about 12 years are already achieved using a simple gender- independent setup and MFCCs only. Future experiments will evaluate the robustness of the prosodic features against channel variability on other databases and investigate the differences be- tween perceptual and chronological age labels. Index Terms: Age regression, age estimation, vocal aging, prosodic features, support vector regression (SVR)

Published in

Links

Tools

Analyzing features for automatic age estimation on cross-sectional data.

Abstract