Published in

2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

DOI: 10.1109/icassp.2003.1202336

2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)

DOI: 10.1109/icme.2003.1221284

Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech

This paper is available in a repository.

Preprint: archiving allowed
Postprint: archiving allowed
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

This paper proposes a support vector machine (SVM) based fusion scheme that combines idiolectal and acoustic characteristics for speaker recognition. Two statistical modeling paradigms, Gaussian mixture models (GMMs) for acoustic modeling and bigrams for language modeling, provide multilevel speaker information that yields better classification performance when fused with an SVM. This fusion approach is useful for any speaker recognition task where a considerable amount of data is available. Because no Spanish database suitable for these experiments was available, more than nine hours of Spanish conversational speech were collected and manually transcribed from broadcast radio talk shows.
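
The score-level fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not state the kernel, the input features, or the training procedure, so everything below is an assumption made for illustration: each trial is reduced to two scores (one from an acoustic GMM, one from a bigram language model), the scores are synthetic rather than taken from the paper's corpus, and the SVM is a linear one trained by batch subgradient descent on the regularized hinge loss. The names `make_trials`, `train_linear_svm`, `fuse`, and `accuracy` are hypothetical helpers.

```python
import random


def make_trials(n_per_class, seed=0):
    """Generate synthetic (acoustic_score, idiolect_score) pairs.

    Assumption: these are NOT data from the paper. Target-speaker trials
    (label +1) are centred at (1, 1), impostor trials (label -1) at (-1, -1).
    """
    rng = random.Random(seed)
    trials = []
    for label, mean in ((+1, 1.0), (-1, -1.0)):
        for _ in range(n_per_class):
            scores = (rng.gauss(mean, 0.6), rng.gauss(mean, 0.6))
            trials.append((scores, label))
    return trials


def train_linear_svm(trials, lam=0.01, lr=0.1, epochs=300):
    """Fit a linear SVM by batch subgradient descent on the hinge loss.

    Minimises lam/2 * ||w||^2 + mean(max(0, 1 - y * (w.x + b))).
    """
    w = [0.0, 0.0]
    b = 0.0
    n = len(trials)
    for _ in range(epochs):
        grad_w = [lam * w[0], lam * w[1]]  # gradient of the regulariser
        grad_b = 0.0
        for x, y in trials:
            margin = y * (w[0] * x[0] + w[1] * x[1] + b)
            if margin < 1.0:  # only margin violators contribute
                grad_w[0] -= y * x[0] / n
                grad_w[1] -= y * x[1] / n
                grad_b -= y / n
        w = [w[0] - lr * grad_w[0], w[1] - lr * grad_w[1]]
        b -= lr * grad_b
    return w, b


def fuse(w, b, scores):
    """Fused decision score: positive means 'accept as target speaker'."""
    return w[0] * scores[0] + w[1] * scores[1] + b


def accuracy(w, b, trials):
    """Fraction of trials whose fused decision matches the label."""
    correct = sum(1 for x, y in trials if (fuse(w, b, x) > 0) == (y > 0))
    return correct / len(trials)


if __name__ == "__main__":
    train = make_trials(100, seed=0)
    held_out = make_trials(100, seed=1)
    w, b = train_linear_svm(train)
    print("fusion weights:", w, "bias:", b)
    print("held-out accuracy:", accuracy(w, b, held_out))
```

In this sketch the learned weights decide how much each information source contributes to the final decision. A nonlinear kernel or additional inputs could be substituted without changing the overall structure.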