Institute of Electrical and Electronics Engineers, IEEE Aerospace and Electronic Systems Magazine, 1(22), p. 15-21, 2007
Automatic speaker recognition has been largely dominated by acoustic-spectral systems, which rely on proper modelling of the short-term vocal tract of speakers. However, there is scientific and intuitive evidence that speaker-specific information is embedded in the speech signal in multiple short- and long-term characteristics. In this work, a multilevel speaker recognition system combining acoustic, phonotactic, and prosodic subsystems is presented and assessed by blind submission to the NIST 2005 Speaker Recognition Evaluation.