Interaction of long-term acoustic experience and local context information on the perceptual accommodation of talker variability

Zhang, Caicai; Peng, Gang; Wang, William Shi-Yuan

Published in

The Journal of the Acoustical Society of America (Acoustical Society of America), 133(5), p. 3341

DOI: 10.1121/1.4805640

Journal article published in 2013 by Caicai Zhang, Gang Peng, William Shi-Yuan Wang

Abstract

How do listeners recover speech content from acoustic signals, given the immense variability between talkers? In this study, two experiments were conducted on Cantonese level tones, comparing the perception of multi-talker speech stimuli in isolation and within a speech context. Without prior knowledge of a talker's pitch range, listeners resort to the population-average pitch range as a default reference for perception. This effect is attested by the significant correlation between the distance from the population-average pitch range and identification accuracy in the isolation condition (r = -0.24, p < 0.01): the closer a talker's pitch range is to the population average, the higher the identification accuracy. The population-average reference is gender-specific, showing separate accommodation scales for female and male talkers. Such a default reference is presumably built from one's long-term acoustic experience, reflecting the dense distribution of talkers in a community whose pitch is close to the population average. Beyond the effect of long-term experience, the presence of a speech context allows listeners to tune to the talker-specific pitch range, boosting identification accuracy from 43% (in isolation) to 86%. Our findings demonstrate that listeners have built-in knowledge of population-average pitch and can shift from the default reference to a talker-specific reference when context information is available.
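The correlation reported in the abstract relates each talker's distance from a gender-specific population average to identification accuracy. The sketch below illustrates one way such an analysis could be set up. It is not the authors' procedure: the talker values are invented purely for illustration, the use of mean F0 as a stand-in for pitch range is an assumption, and the semitone distance measure and all function names are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def semitone_distance(f0_hz, reference_hz):
    """Absolute distance in semitones between a talker's F0 and a reference.
    Assumption: the abstract does not say how distance was measured."""
    return abs(12.0 * math.log2(f0_hz / reference_hz))

# Invented illustrative data: (gender, mean F0 in Hz, accuracy in isolation).
# These are NOT values from the study.
talkers = [
    ("F", 180.0, 0.35), ("F", 210.0, 0.60), ("F", 240.0, 0.38),
    ("M", 100.0, 0.30), ("M", 120.0, 0.55), ("M", 140.0, 0.33),
]

# Gender-specific reference: the average F0 of the talkers of each gender.
reference = {}
for gender in {g for g, _, _ in talkers}:
    values = [f0 for g, f0, _ in talkers if g == gender]
    reference[gender] = sum(values) / len(values)

# Distance of each talker from the reference for that talker's gender.
distances = [semitone_distance(f0, reference[g]) for g, f0, _ in talkers]
accuracies = [acc for _, _, acc in talkers]

# A negative r means accuracy drops as talkers move away from the average.
r = pearson_r(distances, accuracies)
print(f"r = {r:.2f}")
```

With data shaped like this, talkers near their gender's average are identified most accurately, so the coefficient comes out negative, matching the direction of the effect described in the abstract.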
