ASR Post-Correction for Spoken Dialogue Systems Based on Semantic, Syntactic, Lexical and Contextual Information

López-Cózar, Ramón; Callejas, Zoraida

Published in

Speech Communication (Elsevier), 50(8-9), pp. 745-766, 2008

DOI: 10.1016/j.specom.2008.03.008

Journal article published in 2008 by Ramón López-Cózar, Zoraida Callejas

This paper is available in a repository.

Abstract

This paper proposes a technique to correct speech recognition errors in spoken dialogue systems, with two main novel contributions. First, it considers several contexts in which a speech recognition result can be corrected. A threshold learnt during training is used to decide whether the correction must be carried out in the context associated with the current prompt type of the dialogue system, or in another context. Second, the technique deals with the confidence scores of the words employed in the corrections. The correction is carried out at two levels: statistical and linguistic. At the first level, the technique employs syntactic–semantic and lexical models, both contextual, to decide whether a recognition result is correct. Depending on this decision, the recognition result may be changed. At the second level, the technique employs basic linguistic knowledge to judge the grammatical correctness of the outcome of the first level. Depending on this decision, the outcome may be changed as well. Experimental results indicate that the technique enhances a dialogue system’s word accuracy, speech understanding, implicit recovery and task completion rates by 8.5%, 16.54%, 4% and 44.17%, respectively.
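The context-selection idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the context names, the vocabularies, the coverage-based score, the threshold value and the use of string similarity for lexical correction are all assumptions made for illustration, and the handling of confidence scores and the linguistic second level are not modelled at all.

```python
from difflib import get_close_matches

# Hypothetical per-context vocabularies (assumption). In the paper the
# contextual models are learnt from training data, not written by hand.
CONTEXT_VOCAB = {
    "food_order": {"i", "want", "one", "two", "ham", "cheese", "sandwich", "beer"},
    "phone_number": {"zero", "one", "two", "three", "four",
                     "five", "six", "seven", "eight", "nine"},
}


def context_score(words, vocab):
    """Fraction of recognised words covered by a context vocabulary."""
    if not words:
        return 0.0
    return sum(w in vocab for w in words) / len(words)


def choose_context(words, prompt_context, threshold):
    """Stay in the prompt's context if its score reaches the threshold,
    otherwise fall back to the best-scoring context overall."""
    if context_score(words, CONTEXT_VOCAB[prompt_context]) >= threshold:
        return prompt_context
    return max(CONTEXT_VOCAB, key=lambda c: context_score(words, CONTEXT_VOCAB[c]))


def correct(words, context, cutoff=0.6):
    """Replace each out-of-vocabulary word with its closest in-context word,
    leaving it unchanged when nothing is similar enough."""
    vocab = CONTEXT_VOCAB[context]
    corrected = []
    for w in words:
        if w in vocab:
            corrected.append(w)
        else:
            match = get_close_matches(w, sorted(vocab), n=1, cutoff=cutoff)
            corrected.append(match[0] if match else w)
    return corrected


if __name__ == "__main__":
    recognised = "i want one ham sandwitch".split()
    ctx = choose_context(recognised, "food_order", threshold=0.5)
    print(ctx, correct(recognised, ctx))
```

In this sketch the threshold is passed in as a fixed number; in the technique described above it would be learnt during training. A user who answers a food-order prompt with digits would score below the threshold in the prompt's context, so the correction would be attempted in the other context instead.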
