Relations between de-facto criteria in the evaluation of a spoken dialogue system

Callejas, Zoraida; López-Cózar, Ramón

Published in

Elsevier, Speech Communication, 8-9(50), p. 646-665, 2008

DOI: 10.1016/j.specom.2008.04.004

Tools

Export citation

Search in Google Scholar

Relations between de-facto criteria in the evaluation of a spoken dialogue system

Journal article published in 2008 by Zoraida Callejas

, Ramón López-Cózar

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Evaluation of spoken dialogue systems has been traditionally carried out in terms of instrumentally or expert-derived measures (usually called “objective” evaluation) and quality judgments of users who have previously interacted with the system (also called “subjective” evaluation). Different research efforts have been made to extract relationships between these evaluation criteria. In this paper we report empirical results obtained from statistical studies, which were carried out on interactions of real users with our spoken dialogue system. These studies have rarely been exploited in the literature. Our results show that they can indicate important relationships between criteria, which can be used as guidelines for refinement of the systems under evaluation, as well as contributing to the state-of-the-art knowledge about how quantitative aspects of the systems affect the user’s perceptions about them.

Published in

Links

Tools

Relations between de-facto criteria in the evaluation of a spoken dialogue system

Abstract