Maximum entropy segmentation of broadcast news

Christensen, Heidi; Kolluru, BalaKrishna; Gotoh, Yoshihiko; Renals, Steve

Published in

Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

DOI: 10.1109/icassp.2005.1415292

Journal article published in 2005 by Heidi Christensen, BalaKrishna Kolluru, Yoshihiko Gotoh, Steve Renals

This paper is available in a repository.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving forbidden

Abstract

This paper presents an automatic system for structuring and preparing a news broadcast for applications such as speech summarization, browsing, archiving and information retrieval. This process comprises transcribing the audio using an automatic speech recognizer and subsequently segmenting the text into utterances and topics. A maximum entropy approach is used to build statistical models for both utterance and topic segmentation. The experimental work addresses the effect on performance of the topic boundary detector of three factors: the information sources used, the quality of the ASR transcripts, and the quality of the utterance boundary detector. The results show that the topic segmentation is not affected severely by transcript errors, whereas errors in the utterance segmentation are more devastating.
