Multidocument summarization of engineering papers based on macro- and microstructure

Zhan, Jiaming; Liu, Ying; Loh, Han Tong

Published in

American Society of Mechanical Engineers, Journal of Computing and Information Science in Engineering, 11(1), p. 011008

DOI: 10.1115/1.3563048

Journal article published in 2011 by Jiaming Zhan, Ying Liu, Han Tong Loh

Abstract

This paper focuses on automatic summarization of multiple engineering papers. A summarization approach based on the documents' macro- and microstructure is proposed. The macrostructure consists of a list of ranked topics from engineering papers. Topics are discovered by extracting frequently appearing word sequences and grouping them into equivalence classes. Hence, the macrostructure symbolically represents the topical links across different papers. The microstructure, in turn, is defined as the rhetorical structure within a single paper. Its identification is approached as a classification problem: each sentence in a paper is automatically labeled with one of the predefined rhetorical categories. Unlike existing summarization methods that first separate documents into nonoverlapping clusters and then summarize each cluster individually, our approach aims to summarize multiple documents according to the characteristics suggested at the macro- and microstructure levels. The experimental study showed that the proposed approach outperformed peer systems in terms of Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scores and readers' responsiveness. In an independent manual categorization task using the summaries generated by our approach and by peer systems, our approach also achieved better precision and recall.
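The abstract reports results as ROUGE scores. As a rough illustration of what the recall-oriented variant measures, the following is a minimal sketch of ROUGE-N recall: the fraction of reference n-grams that also appear in a candidate summary, with clipped counts. The function names, whitespace tokenization, and lowercasing are assumptions made for this example; the abstract does not say which ROUGE variant or toolkit settings the authors used.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N recall: clipped n-gram overlap divided by reference n-gram count.

    Assumption: naive lowercasing and whitespace tokenization, no stemming
    or stopword removal.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Clip each n-gram's credit at the number of times the candidate has it.
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


reference = "the macrostructure is a list of ranked topics"
candidate = "the macrostructure is a ranked list of topics"
unigram_recall = rouge_n_recall(candidate, reference, n=1)
bigram_recall = rouge_n_recall(candidate, reference, n=2)
```

In this toy pair the candidate reuses every reference word but reorders two of them, so unigram recall is perfect while bigram recall drops. That is why ROUGE scores are usually reported for more than one n.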
