Published in

Proceedings of the seventh international conference on Knowledge capture - K-CAP '13

DOI: 10.1145/2479832.2479856

Links

Tools

Export citation

Search in Google Scholar

A semi-automatic approach for building ontologies from acollection of structured web documents

Proceedings article published in 2013 by Mouna Kamel, N. Aussenac Gilles ORCID, Davide Buscaldi, Catherine Comparot
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Many collections of structured documents are available on the web. The collection generally describes the characteristics of entities from a single type, where each page describes one entity. These documents are adequate knowledge sources for building ontologies. As they benefit from a strong and shared layout, they contain less well written text than plain text files but their architecture is very meaningful. Classical linguistic-based methods for identifying concepts and relations are no longer appropriate for analyzing them.The approach we propose in this paper exploits various properties of such documents, combining layout/formatting analysis and linguistic analysis, and using semantic annotation.