Chemical documents: machine understanding and automated information extraction

Townsend, Joe A.; Adams, Sam E.; Waudby, Christopher A.; de Souza, Vanessa K.; Goodman, Jonathan M.; Murray-Rust, Peter

Published in

Royal Society of Chemistry, Organic and Biomolecular Chemistry, 22(2), p. 3294

DOI: 10.1039/b411033a

Tools

Export citation

Search in Google Scholar

Chemical documents: machine understanding and automated information extraction

Journal article published in 2004 by Joe A. Townsend, Sam E. Adams, Christopher A. Waudby, Vanessa K. de Souza, Jonathan M. Goodman

, Peter Murray-Rust

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Automatically extracting chemical information from documents is a challenging task, but an essential one for dealing with the vast quantity of data that is available. The task is least difficult for structured documents, such as chemistry department web pages or the output of computational chemistry programs, but requires increasingly sophisticated approaches for less structured documents, such as chemical papers. The identification of key units of information, such as chemical names, makes the extraction of useful information from unstructured documents possible.

Published in

Links

Tools

Chemical documents: machine understanding and automated information extraction

Abstract