Dissemin is shutting down on January 1st, 2025

Published in

IOS Press, Studies in Health Technology and Informatics, MEDINFO 2013(192), p. 1211-1211, 2013

DOI: 10.3233/978-1-61499-289-9-1211

Links

Tools

Export citation

Search in Google Scholar

Building a Common Pipeline for Rule-based Document Classification.

Journal article published in 2013 by Olga V. Patterson ORCID, Ginter Thomas, T. Ginter, Scott L. Duvall
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Instance-based classification of clinical text is a widely used natural language processing task employed as a step for patient classification, document retrieval, or information extraction. Rule-based approaches rely on concept identification and context analysis in order to determine the appropriate class. We propose a five-step process that enables even small research teams to develop simple but powerful rule-based NLP systems by taking advantage of a common UIMA AS based pipeline for classification. Our proposed methodology coupled with the general-purpose solution provides researchers with access to the data locked in clinical text in cases of limited human resources and compact timelines.