PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction

Chan, Lawrence Wc; Liu, Ying; Chan, Tao; Law, Helen Kw; Cesar Wong, S. C.; Wong, Sc Cesar; Yeung, Andy Ph; Lo, Kf F.; Yeung, Sw W.; Kwok, Ky; Chan, William Yl; Lau, Thomas Yh; Shyu, Chi-Ren

Published in

BioMed Central, BMC Medical Informatics and Decision Making, 1(15), 2015

DOI: 10.1186/s12911-015-0166-2

Tools

Export citation

Search in Google Scholar

PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction

Journal article published in 2015 by Lawrence Wc Chan

, Ying Liu

, Tao Chan, Helen Kw Law, S. C. Cesar Wong, Sc Cesar Wong, Andy Ph Yeung, Kf F. Lo, Sw W. Yeung, Ky Kwok, William Yl Chan, Thomas Yh Lau, Chi-Ren Shyu

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract Background Similarity-based retrieval of Electronic Health Records (EHRs) from large clinical information systems provides physicians the evidence support in making diagnoses or referring examinations for the suspected cases. Clinical Terms in EHRs represent high-level conceptual information and the similarity measure established based on these terms reflects the chance of inter-patient disease co-occurrence. The assumption that clinical terms are equally relevant to a disease is unrealistic, reducing the prediction accuracy. Here we propose a term weighting approach supported by PubMed search engine to address this issue. Methods We collected and studied 112 abdominal computed tomography imaging examination reports from four hospitals in Hong Kong. Clinical terms, which are the image findings related to hepatocellular carcinoma (HCC), were extracted from the reports. Through two systematic PubMed search methods, the generic and specific term weightings were established by estimating the conditional probabilities of clinical terms given HCC. Each report was characterized by an ontological feature vector and there were totally 6216 vector pairs. We optimized the modified direction cosine (mDC) with respect to a regularization constant embedded into the feature vector. Equal, generic and specific term weighting approaches were applied to measure the similarity of each pair and their performances for predicting inter-patient co-occurrence of HCC diagnoses were compared by using Receiver Operating Characteristics (ROC) analysis. Results The Areas under the curves (AUROCs) of similarity scores based on equal, generic and specific term weighting approaches were 0.735, 0.728 and 0.743 respectively (p

Published in

Links

Tools

PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction

Abstract