Potential application of item-response theory to interpretation of medical codes in electronic patient records

van Staa, Tjeerd; Wolfe, Charles; Yardley, Lucy; Dregan, Alex; Charlton, Judith; Delaney, Brendan; Grieve, Andy; Little, Paul; Mc, Gulliford; Moore, Michael; Gulliford, Martin C.; Rudd, Anthony; Van Staa, T.; Taweel, Adel; Team, eCRT Research

Published in

BioMed Central, BMC Medical Research Methodology, 1(11), 2011

DOI: 10.1186/1471-2288-11-168

Journal article published in 2011 by Tjeerd van Staa, Charles Wolfe, Lucy Yardley, Alex Dregan, Judith Charlton, Brendan Delaney, Andy Grieve, Paul Little, Gulliford Mc, Michael Moore, Martin C. Gulliford, Anthony Rudd, T. Van Staa, Adel Taweel, eCRT Research Team

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving allowed


Abstract

BACKGROUND: Electronic patient records are generally coded using extensive sets of codes, but the significance of the use of individual codes may be unclear. Item response theory (IRT) models are used to characterise the psychometric properties of items included in tests and questionnaires. This study asked whether the properties of medical codes in electronic patient records may be characterised through the application of item response theory models.

METHODS: Data were provided by a cohort of 47,845 participants from 414 family practices in the UK General Practice Research Database (GPRD) with a first stroke between 1997 and 2006. Each eligible stroke code, out of a set of 202 OXMIS and Read codes, was coded as either recorded or not recorded for each participant. A two-parameter IRT model was fitted using marginal maximum likelihood estimation. Estimated parameters from the model were considered to characterise each code with respect to the latent trait of stroke diagnosis. The location parameter is referred to as a calibration parameter, while the slope parameter is referred to as a discrimination parameter.

RESULTS: There were 79,874 stroke code occurrences available for analysis. Utilisation of codes varied between family practices, with intraclass correlation coefficients of up to 0.25 for the most frequently used codes. IRT analyses were restricted to 110 Read codes. Calibration and discrimination parameters were estimated for 77 (70%) codes that were endorsed for 1,942 stroke patients. Parameters were not estimated for the remaining more frequently used codes. Discrimination parameter values ranged from 0.67 to 2.78, while calibration parameter values ranged from 4.47 to 11.58. The two-parameter model gave a better fit to the data than either the one- or three-parameter models. However, high chi-square values for about a fifth of the stroke codes were suggestive of poor item fit.

CONCLUSION: The application of item response theory models to coded electronic patient records might contribute to identifying medical codes that offer poor discrimination or low calibration. This might indicate the need for improved coding sets or a requirement for improved clinical coding practice. However, in this study estimates were only obtained for a small proportion of participants and there was some evidence of poor model fit. There was also evidence of variation in the utilisation of codes between family practices, raising the possibility that, in practice, properties of codes may vary for different coders.
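
As a rough illustration of how the calibration and discrimination parameters behave, the following minimal sketch evaluates a two-parameter item response function. It assumes the standard logistic form, P(code recorded | theta) = 1 / (1 + exp(-a(theta - b))), which the abstract does not state explicitly; the function name `two_pl_probability` is hypothetical, and the parameter values are the extremes of the ranges reported above, not fitted pairs for any particular code.

```python
import math


def two_pl_probability(theta: float, discrimination: float, calibration: float) -> float:
    """Probability that a code is recorded under an assumed logistic 2PL model.

    theta          -- latent trait value for the patient (stroke diagnosis trait)
    discrimination -- slope parameter (a)
    calibration    -- location parameter (b)
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - calibration)))


# Illustrative inputs only: extremes of the reported ranges,
# not an actual (discrimination, calibration) pair from the study.
calibration = 4.47
for discrimination in (0.67, 2.78):
    at_location = two_pl_probability(calibration, discrimination, calibration)
    one_above = two_pl_probability(calibration + 1.0, discrimination, calibration)
    print(f"a={discrimination}: P(theta=b)={at_location:.3f}, P(theta=b+1)={one_above:.3f}")
```

Under this assumed form, the probability of a code being recorded is 0.5 when the latent trait equals the calibration parameter, and a larger discrimination parameter makes the probability rise more steeply around that point.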
