Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

Noor, Kawsar; Roguski, Lukasz; Bai, Xi; Handy, Alex; Klapaukh, Roman; Folarin, Amos; Romao, Luis; Matteson, Joshua; Lea, Nathan; Zhu, Leilei; Asselbergs, Folkert W.; Wong, Wai Keong; Shah, Anoop; Dobson, Richard Jb

Published in

JMIR Publications, JMIR Medical Informatics, 8(10), p. e38122, 2022

DOI: 10.2196/38122

Tools

Export citation

Search in Google Scholar

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

Journal article published in 2022 by Kawsar Noor

, Lukasz Roguski

, Xi Bai

, Alex Handy

, Roman Klapaukh

, Amos Folarin

, Luis Romao

, Joshua Matteson

, Nathan Lea

, Leilei Zhu

, Folkert W. Asselbergs

, Wai Keong Wong

, Anoop Shah

, Richard Jb Dobson

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Background As more health care organizations transition to using electronic health record (EHR) systems, it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters, etc) and more practically have such systems interact with all of the hospital data systems (legacy and current). Objective We describe the deployment of the EHR interfacing information extraction and retrieval platform CogStack at University College London Hospitals (UCLH). Methods At UCLH, we have deployed the CogStack platform, an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using the Apache NiFi module for managing complex data flows. The platform also facilitates the extraction of structured data from free-text records through use of the MedCAT natural language processing library. Finally, data science tools are made available to support data scientists and the development of downstream applications dependent upon data ingested and analyzed by CogStack. Results The platform has been deployed at the hospital, and in particular, it has facilitated a number of research and service evaluation projects. To date, we have processed over 30 million records, and the insights produced from CogStack have informed a number of clinical research use cases at the hospital. Conclusions The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly, the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using natural language processing technology.

Published in

Links

Tools

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

Abstract