The My Cancer Genome clinical trial data model and trial curation workflow

Jain, Neha; Mittendorf, Kathleen F.; Holt, Marilyn; Lenoue-Newton, Michele; Maurer, Ian; Miller, Clinton; Stachowiak, Matthew; Botyrius, Michelle; Cole, James; Micheel, Christine; Levy, Mia

Published in

Oxford University Press, JAMIA: A Scholarly Journal of Informatics in Health and Biomedicine, 7(27), p. 1057-1066, 2020

DOI: 10.1093/jamia/ocaa066

Tools

Export citation

Search in Google Scholar

The My Cancer Genome clinical trial data model and trial curation workflow

Journal article published in 2020 by Neha Jain

, Kathleen F. Mittendorf

, Marilyn Holt, Michele Lenoue-Newton, Ian Maurer, Clinton Miller, Matthew Stachowiak, Michelle Botyrius, James Cole, Christine Micheel, Mia Levy

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Abstract Objective As clinical trials evolve in complexity, clinical trial data models that can capture relevant trial data in meaningful, structured annotations and computable forms are needed to support accrual. Material and Methods We have developed a clinical trial information model, curation information system, and a standard operating procedure for consistent and accurate annotation of cancer clinical trials. Clinical trial documents are pulled into the curation system from publicly available sources. Using a web-based interface, a curator creates structured assertions related to disease-biomarker eligibility criteria, therapeutic context, and treatment cohorts by leveraging our data model features. These structured assertions are published on the My Cancer Genome (MCG) website. Results To date, over 5000 oncology trials have been manually curated. All trial assertion data are available for public view on the MCG website. Querying our structured knowledge base, we performed a landscape analysis to assess the top diseases, biomarker alterations, and drugs featured across all cancer trials. Discussion Beyond curating commonly captured elements, such as disease and biomarker eligibility criteria, we have expanded our model to support the curation of trial interventions and therapeutic context (ie, neoadjuvant, metastatic, etc.), and the respective biomarker-disease treatment cohorts. To the best of our knowledge, this is the first effort to capture these fields in a structured format. Conclusion This paper makes a significant contribution to the field of biomedical informatics and knowledge dissemination for precision oncology via the MCG website. Key words knowledge representation, My Cancer Genome, precision oncology, knowledge curation, cancer informatics, clinical trial data model

Published in

Links

Tools

The My Cancer Genome clinical trial data model and trial curation workflow

Abstract