Classify Hyperdiploidy Status of Multiple Myeloma Patients Using Gene Expression Profiles

Li, Yingxiang; Wang, Xujun; Zheng, Haiyang; Wang, Chengyang; Minvielle, Stéphane; Magrangeas, Florence; Avet-Loiseau, Hervé; Pk, Shah; Shah, Parantu K.; Zhang, Yong; Munshi, Nikhil C.; Nc, Munshi; Li, Cheng

Published in

Public Library of Science, PLoS ONE, 3(8), p. e58809, 2013

DOI: 10.1371/journal.pone.0058809

Tools

Export citation

Search in Google Scholar

Classify Hyperdiploidy Status of Multiple Myeloma Patients Using Gene Expression Profiles

Journal article published in 2013 by Yingxiang Li, Xujun Wang

, Haiyang Zheng, Chengyang Wang, Stéphane Minvielle

, Florence Magrangeas

, Hervé Avet-Loiseau

, Shah Pk, Parantu K. Shah, Yong Zhang, Nikhil C. Munshi, Munshi Nc, Cheng Li

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Multiple myeloma (MM) is a cancer of antibody-making plasma cells. It frequently harbors alterations in DNA and chromosome copy numbers, and can be divided into two major subtypes, hyperdiploid (HMM) and non-hyperdiploid multiple myeloma (NHMM). The two subtypes have different survival prognosis, possibly due to different but converging paths to oncogenesis. Existing methods for identifying the two subtypes are fluorescence in situ hybridization (FISH) and copy number microarrays, with increased cost and sample requirements. We hypothesize that chromosome alterations have their imprint in gene expression through dosage effect. Using five MM expression datasets that have HMM status measured by FISH and copy number microarrays, we have developed and validated a K-nearest-neighbor method to classify MM into HMM and NHMM based on gene expression profiles. Classification accuracy for test datasets ranges from 0.83 to 0.88. This classification will enable researchers to study differences and commonalities of the two MM subtypes in disease biology and prognosis using expression datasets without need for additional subtype measurements. Our study also supports the advantages of using cancer specific characteristics in feature design and pooling multiple rounds of classification results to improve accuracy. We provide R source code and processed datasets at www.ChengLiLab.org/software.

Published in

Links

Tools

Classify Hyperdiploidy Status of Multiple Myeloma Patients Using Gene Expression Profiles

Abstract