Published in

Wiley, Genetic Epidemiology, 5(38), p. 467-476, 2014

DOI: 10.1002/gepi.21809

Links

Tools

Export citation

Search in Google Scholar

Whole genome prediction of bladder cancer risk with the Bayesian LASSO.

This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

To build a predictive model for urothelial carcinoma of the bladder (UCB) risk combining both genomic and nongenomic data, 1,127 cases and 1,090 controls from the Spanish Bladder Cancer/EPICURO study were genotyped using the HumanHap 1M SNP array. After quality control filters, genotypes from 475,290 variants were available. Nongenomic information comprised age, gender, region, and smoking status. Three Bayesian threshold models were implemented including: (1) only genomic information, (2) only nongenomic data, and (3) both sources of information. The three models were applied to the whole population, to only nonsmokers, to male smokers, and to extreme phenotypes to potentiate the UCB genetic component. The area under the ROC curve allowed evaluating the predictive ability of each model in a 10-fold cross-validation scenario. Smoking status showed the highest predictive ability of UCB risk (AUCtest = 0.62). On the other hand, the AUC of all genetic variants was poorer (0.53). When the extreme phenotype approach was applied, the predictive ability of the genomic model improved 15%. This study represents a first attempt to build a predictive model for UCB risk combining both genomic and nongenomic data and applying state-of-the-art statistical approaches. However, the lack of genetic relatedness among individuals, the complexity of UCB etiology, as well as a relatively small statistical power, may explain the low predictive ability for UCB risk. The study confirms the difficulty of predicting complex diseases using genetic data, and suggests the limited translational potential of findings from this type of data into public health interventions. ? 2014 WILEY PERIODICALS, INC. ; The work was partially supported by the Fondo de Investigacion Sanitaria, Instituto de Salud Carlos III (G03/174, 00/0745, PI051436, PI061614, PI09-02102, G03/174, and Sara Borrell fellowship to E. L. M.), Spain; Fundacio la Marato de TV3 (#050830); Red Tematica de Investigacion Cooperativa en Cancer (RTICC, (RTICC, #C03/009, #RD06/0020, and #RD12/0036/0050), Instituto de Salud Carlos III (ISCIII), Spanish Ministry of Economy and Competitiveness & European Regional Development Fund (ERDF) "Una manera de hacer Europa"); Asociacion Espanola Contra el Cancer (AECC); EU-FP7-201663-UROMOL; and NIH-RO1-CA089715 and CA34627; and by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, USA.