Benefits of dimension reduction in penalized regression methods for high dimensional grouped data: a case study in low sample size

Ajana, Soufiane; Jacqmin-Gadda, Hélène; Korobelnik, Jean-François; Martine, Lucy; Merle, Bénédicte; Acar, Niyazi; Vaysse, Carole; Bretillon, Lionel; Delcourt, Cécile; Berdeaux, Olivier; Bouton, Sylvain; Bron, Alain; Buaud, Benjamin; Cabaret, Stéphanie; Cougnard-Grégoire, Audrey; Hejblum, Boris P.; Creuzot-Garcher, Catherine; Delyfer, Marie-Noelle; Féart-Couret, Catherine; Febvret, Valérie; Grégoire, Stéphane; He, Zhiguo

Published in

Oxford University Press, Bioinformatics, 35(19), p. 3628-3634, 2019

DOI: 10.1093/bioinformatics/btz135

Journal article published in 2019 by Soufiane Ajana, Hélène Jacqmin-Gadda, Jean-François Korobelnik, Lucy Martine, Bénédicte Merle, Niyazi Acar, Carole Vaysse, Lionel Bretillon, Cécile Delcourt, Olivier Berdeaux, Sylvain Bouton, Alain Bron, Benjamin Buaud, Stéphanie Cabaret, Audrey Cougnard-Grégoire and other authors.

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving restricted

Published version: archiving forbidden

Abstract

Motivation

In some prediction analyses, predictors have a natural grouping structure, and selecting predictors while accounting for this additional information could be more effective for predicting the outcome accurately. Moreover, in a high dimension low sample size framework, obtaining a good predictive model becomes very challenging. The objective of this work was to investigate the benefits of dimension reduction in penalized regression methods, in terms of prediction performance and variable selection consistency, in high dimension low sample size data. Using two real datasets, we compared the performances of lasso, elastic net, group lasso, sparse group lasso, sparse partial least squares (PLS), group PLS and sparse group PLS.

Results

Considering dimension reduction in penalized regression methods improved the prediction accuracy. The sparse group PLS reached the lowest prediction error while consistently selecting a few predictors from a single group.

Availability and implementation

R codes for the prediction methods are freely available at https://github.com/SoufianeAjana/Blisar.

Supplementary information

Supplementary data are available at Bioinformatics online.
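To make the distinction between the ungrouped and grouped penalties named in the abstract concrete, the following is a minimal Python sketch of the shrinkage step behind lasso and group lasso. It is an illustration only and is not taken from the authors' R code: the function names, the toy coefficients, the penalty values and the sqrt(group size) weighting (a common convention) are all assumptions.

```python
import numpy as np

def soft_threshold(beta, lam):
    """Lasso-style shrinkage: each coefficient is shrunk toward zero on its own."""
    return np.sign(beta) * np.maximum(np.abs(beta) - lam, 0.0)

def group_soft_threshold(beta, groups, lam):
    """Group-lasso-style shrinkage: each group's coefficients are shrunk as a block,
    so a whole group is either kept (scaled down) or set to zero together."""
    out = np.zeros_like(beta, dtype=float)
    for g in np.unique(groups):
        idx = groups == g
        norm = np.linalg.norm(beta[idx])
        # Assumed weighting: penalty scaled by sqrt(group size).
        weight = lam * np.sqrt(idx.sum())
        if norm > weight:
            out[idx] = (1.0 - weight / norm) * beta[idx]
    return out

# Hypothetical toy coefficients: group 0 carries signal, group 1 is mostly noise.
beta = np.array([3.0, 4.0, 0.3, -0.2])
groups = np.array([0, 0, 1, 1])

lasso_step = soft_threshold(beta, 0.25)                # selects coefficient by coefficient
group_step = group_soft_threshold(beta, groups, 0.5)   # selects group by group
```

In this toy case the ungrouped step keeps a small nonzero coefficient in the second group, whereas the grouped step removes that group entirely. This is the kind of behavior that motivates using the grouping structure during selection.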
