Dissemin is shutting down on January 1st, 2025

Published in

Taylor and Francis Group, Journal of Statistical Computation and Simulation, 14(88), p. 2827-2851, 2018

DOI: 10.1080/00949655.2018.1490418

Links

Tools

Export citation

Search in Google Scholar

Fully Bayesian Logistic Regression with Hyper-Lasso Priors for High-dimensional Feature Selection

Journal article published in 2014 by Longhai Li ORCID, Weixin Yao ORCID
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Red circle
Preprint: archiving forbidden
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

High-dimensional feature selection arises in many areas of modern sciences. For example, in genomic research we want to find the genes that can be used to separate tissues of different classes (eg. cancer and normal) from tens of thousands of genes that are active (expressed) in certain tissue cells. To this end, we wish to fit regression and classification models with a large number of features (also called variables, predictors), which is still a tremendous challenge to date. In the past few years, penalized likelihood methods for fitting regression models based on hyper-lasso penalization have been explored considerably in the literature. However, fully Bayesian methods that use Markov chain Monte Carlo (MCMC) for fitting regression and classification models with hyper-lasso priors are still lack of investigation. In this paper, we introduce a new class of methods for fitting Bayesian logistic regression models with hyper-lasso priors using Hamiltonian Monte Carlo in restricted Gibbs sampling framework. We call our methods BLRHL for short. We use simulation studies to test BLRHL by comparing to LASSO, and to investigate the problems of choosing heaviness and scale in BLRHL. The main findings are that the choice of heaviness of prior plays a critical role in BLRHL, and that BLRHL is relatively robust to the choice of prior scale. We further demonstrate and investigate BLRHL in an application to a real microarray data set related to prostate cancer, which confirms the previous findings. An R add-on package called BLRHL will be available from http://math.usask.ca/~longhai/software/BLRHL. ; Comment: 34 pages. arXiv admin note: substantial text overlap with arXiv:1308.4690