Hi-LASSO: High-Dimensional LASSO

25Citations
Citations of this article
32Readers
Mendeley users who have this article in their library.

Abstract

High-throughput genomic technologies are leading to a paradigm shift in research of computational biology. Computational analysis with high-dimensional data and its interpretation are essential for the understanding of complex biological systems. Most biological data (e.g., gene expression and DNA sequence data) are high-dimensional, but consist of much fewer samples than predictors. Such high-dimension, low sample size (HDLSS) data often cause computational challenges in biological data analysis. A number of least absolute shrinkage and selection operator (LASSO) methods have been widely used for identifying biomarkers or prognostic factors in the field of bioinformatics. The LASSO solution has been improved through the development of the LASSO derivatives, including elastic-net, adaptive LASSO, relaxed LASSO, VISA, random LASSO, and recursive LASSO. However, there are several known limitations of the existing LASSO solutions: multicollinearity (particularly with different signs), subset size limitation, and the lack of the statistical test of significance. We propose a high-dimensional LASSO (Hi-LASSO) that theoretically improves a LASSO model providing better performance of both prediction and feature selection on extremely high-dimensional data. The Hi-LASSO alleviates bias introduced from bootstrapping, refines importance scores, improves the performance taking advantage of global oracle property, provides a statistical strategy to determine the number of bootstrapping, and allows tests of significance for feature selection with appropriate distribution. The performance of Hi-LASSO was assessed by comparing the existing state-of-the-art LASSO methods in extensive simulation experiments with multiple data settings. The Hi-LASSO was also applied for survival analysis with GBM gene expression data.

References Powered by Scopus

Regression Shrinkage and Selection Via the Lasso

35680Citations
N/AReaders
Get full text

Regularization and variable selection via the elastic net

13105Citations
N/AReaders
Get full text

The adaptive lasso and its oracle properties

5155Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Integrating Artificial Intelligence for Drug Discovery in the Context of Revolutionizing Drug Delivery

59Citations
N/AReaders
Get full text

A Critical Review of Machine Learning Techniques on Thermoelectric Materials

33Citations
N/AReaders
Get full text

Security-Constrained Multi-Objective Optimal Power Flow for a Hybrid AC/VSC-MTDC System with Lasso-Based Contingency Filtering

23Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kim, Y., Hao, J., Mallavarapu, T., Park, J., & Kang, M. (2019). Hi-LASSO: High-Dimensional LASSO. IEEE Access, 7, 44562–44573. https://doi.org/10.1109/ACCESS.2019.2909071

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 12

80%

Lecturer / Post doc 2

13%

Professor / Associate Prof. 1

7%

Readers' Discipline

Tooltip

Computer Science 4

33%

Engineering 4

33%

Mathematics 2

17%

Biochemistry, Genetics and Molecular Bi... 2

17%

Save time finding and organizing research with Mendeley

Sign up for free