A network approach for low dimensional signatures from high throughput data

Nico Curti; Giuseppe Levi; Enrico Giampieri; Gastone Castellani; Daniel Remondini

Journal ArticleOPEN ACCESS

A network approach for low dimensional signatures from high throughput data

Scientific Reports (2022) 12(1)

DOI: 10.1038/s41598-022-25549-9

1Citations

6Readers

Abstract

One of the main objectives of high-throughput genomics studies is to obtain a low-dimensional set of observables—a signature—for sample classification purposes (diagnosis, prognosis, stratification). Biological data, such as gene or protein expression, are commonly characterized by an up/down regulation behavior, for which discriminant-based methods could perform with high accuracy and easy interpretability. To obtain the most out of these methods features selection is even more critical, but it is known to be a NP-hard problem, and thus most feature selection approaches focuses on one feature at the time (k-best, Sequential Feature Selection, recursive feature elimination). We propose DNetPRO, Discriminant Analysis with Network PROcessing, a supervised network-based signature identification method. This method implements a network-based heuristic to generate one or more signatures out of the best performing feature pairs. The algorithm is easily scalable, allowing efficient computing for high number of observables (10 3–10 5). We show applications on real high-throughput genomic datasets in which our method outperforms existing results, or is compatible with them but with a smaller number of selected features. Moreover, the geometrical simplicity of the resulting class-separation surfaces allows a clearer interpretation of the obtained signatures in comparison to nonlinear classification models.

Cite

CITATION STYLE

APA

Curti, N., Levi, G., Giampieri, E., Castellani, G., & Remondini, D. (2022). A network approach for low dimensional signatures from high throughput data. Scientific Reports, 12(1). https://doi.org/10.1038/s41598-022-25549-9

A network approach for low dimensional signatures from high throughput data

Abstract

Cite

Register to see more suggestions