Machine learning based combination of multi-omics data for subgroup identification in non-small cell lung cancer

13Citations
Citations of this article
44Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Non-small Cell Lung Cancer (NSCLC) is a heterogeneous disease with a poor prognosis. Identifying novel subtypes in cancer can help classify patients with similar molecular and clinical phenotypes. This work proposes an end-to-end pipeline for subgroup identification in NSCLC. Here, we used a machine learning (ML) based approach to compress the multi-omics NSCLC data to a lower dimensional space. This data is subjected to consensus K-means clustering to identify the five novel clusters (C1–C5). Survival analysis of the resulting clusters revealed a significant difference in the overall survival of clusters (p-value: 0.019). Each cluster was then molecularly characterized to identify specific molecular characteristics. We found that cluster C3 showed minimal genetic aberration with a high prognosis. Next, classification models were developed using data from each omic level to predict the subgroup of unseen patients. Decision‑level fused classification models were then built using these classifiers, which were used to classify unseen patients into five novel clusters. We also showed that the multi-omics-based classification model outperformed single-omic-based models, and the combination of classifiers proved to be a more accurate prediction model than the individual classifiers. In summary, we have used ML models to develop a classification method and identified five novel NSCLC clusters with different genetic and clinical characteristics.

References Powered by Scopus

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

36247Citations
N/AReaders
Get full text

Cancer statistics, 2020

16898Citations
N/AReaders
Get full text

Robust enumeration of cell subsets from tissue expression profiles

8460Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Novel research and future prospects of artificial intelligence in cancer diagnosis and treatment

40Citations
N/AReaders
Get full text

Prioritization of genes for translation: a computational approach

5Citations
N/AReaders
Get full text

Harnessing Artificial Intelligence in Multimodal Omics Data Integration: Paving the Path for the Next Frontier in Precision Medicine

5Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Khadirnaikar, S., Shukla, S., & Prasanna, S. R. M. (2023). Machine learning based combination of multi-omics data for subgroup identification in non-small cell lung cancer. Scientific Reports, 13(1). https://doi.org/10.1038/s41598-023-31426-w

Readers over time

‘23‘24‘2509182736

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 8

50%

Researcher 4

25%

Professor / Associate Prof. 2

13%

Lecturer / Post doc 2

13%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 6

50%

Pharmacology, Toxicology and Pharmaceut... 3

25%

Medicine and Dentistry 2

17%

Materials Science 1

8%

Article Metrics

Tooltip
Mentions
News Mentions: 2

Save time finding and organizing research with Mendeley

Sign up for free
0