Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma

Abstract

Background: Lung adenocarcinoma (LUAD) is the most common type of lung cancer and has a dismal mortality rate of 80%, mainly because it is diagnosed at an advanced stage. Biomarkers with high specificity and sensitivity for the early diagnosis of LUAD are scarce. This study aimed to identify markers for the early diagnosis of LUAD.

Methods: The GSE32863 and GSE75037 data sets were standardized and merged to screen for differentially expressed genes (DEGs). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were conducted. The DEGs selected by both the least absolute shrinkage and selection operator (LASSO) and support vector machine (SVM) regression analyses were considered the hub genes. The diagnostic ability and expression of the hub genes were then tested in the GSE63459 data set. Finally, CIBERSORT was used to analyze the correlation between the immune-infiltrating cells and the hub genes.

Results: The following 7 DEGs were selected by both the LASSO and SVM regression analyses: Locus 401286 (LOC401286), flavin-containing monooxygenase 2 (FMO2), XLKD1, Ras homolog family member J (RHOJ), scavenger receptor Class A member 5 (SCARA5), heat shock protein beta-2 (HSPB2), and serine incorporator 2 (SERINC2). In the training groups, the area under the receiver operating characteristic curve (AUC) of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.99, 1.00, 0.99, 1.00, 0.99, 0.99, and 0.98, respectively. In the validation group, the AUC of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.97, 0.96, 0.94, 0.88, 0.85, 0.94, and 0.89, respectively. The infiltration levels of naive B cells, memory B cells, plasma cells, naive cluster of differentiation (CD) 4 T cells, T follicular helper cells, regulatory T cells, gamma delta T cells, monocytes, M0 macrophages, M1 macrophages, resting mast cells, activated mast cells, and neutrophils differed between the normal and tumor tissues. Notably, these immune cells were correlated with the 7 diagnostic genes.

Conclusions: Using 2 machine-learning regression methods, we identified 7 DEGs in LUAD tissue that can be considered diagnostic genes and that could be very helpful for the early diagnosis of LUAD in clinical practice.
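
The two selection and evaluation steps described in the abstract (intersecting the genes chosen by two methods, then scoring each hub gene by its AUC) can be illustrated with a minimal sketch. It is not the authors' pipeline: the gene names (`GENE_A` to `GENE_E`), the expression values, and the helper names `single_gene_auc` and `direction_free_auc` are hypothetical, and the AUC is computed directly as the Mann-Whitney pair statistic rather than with any particular package. Reporting `max(auc, 1 - auc)` is an assumption made here so that a gene expressed at lower levels in tumor tissue still scores as informative.

```python
from typing import Dict, List, Sequence


def single_gene_auc(expression: Sequence[float], is_tumor: Sequence[int]) -> float:
    """AUC of one gene's expression used as a tumor-vs-normal score.

    Computed as the Mann-Whitney statistic: the fraction of (tumor, normal)
    sample pairs in which the tumor value is higher, counting ties as 0.5.
    """
    tumor = [x for x, y in zip(expression, is_tumor) if y == 1]
    normal = [x for x, y in zip(expression, is_tumor) if y == 0]
    if not tumor or not normal:
        raise ValueError("need at least one tumor and one normal sample")
    wins = 0.0
    for t in tumor:
        for n in normal:
            if t > n:
                wins += 1.0
            elif t == n:
                wins += 0.5
    return wins / (len(tumor) * len(normal))


def direction_free_auc(expression: Sequence[float], is_tumor: Sequence[int]) -> float:
    """AUC regardless of whether the gene is higher or lower in tumor (assumption)."""
    auc = single_gene_auc(expression, is_tumor)
    return max(auc, 1.0 - auc)


# Hypothetical outputs of two feature-selection methods (toy gene names).
lasso_selected = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
svm_selected = {"GENE_B", "GENE_C", "GENE_E"}

# Hub genes are the genes chosen by both methods.
hub_genes: List[str] = sorted(lasso_selected & svm_selected)

# Toy data: 4 normal samples (label 0) followed by 4 tumor samples (label 1).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
toy_expression: Dict[str, List[float]] = {
    "GENE_B": [8.1, 7.9, 8.4, 8.0, 5.2, 5.9, 6.1, 5.5],
    "GENE_C": [3.0, 3.4, 2.9, 4.1, 3.9, 4.5, 4.8, 5.0],
}

hub_auc = {gene: direction_free_auc(toy_expression[gene], labels) for gene in hub_genes}

for gene, auc in hub_auc.items():
    print(f"{gene}: AUC = {auc:.4f}")
```

In a real analysis the same per-gene AUC would be computed once on the training data and once on an independent validation data set, which is the comparison the abstract reports for the 7 hub genes.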

Cite

Wang, S., Wang, Q., Fan, B., Gong, J., Sun, L., Hu, B., & Wang, D. (2022). Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma. Journal of Thoracic Disease, 14(3), 699–711. https://doi.org/10.21037/jtd-22-206
