Computational prediction of disease related lncRNAs using machine learning

4Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.

Abstract

Long non-coding RNAs (lncRNAs), which were once considered as transcriptional noise, are now in the limelight of current research. LncRNAs play a major role in regulating various biological processes such as imprinting, cell differentiation, and splicing. The mutations of lncRNAs are involved in various complex diseases. Identifying lncRNA-disease associations has gained a lot of attention as predicting it efficiently will lead towards better disease treatment. In this study, we have developed a machine learning model that predicts disease-related lncRNAs by combining sequence and structure-based features. The features were trained on SVM and Random Forest classifiers. We have compared our method with the state-of-the-art and obtained the highest F1 score of 76% on SVM classifier. Moreover, this study has overcome two serious limitations of the reported method which are lack of redundancy checking and implementation of oversampling for balancing the positive and negative class. Our method has achieved improved performance among machine learning models reported for lncRNA-disease associations. Combining multiple features together specifically lncRNAs sequence mutation has a significant contribution to the disease related lncRNA prediction.

References Powered by Scopus

Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences

8016Citations
N/AReaders
Get full text

StarBase v2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data

4234Citations
N/AReaders
Get full text

ViennaRNA Package 2.0

3467Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Targeting and engineering long non-coding RNAs for cancer therapy

53Citations
N/AReaders
Get full text

Recent applications of artificial intelligence in RNA-targeted small molecule drug discovery

5Citations
N/AReaders
Get full text

Targeting epigenetic deregulations for the management of esophageal carcinoma: recent advances and emerging approaches

4Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Khalid, R., Naveed, H., & Khalid, Z. (2023). Computational prediction of disease related lncRNAs using machine learning. Scientific Reports, 13(1). https://doi.org/10.1038/s41598-023-27680-7

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

57%

Professor / Associate Prof. 2

29%

Researcher 1

14%

Readers' Discipline

Tooltip

Computer Science 3

50%

Agricultural and Biological Sciences 1

17%

Biochemistry, Genetics and Molecular Bi... 1

17%

Engineering 1

17%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free