FEGS: a novel feature extraction model for protein sequences and its applications

26Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Feature extraction of protein sequences is widely used in various research areas related to protein analysis, such as protein similarity analysis and prediction of protein functions or interactions. Results: In this study, we introduce FEGS (Feature Extraction based on Graphical and Statistical features), a novel feature extraction model of protein sequences, by developing a new technique for graphical representation of protein sequences based on the physicochemical properties of amino acids and effectively employing the statistical features of protein sequences. By fusing the graphical and statistical features, FEGS transforms a protein sequence into a 578-dimensional numerical vector. When FEGS is applied to phylogenetic analysis on five protein sequence data sets, its performance is notably better than all of the other compared methods. Conclusion: The FEGS method is carefully designed, which is practically powerful for extracting features of protein sequences. The current version of FEGS is developed to be user-friendly and is expected to play a crucial role in the related studies of protein sequence analyses.

References Powered by Scopus

Basic local alignment search tool

79227Citations
N/AReaders
Get full text

CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

58546Citations
N/AReaders
Get full text

Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding

8820Citations
N/AReaders
Get full text

Cited by Powered by Scopus

ToxIBTL: Prediction of peptide toxicity based on information bottleneck and transfer learning

89Citations
N/AReaders
Get full text

Machine learning on protein–protein interaction prediction: models, challenges and trends

32Citations
N/AReaders
Get full text

Improved Prediction Model of Protein and Peptide Toxicity by Integrating Channel Attention into a Convolutional Neural Network and Gated Recurrent Units

24Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Mu, Z., Yu, T., Liu, X., Zheng, H., Wei, L., & Liu, J. (2021). FEGS: a novel feature extraction model for protein sequences and its applications. BMC Bioinformatics, 22(1). https://doi.org/10.1186/s12859-021-04223-3

Readers over time

‘21‘22‘23‘24036912

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 8

67%

Researcher 4

33%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 6

60%

Computer Science 2

20%

Nursing and Health Professions 1

10%

Agricultural and Biological Sciences 1

10%

Save time finding and organizing research with Mendeley

Sign up for free
0