Identification of DEP domain-containing proteins by a machine learning method and experimental analysis of their expression in human HCC tissues

Abstract

The Dishevelled/EGL-10/Pleckstrin (DEP) domain-containing (DEPDC) proteins have seven members. However, whether this superfamily can be distinguished from other proteins based only on amino acid sequences remains unknown. Here, we describe a computational method to distinguish DEPDCs from non-DEPDCs. First, we examined the Pfam numbers of the known DEPDCs and used the longest sequence for each Pfam to construct a phylogenetic tree. Subsequently, we extracted 188-dimensional (188D) and 20D features of DEPDCs and non-DEPDCs and classified them with a random forest classifier. We also mined the motifs of human DEPDCs to find the related domains. Finally, we designed experimental methods to verify human DEPDC expression at the mRNA level in hepatocellular carcinoma (HCC) and adjacent normal tissues. The phylogenetic analysis showed that the DEPDC superfamily can be divided into three clusters. Moreover, the 188D and 20D features can both be used to effectively distinguish the two protein types. Motif analysis revealed that the DEP and RhoGAP domains were common in human DEPDCs. DEPDCs were widely expressed in human HCC and the adjacent tissues; however, their regulation was not identical. In conclusion, we successfully constructed a binary classifier for DEPDCs and experimentally verified their expression in human HCC tissues.
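
The abstract does not define the 20D features. A common 20-dimensional sequence descriptor is amino acid composition, the fraction of each of the 20 standard residues in a protein. The sketch below assumes that reading; it is an illustration, not the authors' feature extractor. The function name `composition_20d` and the toy sequence are hypothetical.

```python
from collections import Counter
from typing import List

# The 20 standard amino acids, in a fixed order so every vector is comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_20d(sequence: str) -> List[float]:
    """Return the fraction of each standard amino acid in `sequence`.

    Assumption: the 20D feature is plain amino acid composition.
    Non-standard symbols (e.g. X, B, Z, gaps) are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Toy sequence invented for illustration; not taken from the paper.
features = composition_20d("MKTLLDEPAAK")
print(len(features))  # one value per standard amino acid
```

Vectors of this kind, one per protein and labelled DEPDC or non-DEPDC, could then be passed to any random forest implementation, for example scikit-learn's `RandomForestClassifier`. The abstract does not say which software the authors used.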

Liao, Z., Wang, X., Zeng, Y., & Zou, Q. (2016). Identification of DEP domain-containing proteins by a machine learning method and experimental analysis of their expression in human HCC tissues. Scientific Reports, 6. https://doi.org/10.1038/srep39655