METSP: A Maximum-Entropy Classifier Based Text Mining Tool for Transporter-Substrate Identification with Semistructured Text


Abstract

The substrates of a transporter are useful not only for inferring the transporter's function but also for discovering compound-compound interactions and reconstructing metabolic pathways. Although a large amount of data has accumulated with the development of new technologies such as in vitro transporter assays, the search for transporter substrates is far from complete. In this article, we introduce METSP, a maximum-entropy classifier devoted to retrieving transporter-substrate pairs (TSPs) from semistructured text. Based on the high-quality annotation from UniProt, METSP achieves high precision and recall in cross-validation experiments. When applied to 182,829 human transporter annotation sentences in UniProt, METSP identifies 3,942 sentences containing transporter and compound information. Finally, 1,547 high-confidence human TSPs are identified for further manual curation, of which 58.37% involve novel substrates not annotated in public transporter databases. METSP is the first efficient tool to extract TSPs from semistructured annotation text in UniProt. This tool can help to determine the precise substrates and drugs of transporters, thus facilitating drug-target prediction, metabolic network reconstruction, and literature classification.
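To make the core idea concrete, the following is a minimal sketch of a binary maximum-entropy classifier (equivalent to logistic regression) that scores annotation-like sentences for transporter-substrate content. It is not the METSP implementation: the training sentences are invented for illustration rather than taken from UniProt, and the bag-of-words features, the function names (`features`, `predict_proba`, `train`), and the hyperparameters are all assumptions.

```python
import math
from collections import defaultdict

# Toy, made-up sentences (NOT from UniProt or the paper). Label 1 means the
# sentence describes a transporter-substrate relation, 0 means it does not.
TRAIN = [
    ("mediates the transport of glucose across the membrane", 1),
    ("involved in the uptake of zinc ions", 1),
    ("transports lactate and pyruvate", 1),
    ("catalyzes the efflux of bile acids", 1),
    ("belongs to the major facilitator superfamily", 0),
    ("expressed in liver and kidney", 0),
    ("interacts with the cytoskeleton protein", 0),
    ("localized to the plasma membrane", 0),
]


def features(sentence):
    """Bag-of-words indicator features plus a bias term (an assumed feature set)."""
    feats = {"__bias__": 1.0}
    for token in sentence.lower().split():
        feats["w=" + token] = 1.0
    return feats


def predict_proba(weights, feats):
    """P(label = 1 | sentence) under the binary maximum-entropy model."""
    score = sum(weights.get(name, 0.0) * value for name, value in feats.items())
    return 1.0 / (1.0 + math.exp(-score))


def train(data, epochs=200, lr=0.5, l2=0.01):
    """Fit weights by stochastic gradient ascent on the L2-regularized log-likelihood."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for sentence, label in data:
            feats = features(sentence)
            # Gradient of the log-likelihood for one example: (observed - expected) * feature.
            error = label - predict_proba(weights, feats)
            for name, value in feats.items():
                weights[name] += lr * (error * value - l2 * weights[name])
    return dict(weights)


weights = train(TRAIN)
```

A sentence can then be scored with `predict_proba(weights, features("mediates the uptake of glucose"))`, and sentences above a chosen probability threshold would be passed on for compound extraction and manual curation. A real system would need far richer features and training data than this toy example.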

Cite

Zhao, M., Chen, Y., Qu, D., & Qu, H. (2015). METSP: A Maximum-Entropy Classifier Based Text Mining Tool for Transporter-Substrate Identification with Semistructured Text. BioMed Research International, 2015. https://doi.org/10.1155/2015/254838
