Contrast and Variability in Gene Names

Abstract

We studied contrast and variability in a corpus of gene names to identify potential heuristics for use in performing entity identification in the molecular biology domain. Based on our findings, we developed heuristics for mapping weakly matching gene names to their official gene names. We then tested these heuristics against a large body of Medline abstracts, and found that using these heuristics can increase recall, with varying levels of precision. Our findings also underscored the importance of good information retrieval and of the ability to disambiguate between genes, proteins, RNA, and a variety of other referents for performing entity identification with high precision.
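As a rough illustration of what mapping a weakly matching name to an official name can look like, here is a minimal sketch. It is not the heuristics developed in the paper, which the abstract does not spell out. It assumes, purely for illustration, that case, hyphens, slashes, and whitespace do not distinguish one gene from another. The function names and the toy list of official symbols are hypothetical.

```python
import re


def normalize(name: str) -> str:
    """Collapse surface variation that this sketch ASSUMES is non-contrastive."""
    lowered = name.lower()
    # Assumption: hyphens, slashes, and whitespace never distinguish two genes.
    return re.sub(r"[\s\-/]+", "", lowered)


def build_index(official_names):
    """Map each normalized form to the set of official names that produce it."""
    index = {}
    for official in official_names:
        index.setdefault(normalize(official), set()).add(official)
    return index


def map_to_official(mention, index):
    """Return every official name whose normalized form equals the mention's."""
    return sorted(index.get(normalize(mention), set()))


# Toy list for illustration only, not data from the paper.
OFFICIAL = ["IL2", "TNF", "BRCA1"]
INDEX = build_index(OFFICIAL)

print(map_to_official("IL-2", INDEX))
print(map_to_official("brca 1", INDEX))
print(map_to_official("p53", INDEX))
```

Returning a list rather than a single name keeps ambiguity visible: the more variation the normalization step collapses, the more mentions it can match, but the more likely it is that two distinct official names end up with the same normalized form. That is one way the recall gain and the varying precision described in the abstract could arise.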

Cite

Cohen, K. B., Dolbey, A. E., Acquaah-Mensah, G. K., & Hunter, L. (2002). Contrast and Variability in Gene Names. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 14–20). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1118149.1118152
