Text mining for neuroanatomy using whitetext with an updated corpus and a new web application

16Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

Abstract

We describe the WhiteText project, and its progress towards automatically extracting statements of neuroanatomical connectivity from text. We review progress to date on the three main steps of the project: recognition of brain region mentions, standardization of brain region mentions to neuroanatomical nomenclature, and connectivity statement extraction. We further describe a new version of our manually curated corpus that adds 2,111 connectivity statements from 1,828 additional abstracts. Cross-validation classification within the new corpus replicates results on our original corpus, recalling 67% of connectivity statements at 51% precision. The resulting merged corpus provides 5,208 connectivity statements that can be used to seed species-specific connectivity matrices and to better train automated techniques. Finally, we present a new web application that allows fast interactive browsing of the over 70,000 sentences indexed by the system, as a tool for accessing the data and assisting in further curation. Software and data are freely available at http://www.chibi.ubc.ca/WhiteText/.

Cite

CITATION STYLE

APA

French, L., Liu, P., Marais, O., Koreman, T., Tseng, L., Lai, A., & Pavlidis, P. (2015). Text mining for neuroanatomy using whitetext with an updated corpus and a new web application. Frontiers in Neuroinformatics, 9(May). https://doi.org/10.3389/fninf.2015.00013

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free