Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study

1Citations
Citations of this article
70Readers
Mendeley users who have this article in their library.

Abstract

Identifying and extracting data elements such as study descriptors in publication full texts is a critical yet manual and labor-intensive step required in a number of tasks. In this paper we address the question of identifying data elements in an unsupervised manner. Specifically, provided a set of criteria describing specific study parameters, such as species, route of administration, and dosing regimen, we develop an unsupervised approach to identify text segments (sentences) relevant to the criteria. A binary classifier trained to identify publications that met the criteria performs better when trained on the candidate sentences than when trained on sentences randomly picked from the text, supporting the intuition that our method is able to accurately identify study descriptors.

Cite

CITATION STYLE

APA

Herrmannova, D., Young, S. R., Patton, R. M., Stahl, C. G., Kleinstreuer, N. C., & Wolfe, M. S. (2018). Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study. In EMNLP 2018 - 9th International Workshop on Health Text Mining and Information Analysis, LOUHI 2018 - Proceedings of the Workshop (pp. 71–82). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-5609

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free