Extracting topical phrases from clinical documents

22Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

In clinical documents, medical terms are often expressed in multi-word phrases. Traditional topic modelling approaches relying on the "bag-of-words" assumption are not effective in extracting topic themes from clinical documents. This paper proposes to first extract medical phrases using an off-theshelf tool for medical concept mention extraction, and then train a topic model which takes a hierarchy of Pitman-Yor processes as prior for modelling the generation of phrases of arbitrary length. Experimental results on patients' discharge summaries show that the proposed approach outperforms the state-of-the-art topical phrase extraction model on both perplexity and topic coherence measure and finds more interpretable topics.

Cite

CITATION STYLE

APA

He, Y. (2016). Extracting topical phrases from clinical documents. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 2957–2963). AAAI press. https://doi.org/10.1609/aaai.v30i1.10365

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free