Modelling Patient Sequences for Rare Disease Detection with Semi-supervised Generative Adversarial Nets

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Rare diseases affect 350 million patients worldwide, but they are commonly delayed in diagnosis or misdiagnosed. The problem of detecting rare disease faces two main challenges: the first being extreme imbalance of data and the second being finding the appropriate features. In this paper, we propose to address the problems by using semi-supervised generative adversarial networks (GANs) to deal with the data imbalance issue and recurrent neural networks (RNNs) to directly model patient sequences. We experimented with detecting patients with a particular rare disease (exocrine pancreatic insufficiency, EPI). The dataset includes 1.8 million patients with 29,149 patients being positive, from a large longitudinal study using 7 years medical claims. Our model achieved 0.56 PR-AUC and outperformed benchmark models in terms of precision and recall.

Cite

CITATION STYLE

APA

Yu, K., Wang, Y., & Cai, Y. (2020). Modelling Patient Sequences for Rare Disease Detection with Semi-supervised Generative Adversarial Nets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11986 LNAI, pp. 141–150). Springer. https://doi.org/10.1007/978-3-030-39098-3_11

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free