We introduce a new dataset consisting of natural language interactions annotated with medical family histories, obtained during interactions with a genetic counselor and through crowdsourcing, following a questionnaire created by experts in the domain. We describe the data collection process and the annotations performed by medical professionals, including illness and personal attributes (name, age, gender, family relationships) for the patient and their family members. An initial system that performs argument identification and relation extraction shows promising results - average F-score of 0.87 on complex sentences on the targeted relations.
CITATION STYLE
Azab, M., Dadian, S., Nastase, V., An, L., & Mihalcea, R. (2019). Towards extracting medical family history from natural language interactions: A new dataset and baselines. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 1255–1260). Association for Computational Linguistics. https://doi.org/10.18653/v1/d19-1122
Mendeley helps you to discover research relevant for your work.