Adaptation of algorithms for medical information retrieval for working on russian-language text content

Aleksandra Vatian; Natalia Dobrenko; Anastasia Makarenko; Niyaz Nigmatullin; Nikolay Vedernikov; Artem Vasilev; Andrey Stankevich; Natalia Gusarova; Anatoly Shalyto

Conference Proceedings

Adaptation of algorithms for medical information retrieval for working on russian-language text content

Lecture Notes in Computer Science (2018) 11107 LNAI 106-114

DOI: 10.1007/978-3-030-00794-2_11

2Citations

12Readers

Get full text

Abstract

The paper investigates the possibilities of adapting various ADR algorithms to the Russian language environment. In general, the ADR detection process consists of 4 steps: (1) data collection from social media; (2) classification/filtering of ADR assertive text segments; (3) extraction of ADR mentions from text segments; (4) analysis of extracted ADR mentions for signal generation. The implementation of each step in the Russian-language environment is associated with a number of difficulties in comparison with the traditional English-speaking environment. First of all, they are connected with the lack of necessary databases and specialized language resources. In addition, an important negative role is played by the complex grammatical structure of the Russian language. The authors present various methods of machine learning algorithms adaptation in order to overcome these difficulties. For step 3 on the material of Russian-language text forums using the ensemble classifier, the Accuracy = 0.805 was obtained. For step 4 on the material of Russian-language EHR, by adapting pyConTextNLP, the F-measure = 0.935 was obtained, and by adapting ConText algorithm, the F-measure = 0.92–0.95 was obtained. A method for full-scale performing of step 4 was developed using cue-based and rule-based approaches, and the F-measure = 67.5% was obtained that is quite comparable to baseline.

Author supplied keywords

Cite

CITATION STYLE

APA

Vatian, A., Dobrenko, N., Makarenko, A., Nigmatullin, N., Vedernikov, N., Vasilev, A., … Shalyto, A. (2018). Adaptation of algorithms for medical information retrieval for working on russian-language text content. In Lecture Notes in Computer Science (Vol. 11107 LNAI, pp. 106–114). Springer Verlag. https://doi.org/10.1007/978-3-030-00794-2_11

Adaptation of algorithms for medical information retrieval for working on russian-language text content

Abstract

Author supplied keywords

Cite

Register to see more suggestions