Advanced similarity measures using word embeddings and siamese networks in CBR

Abstract

Automatic fuzzy text processing, context extraction and disambiguation are three challenging research areas with high relevance to complex business domains. Business knowledge is found in plain-text message exchanges, emails, support tickets, internal chat messengers and other transient channels, which makes decoding text-based domain knowledge a challenging task. Traditional natural language processing approaches focus on a comprehensive representation of business knowledge and any relevant mappings. However, such approaches can be highly complex, not cost-effective and expensive to maintain, especially in environments that change frequently. This work applies LSTM Siamese networks to measure text similarity in ambiguous domains. We implement the Manhattan LSTM (MaLSTM) Siamese neural network for semi-automatic acquisition of business knowledge and for decoding the domain-relevant features that enable building similarity measures. Our aim is to minimize the effort required from human experts while extracting domain knowledge from rich text that contains context-free abbreviations, grammatically incorrect text and mixed language.
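
The abstract does not spell out the similarity function. As commonly described in the literature, a MaLSTM passes both texts through the same encoder and scores the pair as exp(-L1 distance) between the two resulting vectors. The sketch below illustrates only that scoring step and should not be read as the authors' implementation: the embedding table is invented, and `encode` is a hypothetical stand-in that averages word vectors where the real model would run a trained LSTM.

```python
import math

# Hypothetical 3-dimensional word embeddings, invented for illustration only.
EMBEDDINGS = {
    "reset": [0.9, 0.1, 0.0],
    "password": [0.8, 0.2, 0.1],
    "pwd": [0.8, 0.2, 0.1],      # abbreviation mapped close to "password"
    "invoice": [0.0, 0.9, 0.7],
    "overdue": [0.1, 0.8, 0.9],
}
UNKNOWN = [0.0, 0.0, 0.0]  # fallback vector for out-of-vocabulary tokens


def encode(text):
    """Stand-in for the shared LSTM encoder (assumption): mean of word embeddings."""
    tokens = text.lower().split()
    if not tokens:
        return list(UNKNOWN)
    vectors = [EMBEDDINGS.get(tok, UNKNOWN) for tok in tokens]
    dim = len(UNKNOWN)
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def manhattan_similarity(h_a, h_b):
    """exp(-L1 distance): 1.0 for identical vectors, tending to 0 as they diverge."""
    l1 = sum(abs(a - b) for a, b in zip(h_a, h_b))
    return math.exp(-l1)


def similarity(text_a, text_b):
    # Siamese setup: both inputs go through the same encoder with the same weights.
    return manhattan_similarity(encode(text_a), encode(text_b))


if __name__ == "__main__":
    print(similarity("reset password", "reset pwd"))
    print(similarity("reset password", "invoice overdue"))
```

Because the score is bounded in (0, 1], it can serve directly as a local similarity measure in a CBR retrieval step without further normalization.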

Cite

Amin, K., Lancaster, G., Kapetanakis, S., Althoff, K. D., Dengel, A., & Petridis, M. (2020). Advanced similarity measures using word embeddings and siamese networks in CBR. In Advances in Intelligent Systems and Computing (Vol. 1038, pp. 449–462). Springer Verlag. https://doi.org/10.1007/978-3-030-29513-4_32
