This paper outlines and categorizes ways of using syntactic information in a number of algorithms for determining the semantic similarity of short texts. We consider the use of word order information, part-of-speech tagging, parsing and semantic role labeling. We analyze and evaluate the effects of syntax usage on algorithm performance by utilizing the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. We also propose a new classification of algorithms based on their applicability to languages with scarce natural language processing tools.
CITATION STYLE
Batanovic, V., & Bojic, D. (2014). Evaluation and classification of syntax usage in determining short-text semantic similarity. Telfor Journal, 6(1), 64–68. https://doi.org/10.5937/telfor1401064b
Mendeley helps you to discover research relevant for your work.