We propose a novel method for measuring semantic similarity between two sentences. The method exploits both syntactic and semantic features to assess the similarity. In our method, words in a sentence are weighted using their information content. The weights of words help differentiate their contribution towards the meaning of the sentence. The originality of this research is that we explore named entities and their coreference relations as important indicators for measuring the similarity. We conduct experiments and evaluate our proposed method on Microsoft Research Paraphrase Corpus. The experiment results show that named entities and their coreference relations improve significantly the performance of paraphrase identification and the proposed method is comparable with state-of-the-art methods for paraphrase identification.
CITATION STYLE
Nguyen, H. T., Duong, P. H., & Le, T. Q. (2015). A multifaceted approach to sentence similarity. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 9376, pp. 303–314). Springer Verlag. https://doi.org/10.1007/978-3-319-25135-6_29
Mendeley helps you to discover research relevant for your work.