Learning to recognize ancillary information for automatic paraphrase identification

3Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

Abstract

Previous work on Automatic Paraphrase Identification (PI) is mainly based on modeling text similarity between two sentences. In contrast, we study methods for automatically detecting whether a text fragment only appearing in a sentence of the evaluated sentence pair is important or ancillary information with respect to the paraphrase identification task. Engineering features for this new task is rather difficult, thus, we approach the problem by representing text with syntactic structures and applying tree kernels on them. The results show that the accuracy of our automatic Ancillary Text Classifier (ATC) is promising, i.e., 68.6%, and its output can be used to improve the state of the art in PI.

Cite

CITATION STYLE

APA

Filice, S., & Moschitti, A. (2016). Learning to recognize ancillary information for automatic paraphrase identification. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 1109–1114). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n16-1129

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free