Learning to recognize ancillary information for automatic paraphrase identification

Simone Filice; Alessandro Moschitti

Conference ProceedingsOPEN ACCESS

Learning to recognize ancillary information for automatic paraphrase identification

2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (2016) 1109-1114

DOI: 10.18653/v1/n16-1129

4Citations

78Readers

Abstract

Previous work on Automatic Paraphrase Identification (PI) is mainly based on modeling text similarity between two sentences. In contrast, we study methods for automatically detecting whether a text fragment only appearing in a sentence of the evaluated sentence pair is important or ancillary information with respect to the paraphrase identification task. Engineering features for this new task is rather difficult, thus, we approach the problem by representing text with syntactic structures and applying tree kernels on them. The results show that the accuracy of our automatic Ancillary Text Classifier (ATC) is promising, i.e., 68.6%, and its output can be used to improve the state of the art in PI.

Cite

CITATION STYLE

APA

Filice, S., & Moschitti, A. (2016). Learning to recognize ancillary information for automatic paraphrase identification. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 1109–1114). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n16-1129

Learning to recognize ancillary information for automatic paraphrase identification

Abstract

Cite

Register to see more suggestions