LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies

0Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

Abstract

This paper describes the SOPA-N system used by the LIPN-IIMAS team in Semeval 2016 Semantic Textual Similarity (Task 1). We based our work on the SOPA 2015 system. The SOPA-2015 system used 16 similarity features (including Wordnet, Information Retrieval and Syntactic Dependencies) within a Random Forest learning model. We expanded this system with an Align and Differentiate based strategy, word embeddings and penalization, which showed 6.8% of improvement on the development set. However, we found that on the evaluation data for the 2016 STS shared task, the 2015 system outperformed our newer systems.

Cite

CITATION STYLE

APA

Lithgow, O., Meza, I. V., Orozco, A., Flores, J. G., & Buscaldi, D. (2016). LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies. In SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (pp. 726–731). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s16-1112

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free