LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies

Oscar Lithgow; Ivan V. Meza; Albert Orozco; Jorge Gacia Flores; Davide Buscaldi

Conference ProceedingsOPEN ACCESS

LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies

SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (2016) 726-731

DOI: 10.18653/v1/s16-1112

0Citations

73Readers

Abstract

This paper describes the SOPA-N system used by the LIPN-IIMAS team in Semeval 2016 Semantic Textual Similarity (Task 1). We based our work on the SOPA 2015 system. The SOPA-2015 system used 16 similarity features (including Wordnet, Information Retrieval and Syntactic Dependencies) within a Random Forest learning model. We expanded this system with an Align and Differentiate based strategy, word embeddings and penalization, which showed 6.8% of improvement on the development set. However, we found that on the evaluation data for the 2016 STS shared task, the 2015 system outperformed our newer systems.

Cite

CITATION STYLE

APA

Lithgow, O., Meza, I. V., Orozco, A., Flores, J. G., & Buscaldi, D. (2016). LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies. In SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (pp. 726–731). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s16-1112

LIPN-IIMAS at SemEval-2016 Task 1: Random forest regression experiments on align-and-differentiate and word embeddings penalizing strategies

Abstract

Cite

Register to see more suggestions