Strategy-based technology for estimating mt quality

5Citations
Citations of this article
77Readers
Mendeley users who have this article in their library.

Abstract

This paper introduces our SAU-KERC system that achieved F1 score of 0.39 in the world-level quality estimation task in WMT2015. The goal is to assign each translated word a "OK" or "BAD" label indicating translation quality. We adopt the sequence labeling model, conditional random fields (CRF), to predict the labels. Since "BAD" labels are rare in the training and development sets, recognition rate of "BAD" is low. To solve this problem, we propose two strategies. One is to replace "OK" label with sub-labels to balance label distribution. The other is to reconstruct the training set to include more "BAD" words.

Cite

CITATION STYLE

APA

Shang, L., Cai, D., & Ji, D. (2015). Strategy-based technology for estimating mt quality. In 10th Workshop on Statistical Machine Translation, WMT 2015 at the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Proceedings (pp. 348–352). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-3042

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free