Abstract
Text generation systems are ubiquitous in natural language processing applications. However, evaluation of these systems remains a challenge, especially in multilingual settings. In this paper, we propose L'AMBRE - a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. We present a way to automatically extract various rules governing morphosyntax directly from dependency treebanks. To tackle the noisy outputs from text generation systems, we propose a simple methodology to train robust parsers. We show the effectiveness of our metric on the task of machine translation through a diachronic study of systems translating into morphologically-rich languages.
Cite
CITATION STYLE
Pratapa, A., Anastasopoulos, A., Rijhwani, S., Chaudhary, A., Mortensen, D. R., Neubig, G., & Tsvetkov, Y. (2021). Evaluating the Morphosyntactic Well-formedness of Generated Texts. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 7131–7150). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.570
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.