Abstract
We present a comparison of word-based and character-based sequence-to-sequence models for data-to-text natural language generation, i.e., models that generate natural language descriptions for structured inputs. On the datasets of two recent generation challenges, our models achieve automatic evaluation results comparable to or better than those of the best challenge submissions. Subsequent detailed statistical and human analyses shed light on the differences between the two input representations and the diversity of the generated texts. In a controlled experiment with synthetic training data generated from templates, we demonstrate the ability of neural models to learn novel combinations of the templates and thereby generalize beyond the linguistic structures they were trained on.
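To make the contrast between the two input representations concrete, the following is a minimal sketch of how a structured input could be fed to each model type, assuming an E2E-challenge-style attribute-value meaning representation; the linearization format and all function names are illustrative assumptions, not taken from the paper.

# Minimal sketch: word- vs. character-based input processing for
# data-to-text generation, assuming an E2E-style attribute-value
# meaning representation (MR). All names here are illustrative.

def linearize_mr(mr: dict) -> str:
    """Flatten a structured MR into a single string, e.g.
    'name[The Punter] food[Italian] priceRange[cheap]'."""
    return " ".join(f"{attr}[{value}]" for attr, value in mr.items())

def word_level_input(mr: dict) -> list:
    # Word-based processing: the encoder reads whitespace-separated
    # tokens, so rare attribute values can fall out of the vocabulary.
    return linearize_mr(mr).split()

def char_level_input(mr: dict) -> list:
    # Character-based processing: the encoder reads one character at a
    # time, trading longer sequences for an open vocabulary.
    return list(linearize_mr(mr))

mr = {"name": "The Punter", "food": "Italian", "priceRange": "cheap"}
print(word_level_input(mr))
# ['name[The', 'Punter]', 'food[Italian]', 'priceRange[cheap]']
print(char_level_input(mr)[:10])
# ['n', 'a', 'm', 'e', '[', 'T', 'h', 'e', ' ', 'P']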
Citation
Jagfeld, G., Jenne, S., & Vu, N. T. (2018). Sequence-to-sequence models for data-to-text natural language generation: Word- vs. character-based processing and output diversity. In Proceedings of the 11th International Conference on Natural Language Generation (INLG 2018) (pp. 221–232). Association for Computational Linguistics. https://doi.org/10.18653/v1/W18-6529