Attention-based models are successful when trained on large amounts of data. In this paper, we demonstrate that even in the low-resource scenario, attention can be learned effectively. To this end, we start with discrete human-annotated rationales and map them into continuous attention. Our central hypothesis is that this mapping is general across domains, and thus can be transferred from resource-rich domains to low-resource ones. Our model jointly learns a domain-invariant representation and induces the desired mapping between rationales and attention. Our empirical results validate this hypothesis and show that our approach delivers significant gains over state-of-the-art baselines, yielding over 15% average error reduction on benchmark datasets.
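As a purely illustrative sketch (not the authors' released implementation), the core idea of mapping discrete human rationales to continuous attention could be prototyped roughly as follows in PyTorch, assuming contextual token representations and a binary per-token rationale mask as inputs; the class and parameter names here are hypothetical.

```python
# Hypothetical sketch: learn soft attention weights from 0/1 rationale flags
# plus token representations. Not the paper's actual architecture.
import torch
import torch.nn as nn

class RationaleToAttention(nn.Module):
    """Maps per-token rationale indicators (0/1) and contextual token
    representations to a normalized attention distribution."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        # Score each token from its representation concatenated with its rationale flag.
        self.scorer = nn.Sequential(
            nn.Linear(hidden_dim + 1, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, token_reprs: torch.Tensor, rationale_mask: torch.Tensor) -> torch.Tensor:
        # token_reprs: (batch, seq_len, hidden_dim)
        # rationale_mask: (batch, seq_len), 1 where a human marked the token
        features = torch.cat([token_reprs, rationale_mask.unsqueeze(-1).float()], dim=-1)
        scores = self.scorer(features).squeeze(-1)   # (batch, seq_len)
        return torch.softmax(scores, dim=-1)         # continuous attention over tokens

if __name__ == "__main__":
    batch, seq_len, hidden = 2, 5, 16
    reprs = torch.randn(batch, seq_len, hidden)
    mask = torch.randint(0, 2, (batch, seq_len))
    attn = RationaleToAttention(hidden)(reprs, mask)
    print(attn.shape, attn.sum(dim=-1))  # (2, 5); each row sums to 1
```

In the transfer setting described above, such a mapping would be trained on resource-rich domains and reused, together with a domain-invariant encoder, to supervise attention in the low-resource domain.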
Citation
Bao, Y., Chang, S., Yu, M., & Barzilay, R. (2018). Deriving machine attention from human rationales. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 (pp. 1903–1913). Association for Computational Linguistics. https://doi.org/10.18653/v1/d18-1216