DMLM: Descriptive Masked Language Modeling


Abstract

Over the last few years, Masked Language Modeling (MLM) pre-training has resulted in remarkable advancements in many Natural Language Understanding (NLU) tasks, which has sparked interest in researching alternatives and extensions to the MLM objective. In this paper, we tackle the absence of explicit semantic grounding in MLM and propose Descriptive Masked Language Modeling (DMLM), a knowledge-enhanced reading comprehension objective in which the model is required to predict the most likely word in a context, given the word's definition. For instance, given the sentence "I was going to the _", if the provided definition were "financial institution", the model would have to predict the word "bank"; if, instead, it were "sandy seashore", the model should predict "beach". Our evaluation highlights the effectiveness of DMLM in comparison with standard MLM, showing improvements on a number of well-established NLU benchmarks, as well as on other semantics-focused tasks, e.g., Semantic Role Labeling. Furthermore, we demonstrate how to take full advantage of DMLM to embed explicit semantics in downstream tasks, explore several properties of DMLM-based contextual representations, and suggest a number of future directions to investigate.
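To make the objective concrete, the following is a minimal, hypothetical sketch of how a DMLM training instance might be constructed: the target word is masked in its context and paired with a definition (gloss) that the model conditions on. The function name, the `[MASK]`/`[SEP]` tokens, and the input layout are illustrative assumptions, not the paper's actual implementation.

```python
# Illustrative sketch of a DMLM training instance (assumed format,
# not the authors' code): mask the target word in the context and
# append its definition after a separator token.

MASK = "[MASK]"
SEP = "[SEP]"

def make_dmlm_input(context: str, target: str, definition: str):
    """Return (model_input, label): the context with `target` masked,
    followed by the gloss the model must use to recover it."""
    tokens = context.split()
    if target not in tokens:
        raise ValueError(f"target {target!r} not found in context")
    masked = " ".join(MASK if tok == target else tok for tok in tokens)
    # The model predicts `target` at the masked position, conditioned
    # on the appended definition.
    return f"{masked} {SEP} {definition}", target

# The paper's running example: the same masked context is disambiguated
# by two different glosses.
inp_bank, label_bank = make_dmlm_input(
    "I was going to the bank", "bank", "financial institution")
inp_beach, label_beach = make_dmlm_input(
    "I was going to the beach", "beach", "sandy seashore")
```

In practice, the same masked context "I was going to the [MASK]" yields different gold predictions depending on the gloss supplied, which is exactly the semantic grounding that plain MLM lacks.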

Citation (APA)

Barba, E., Campolungo, N., & Navigli, R. (2023). DMLM: Descriptive Masked Language Modeling. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 12770–12788). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.808
