Collostructional analysis is a technique devised to find correlations between particular words and linguistic constructions in order to analyse meaning associations of these constructions. Contrasting collostructional analysis results with output from BERT might provide insights into the way BERT represents the meaning of linguistic constructions. This study tests to what extent English BERT's meaning representations correspond to known constructions from the linguistics literature by means of two tasks that we propose. Firstly, by predicting the words that can be used in open slots of constructions, the meaning associations of more lexicalized constructions can be observed. Secondly, by finding similar sequences using BERT's output embeddings and manually reviewing the resulting sentences, we can observe whether instances of less lexicalized constructions are clustered together in semantic space. These two methods show that BERT represents constructional meaning to a certain extent, but does not separate instances of a construction from a near-synonymous construction that has a different form.
CITATION STYLE
Veenboer, T., & Bloem, J. (2023). Using Collostructional Analysis to evaluate BERT’s representation of linguistic constructions. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 12937–12951). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.819
Mendeley helps you to discover research relevant for your work.