Deep learning models for automatic readability assessment generally discard linguistic features traditionally used in machine learning models for the task. We propose to incorporate linguistic features into neural network models by learning syntactic dense embeddings based on linguistic features. To cope with the relationships between the features, we form a correlation graph among features and use it to learn their embeddings so that similar features will be represented by similar embeddings. Experiments with six data sets of two proficiency levels demonstrate that our proposed methodology can complement BERT-only model to achieve significantly better performances for automatic readability assessment.
CITATION STYLE
Qiu, X., Chen, Y., Chen, H., Nie, J. Y., Shen, Y., & Lu, D. (2021). Learning syntactic dense embedding with correlation graph for automatic readability assessment. In ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 3013–3025). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.acl-long.235
Mendeley helps you to discover research relevant for your work.