A joint model for document segmentation and segment labeling

Abstract

Text segmentation aims to uncover latent structure by dividing text from a document into coherent sections. Where previous work on text segmentation considers the tasks of document segmentation and segment labeling separately, we show that the tasks contain complementary information and are best addressed jointly. We introduce the Segment Pooling LSTM (S-LSTM) model, which is capable of jointly segmenting a document and labeling segments. In support of joint training, we develop a method for teaching the model to recover from errors by aligning the predicted and ground truth segments. We show that S-LSTM reduces segmentation error by 30% on average, while also improving segment labeling.
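To make the idea of pooling over segments concrete, here is a minimal sketch of the step that a "segment pooling" layer would need to perform: given per-sentence vectors and predicted segment boundaries, produce one vector per segment that a labeler could then classify. The abstract does not specify the encoder, the pooling function, or the alignment procedure, so everything here is an assumption made for illustration: mean pooling, the boundary convention (a 1 marks the first sentence of a segment), and the function names `segment_spans` and `pool_segments` are all hypothetical.

```python
from typing import List, Sequence


def segment_spans(boundaries: Sequence[int]) -> List[range]:
    """Turn per-sentence boundary flags into index ranges, one per segment.

    Assumed convention: boundaries[i] == 1 means sentence i starts a new
    segment. Sentence 0 always starts a segment, whatever its flag says.
    """
    starts = [i for i, b in enumerate(boundaries) if b == 1 or i == 0]
    ends = starts[1:] + [len(boundaries)]
    return [range(s, e) for s, e in zip(starts, ends)]


def pool_segments(
    sentence_vecs: Sequence[Sequence[float]],
    boundaries: Sequence[int],
) -> List[List[float]]:
    """Mean-pool sentence vectors inside each segment (an assumed choice)."""
    pooled = []
    for span in segment_spans(boundaries):
        dim = len(sentence_vecs[0])
        pooled.append(
            [sum(sentence_vecs[i][d] for i in span) / len(span) for d in range(dim)]
        )
    return pooled


# Three sentences; the third one starts a second segment.
vecs = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
flags = [1, 0, 1]
segments = pool_segments(vecs, flags)
```

In a joint model, the boundary flags would come from the segmentation predictions rather than being supplied by hand, which is why segmentation mistakes propagate into the labels and why the abstract mentions teaching the model to recover from such errors.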

Cite

Barrow, J., Jain, R., Morariu, V. I., Manjunatha, V., Oard, D. W., & Resnik, P. (2020). A joint model for document segmentation and segment labeling. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 313–322). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.29
