A dynamic programming model for text segmentation based on min-max similarity

Na Ye; Jingbo Zhu; Yan Zheng; Matthew Y. Ma; Huizhen Wang; Bin Zhang

Conference Proceedings

A dynamic programming model for text segmentation based on min-max similarity

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 4993 LNCS 141-152

DOI: 10.1007/978-3-540-68636-1_14

6Citations

8Readers

Get full text

Abstract

Text segmentation has a wide range of applications such as information retrieval, question answering and text summarization. In recent years, the use of semantics has been proven to be effective in improving the performance of text segmentation. Particularly, in finding the subtopic boundaries, there have been efforts in focusing on either maximizing the lexical similarity within a segment or minimizing the similarity between adjacent segments. However, no optimal solutions have been attempted to simultaneously achieve maximum within-segment similarity and minimum between-segment similarity. In this paper, a domain independent model based on min-max similarity (MMS) is proposed in order to fill the void. Dynamic programming is adopted to achieve global optimization of the segmentation criterion function. Comparative experimental results on real corpus have shown that MMS model outperforms previous segmentation approaches. © 2008 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Ye, N., Zhu, J., Zheng, Y., Ma, M. Y., Wang, H., & Zhang, B. (2008). A dynamic programming model for text segmentation based on min-max similarity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4993 LNCS, pp. 141–152). https://doi.org/10.1007/978-3-540-68636-1_14

A dynamic programming model for text segmentation based on min-max similarity

Abstract

Author supplied keywords

Cite

Register to see more suggestions