Abstract
In this paper we describe a novel approach to lexical chain based segmentation of broadcast news stories. Our segmentation system SeLeCT is evaluated with respect to two other lexical cohesion based segmenters TextTiling and C99. Using the Pk and WindowDiff evaluation metrics we show that SeLeCT outperforms both systems on spoken news transcripts (CNN) while the C99 algorithm performs best on the written newswire collection (Reuters). We also examine the differences between spoken and written news styles and how these differences can affect segmentation accuracy.
Cite
CITATION STYLE
Stokes, N. (2003). Spoken and written news story segmentation using lexical chains. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics - Student Research Workshop, HLT-NAACL 2003 (pp. 49–54). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073416.1073425
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.