The article describes the way of automatic segmentation of natural language text into fragments with different functional semantics. The proposed solution is based on the analysis of how the various parts of speech are distributed through the text. The amount and variety of nouns, verbs and adjectives is calculated for a set of sliding windows with the same length. The text is divided into fragments using clustering of windows set. We considered two clustering methods: ISODATA and a method based on the minimum spanning tree. The results of comparison of the methods with each other and with the manually text markup are shown.
CITATION STYLE
Krayvanova, V., & Duka, S. (2016). Unsupervised trained functional discourse parser for e-learning materials scaffolding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9811 LNCS, pp. 722–728). Springer Verlag. https://doi.org/10.1007/978-3-319-43958-7_88
Mendeley helps you to discover research relevant for your work.