Automatically detecting appropriate subtopic boundaries in a document is a difficult but very useful task in text processing. Several methods have addressed this problem; some have achieved favorable results, but they also have drawbacks, and several are dependent on the application domain. In this work we propose a new algorithm that uses a window below the paragraphs to measure lexical cohesion and thereby detect subtopics in scientific papers. We compare our method against two other algorithms that also rely on lexical cohesion. In this comparison, our method performs well and outperforms both of them. © Springer-Verlag Berlin Heidelberg 2007.
Rojas, L. H., & Medina Pagola, J. E. (2007). TextLec: A novel method of segmentation by topic using lower windows and lexical cohesion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4756 LNCS, pp. 724–733). https://doi.org/10.1007/978-3-540-76725-1_75