The small world topology is known widespread in biological, social and man-made systems. This paper shows that the small world structure also exists in documents, such as papers. A document is represented by a network; the nodes represent terms, and the edges represent the co-occurrence of terms. This network is shown to have the characteristics of being a small world, i.e., nodes are highly clustered yet the path length between them is small. Based on the topology, we develop an indexing system called KeyWorld, which extracts important terms by measuring their contribution to the graph being small world.
CITATION STYLE
Matsuo, Y., Ohsawa, Y., & Ishizuka, M. (2001). KeyWorld: Extracting keywords from a document as a small world. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2226, pp. 271–281). Springer Verlag. https://doi.org/10.1007/3-540-45650-3_24
Mendeley helps you to discover research relevant for your work.