Selective text utilization and text traversal

34Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Many large collections of full-text documents are currently stored in machine-readable form and are processed automatically in various ways. These collections may include different types of documents, such as messages, research articles, and books, and the subject matter may vary widely. To process such collections, robust text analysis methods must be used, capable of handling materials in arbitrary subject areas, and flexible access must be provided to texts and text excerpts of varying size. In this study, global text comparison methods are used to identify similarities between text elements, followed by local context-checking operations that resolve ambiguities and distinguish superficially similar texts from texts that actually cover identical topics. A linked text structure is then created that relates similar texts at various levels of detail. In particular, text links are available for full texts, as well as text sections, paragraphs, and sentence groups. The linked structures are usable to identify important text passages, to traverse texts selectively both within particular documents and between documents, and to provide flexible text access to large text collections in response to various kinds of user needs. An automated 29-volume encyclopedia is used as an example to illustrate the text accessing and traversal operations.

Cited by Powered by Scopus

Automatic text structuring and summarization

328Citations
N/AReaders
Get full text

Machine learning for text

207Citations
N/AReaders
Get full text

Automatic structuring and retrieval of large text files

145Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Salton, G., & Allan, J. (1993). Selective text utilization and text traversal. In Proceedings of the ACM Conference on Hypertext (pp. 131–144). Publ by ACM. https://doi.org/10.1145/168750.168809

Readers over time

‘10‘14‘16‘1700.751.52.253

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

50%

Researcher 2

33%

Professor / Associate Prof. 1

17%

Readers' Discipline

Tooltip

Computer Science 5

83%

Arts and Humanities 1

17%

Save time finding and organizing research with Mendeley

Sign up for free
0