Universal segmentation of Text with SUMO formalism

  • Quint J
  • Christodoulakis D
N/ACitations
Citations of this article
2Readers
Mendeley users who have this article in their library.

Abstract

We propose a universal formalism for the segmentation of text documents called Sumo. Its main purpose is to help creating segmentation systems for documents in any language. Because the processing is independent of the language, any level of segmentation (be it character, word, sentence, paragraph, etc.) can be considered. We will argue about the usefulness of such a formalism, describe the framework for segmentation on which Sumo relies, and give detailed examples to demonstrate some of its features.

Cite

CITATION STYLE

APA

Quint, J., & Christodoulakis, D. (2000). Universal segmentation of Text with SUMO formalism. In D. N. Christodoulakis (Ed.), Natural Language Processing — NLP 2000 (Vol. 1835, pp. 16–26). Springer Berlin Heidelberg. Retrieved from http://www.springerlink.com/content/wp8mt5fm47d50vrm/

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free