Automating XML markup of text documents

2Citations
Citations of this article
74Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Self-Organizing Map (SOM) algorithm to arrange XML marked-up documents on a two-dimensional map so that similar documents appear closer to each other. It then employs an inductive learning algorithm C5 to automatically extract and apply markup rules from the nearest SOM neighbours of an unmarked document. The system is designed to be adaptive, so that once a document is marked-up; its behaviour is modified to improve accuracy. The automatically marked-up documents are again categorized on the Self-Organizing Map.

Cite

CITATION STYLE

APA

Akhtar, S., Reilly, R. G., & Dunnion, J. (2003). Automating XML markup of text documents. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics - Short Papers, HLT-NAACL 2003 (pp. 1–3). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073483.1073484

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free