This work presents the application of INTHELEX, an incremental first-order logic learning system, to learning rules for the automatic identification of a wide range of significant document classes and their related components. Specifically, the material consists of multi-format cultural heritage documents concerning European films from the 1920s and 1930s, provided by the EU project COLLATE. Incrementality plays a key role when the set of documents is continuously augmented. To check that incrementality causes no performance loss with respect to classical one-step systems, a comparison with Progol was carried out. Experimental results show that the proposed approach is a viable solution, in terms of both its performance and its effectiveness in the document processing domain.
Basile, T. M. A., Ferilli, S., Di Mauro, N., & Esposito, F. (2004). Incremental induction of classification rules for cultural heritage documents. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3029, pp. 915–923). Springer Verlag. https://doi.org/10.1007/978-3-540-24677-0_94