Using loglinear clustering for subcategorization identification

Nuno Miguel Marques; Gabriel Pereira Lopes; Carlos Agra Coelho

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1998) 1510 379-387

DOI: 10.1007/bfb0094841

Abstract

In this paper we describe a process for mining syntactic verbal subcategorization, i.e. information about the kinds of phrases or clauses a verb goes with. We use a large text corpus of almost 10,000,000 tagged words as our resource material. Loglinear modeling is used to analyze and automatically identify the subcategorization dependencies. An unsupervised clustering algorithm is used to accurately determine verbal subcategorization frames. In this paper we tackle only verbal subcategorization of noun phrases and prepositional phrases. A sample of 81 Portuguese verbs was used for evaluation purposes: 97% precision and 99% recall were obtained for noun phrases, and 92% precision and 100% recall for prepositional phrases.
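As a rough illustration of the kind of dependency test that loglinear modeling supports, the sketch below computes the likelihood-ratio statistic G² for the independence model on a two-way contingency table (for example, verb present or absent against noun phrase following or not). This is a generic textbook statistic, not necessarily the model specified in the paper; the function name `g_squared` and all counts are hypothetical and chosen only for illustration.

```python
import math


def g_squared(table):
    """Likelihood-ratio statistic G^2 for the independence loglinear
    model on a two-way contingency table (list of rows of counts)."""
    total = sum(sum(row) for row in table)
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]
    g2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            if observed == 0:
                continue  # a zero cell contributes nothing to the sum
            expected = row_sums[i] * col_sums[j] / total
            g2 += 2.0 * observed * math.log(observed / expected)
    return g2


# Hypothetical counts, not taken from the paper's corpus.
# Rows: target verb / any other verb.
# Columns: followed by a noun phrase / not followed by one.
counts = [[30, 10],
          [10, 30]]
print(g_squared(counts))
```

A large G² relative to the chi-squared critical value (about 3.84 for one degree of freedom at the 5% level) suggests that the verb and the following phrase type are not independent, which is the sort of evidence a subcategorization frame would rest on.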

Citation (APA)

Marques, N. M., Lopes, G. P., & Coelho, C. A. (1998). Using loglinear clustering for subcategorization identification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1510, pp. 379–387). Springer Verlag. https://doi.org/10.1007/bfb0094841
