Statistical Filtering and Subcategorization Frame Acquisition

Anna Korhonen; Genevieve Gorrell; Diana McCarthy

Conference Proceedings

Proceedings of the 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, SIGDAT-EMNLP 2000 - Held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, ACL 2000 (2000) 199-206

DOI: 10.3115/1117794.1117819

Abstract

Research into the automatic acquisition of subcategorization frames (SCFs) from corpora is starting to produce large-scale computational lexicons which include valuable frequency information. However, the accuracy of the resulting lexicons shows room for improvement. One significant source of error lies in the statistical filtering used by some researchers to remove noise from automatically acquired subcategorization frames. In this paper, we compare three different approaches to filtering out spurious hypotheses. Two hypothesis tests perform poorly compared to filtering frames on the basis of relative frequency. We discuss reasons for this and consider directions for future research.
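
To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not say which two hypothesis tests were compared, so a one-sided binomial test is assumed here purely as an example of a hypothesis-test filter. The frame labels, counts, threshold, error probability, and function names are all hypothetical.

```python
from math import comb


def relative_frequency_filter(scf_counts, threshold=0.05):
    """Keep frames whose share of the verb's occurrences meets a threshold.

    scf_counts maps a frame label to how often it was hypothesised for one
    verb. The threshold value is an assumption chosen for illustration.
    """
    total = sum(scf_counts.values())
    if total == 0:
        return set()
    return {scf for scf, n in scf_counts.items() if n / total >= threshold}


def binomial_tail(m, n, p):
    """Return P(X >= m) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(m, n + 1))


def binomial_filter(scf_counts, error_prob=0.02, alpha=0.05):
    """Keep frames seen too often to be explained by noise alone.

    error_prob is an assumed probability that a frame is hypothesised
    spuriously on any one occurrence of the verb; alpha is the assumed
    significance level.
    """
    total = sum(scf_counts.values())
    return {
        scf
        for scf, m in scf_counts.items()
        if binomial_tail(m, total, error_prob) <= alpha
    }


# Hypothetical frame counts for a single verb (invented for illustration).
counts = {"NP": 60, "NP_PP": 30, "SCOMP": 9, "NP_NP": 1}

kept_by_frequency = relative_frequency_filter(counts)
kept_by_test = binomial_filter(counts)
```

On this toy input both filters discard the frame seen only once. With real corpus data the two can disagree, and the choice of threshold or error probability then determines which frames survive.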

Citation (APA)

Korhonen, A., Gorrell, G., & McCarthy, D. (2000). Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, SIGDAT-EMNLP 2000 - Held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, ACL 2000 (pp. 199–206). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1117794.1117819
