A simple pattern-matching algorithm for recovering empty nodes and their antecedents

76Citations
Citations of this article
99Readers
Mendeley users who have this article in their library.

Abstract

This paper describes a simple patternmatching algorithm for recovering empty nodes and identifying their co-indexed antecedents in phrase structure trees that do not contain this information. The patterns are minimal connected tree fragments containing an empty node and all other nodes co-indexed with it. This paper also proposes an evaluation procedure for empty node recovery procedures which is independent of most of the details of phrase structure, which makes it possible to compare the performance of empty node recovery on parser output with the empty node annotations in a goldstandard corpus. Evaluating the algorithm on the output of Charniak's parser (Charniak, 2000) and the Penn treebank (Marcus et al., 1993) shows that the patternmatching algorithm does surprisingly well on the most frequently occuring types of empty nodes given its simplicity.

Cite

CITATION STYLE

APA

Johnson, M. (2002). A simple pattern-matching algorithm for recovering empty nodes and their antecedents. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2002-July, pp. 136–143). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073083.1073107

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free