Sampling Tree Fragments from Forests

5Citations
Citations of this article
88Readers
Mendeley users who have this article in their library.

Abstract

We study the problem of sampling trees from forests, in the setting where probabilities for each tree may be a function of arbitrarily large tree fragments. This setting extends recent work for sampling to learn Tree Substitution Grammars to the case where the tree structure (TSG derived tree) is not fixed. We develop a Markov chain Monte Carlo algorithm which corrects for the bias introduced by unbalanced forests, and we present experiments using the algorithm to learn Synchronous Context-Free Grammar rules for machine translation. In this application, the forests being sampled represent the set of Hiero-style rules that are consistent with fixed input word-level alignments.We demonstrate equivalent machine translation performance to standard techniques but with much smaller grammars. © 2014 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Chung, T., Fang, L., Gildea, D., & Štefankovič, D. (2014). Sampling Tree Fragments from Forests. Computational Linguistics, 40(1), 204–229. https://doi.org/10.1162/COLI_a_00170

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free