Unsupervised language acquisition: syntax from plain corpus

  • Horn D
  • Solan Z
  • Ruppin E
  • et al.

Abstract

We describe results of a novel algorithm for grammar induction from a large corpus. The ADIOS (Automatic DIstillation of Structure) algorithm searches for significant patterns, chosen according to context-dependent statistical criteria, and builds a hierarchy of such patterns according to a set of rules that lead to structured generalization. The corpus is thus generalized into a context-free grammar (CFG) composed of patterns, equivalence classes, and words of the initial lexicon. We have evaluated our method both on corpora generated by CFGs and on natural-language corpora. The performance of ADIOS is judged by two measures: recall (acceptance of correct novel sentences) and precision (production of correct novel sentences). The results are very encouraging.
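The two evaluation measures named in the abstract can be illustrated with a small sketch. This is not the ADIOS algorithm and not the authors' evaluation protocol. It assumes two hand-written toy grammars (`TARGET` and `LEARNED`, both hypothetical) and, as a simplification, compares their full bounded languages rather than held-out novel sentences. Recall is taken as the fraction of target sentences the learned grammar accepts, and precision as the fraction of sentences the learned grammar produces that the target grammar accepts.

```python
from itertools import product

def language(grammar, symbol="S", depth=4):
    """Enumerate sentences derivable from `symbol` within `depth` rule expansions."""
    if symbol not in grammar:      # a symbol with no rules is a terminal word
        return {(symbol,)}
    if depth == 0:                 # expansion budget exhausted
        return set()
    sentences = set()
    for rhs in grammar[symbol]:
        parts = [language(grammar, s, depth - 1) for s in rhs]
        for combo in product(*parts):
            sentences.add(tuple(word for part in combo for word in part))
    return sentences

def recall(target_lang, learned_lang):
    """Fraction of correct (target) sentences that the learned grammar accepts."""
    return len(target_lang & learned_lang) / len(target_lang)

def precision(target_lang, learned_lang):
    """Fraction of sentences produced by the learned grammar that are correct."""
    return len(target_lang & learned_lang) / len(learned_lang)

# Hypothetical target grammar: "the cat|dog runs|sleeps".
TARGET = {
    "S": [["NP", "VP"]],
    "NP": [["the", "N"]],
    "N": [["cat"], ["dog"]],
    "VP": [["runs"], ["sleeps"]],
}

# Hypothetical learned grammar built from equivalence classes E1 and E2.
# E2 wrongly includes "cat", so the grammar over-generalizes.
LEARNED = {
    "S": [["the", "E1", "E2"]],
    "E1": [["cat"], ["dog"]],
    "E2": [["runs"], ["sleeps"], ["cat"]],
}

target_lang = language(TARGET)
learned_lang = language(LEARNED)

print("recall:", recall(target_lang, learned_lang))
print("precision:", precision(target_lang, learned_lang))
```

In this toy case the learned grammar accepts every target sentence but also produces strings such as "the cat cat", so its recall is perfect while its precision is lower. This is the trade-off the two measures are meant to expose.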

Cite

Horn, D., Solan, Z., Ruppin, E., & Edelman, S. (2004). Unsupervised language acquisition: syntax from plain corpus. … on Human Language. Retrieved from http://horn.tau.ac.il/~horn/publications/newcastle.pdf
