Analysis of selective strategies to build a dependency-analyzed corpus

3Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.

Abstract

This paper discusses sampling strategies for building a dependency-analyzed corpus and analyzes them with different kinds of corpora. We used the Kyoto Text Corpus, a dependency-analyzed corpus of newspaper articles, and prepared the IPAL corpus, a dependency-analyzed corpus of example sentences in dictionaries, as a new and different kind of corpus. The experimental results revealed that the length of the test set controlled the accuracy and that the longest-first strategy was good for an expanding corpus, but this was not the case when constructing a corpus from scratch.

Cite

CITATION STYLE

APA

Ohtake, K. (2006). Analysis of selective strategies to build a dependency-analyzed corpus. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Main Conference Poster Sessions (pp. 635–642). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1273073.1273155

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free