Analysis of selective strategies to build a dependency-analyzed corpus

Kiyonori Ohtake

Conference Proceedings

Analysis of selective strategies to build a dependency-analyzed corpus

Ohtake K

COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Main Conference Poster Sessions (2006) 635-642

DOI: 10.3115/1273073.1273155

3Citations

83Readers

Get full text

Abstract

This paper discusses sampling strategies for building a dependency-analyzed corpus and analyzes them with different kinds of corpora. We used the Kyoto Text Corpus, a dependency-analyzed corpus of newspaper articles, and prepared the IPAL corpus, a dependency-analyzed corpus of example sentences in dictionaries, as a new and different kind of corpus. The experimental results revealed that the length of the test set controlled the accuracy and that the longest-first strategy was good for an expanding corpus, but this was not the case when constructing a corpus from scratch.

Cite

CITATION STYLE

APA

Ohtake, K. (2006). Analysis of selective strategies to build a dependency-analyzed corpus. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Main Conference Poster Sessions (pp. 635–642). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1273073.1273155

Analysis of selective strategies to build a dependency-analyzed corpus

Abstract

Cite

Register to see more suggestions