Linguistic Theory in Statistical Language Learning

Christer Samuelsson

Conference ProceedingsOPEN ACCESS

Linguistic Theory in Statistical Language Learning

Samuelsson C

Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, NeMLaP/CoNLL 1998 (1998) 83-89

DOI: 10.3115/1603899.1603915

0Citations

68Readers

Abstract

This article attempts to determine what elements of linguistic theory are used in statistical language learning, and why the extracted language models look like they do. The study indicates that some linguistic elements, such as the notion of a word, are simply too useful to be ignored. The second most important factor seems to be features inherited from the original task for which the technique was used, for example using hidden Markov models for part-of-speech tagging, rather than speech recognition. The two remaining important factors are properties of the runtime processing scheme employing the extracted language model, and the properties of the available corpus resources to which the statistical learning techniques are applied. Deliberate attempts to include linguistic theory seem to end up in a fifth place.

Cite

CITATION STYLE

APA

Samuelsson, C. (1998). Linguistic Theory in Statistical Language Learning. In Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, NeMLaP/CoNLL 1998 (pp. 83–89). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1603899.1603915

Linguistic Theory in Statistical Language Learning

Abstract

Cite

Register to see more suggestions