Three approaches to word sense disambiguation for Czech

2Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Before building a full wsd system it is necessary to have a balanced and representative corpus annotated with sense tags. This requirement is not certainly fulfilled for the Czech language. Thus, we decided to developsome particular methods for annotating texts and we have started with the most common nouns. In our approach, the disambiguation algorithm based on sets of words (called bags) was used. The advantage of this approach is the possibility of filling bags in various ways. Our ultimate goal is to reduce manual work as much as possible. Here we present three basic ways of filling bags. The first one is based on the machine readable version of SSJČ, the second takes the advantage of learning from manually annotated text and the strategy of pseudoclustering is the third one.

Cite

CITATION STYLE

APA

Král, R. (2001). Three approaches to word sense disambiguation for Czech. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2166, pp. 174–179). Springer Verlag. https://doi.org/10.1007/3-540-44805-5_22

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free