Scalable multilingual information access

Paul McNamee; James Mayfield

Journal Article

Scalable multilingual information access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2785 207-218

DOI: 10.1007/978-3-540-45237-9_17

5Citations

3Readers

Get full text

Abstract

The third Cross-Language Evaluation Forum workshop (CLEF-2002) provides the unprecedented opportunity to evaluate retrieval in eight different languages using a common set of topics and a uniform assessment methodology. This year the Johns Hopkins University Applied Physics Laboratory participated in the monolingual, bilingual, and multilingual retrieval tasks. We contend that information access in a plethora of languages requires approaches that are inexpensive in developer and run-time costs. In this paper we describe a simplified approach that seems suitable for retrieval in many languages; we also show how good retrieval is possible over many languages, even when translation resources are scarce, or when query-time translation is infeasible. In particular, we investigate the use of character n-grams for monolingual retrieval, CLIR between related languages using partial morphological matches, and translation of document representations to an interlingua for computationally efficient retrieval against multiple languages. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

McNamee, P., & Mayfield, J. (2003). Scalable multilingual information access. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2785, 207–218. https://doi.org/10.1007/978-3-540-45237-9_17

Scalable multilingual information access

Abstract

Cite

Register to see more suggestions