An algorithm for finding noun phrase correspondences in bilingual corpora

135Citations
Citations of this article
89Readers
Mendeley users who have this article in their library.

Abstract

The paper describes an algorithm that employs English and French text taggers to associate noun phrases in an aligned bilingual corpus. The tag-gets provide part-of-speech categories which are used by finite-state recognizers to extract simple noun phrases for both languages. Noun phrases are then mapped to each other using an iterative re-estimation algorithm that bears similarities to the Baum-Welch algorithm which is used for training the taggers. The algorithm provides an alternative to other approaches for finding word correspondences, with the advantage that linguistic structure is incorporated. Improvements to the basic algorithm are described, which enable context to be accounted for when constructing the noun phrase mappings.

References Powered by Scopus

Robust part-of-speech tagging using a hidden Markov model

243Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Statistical machine translation

553Citations
N/AReaders
Get full text

A statistical view on bilingual lexicon extraction: From parallel corpora to non-parallel corpora

128Citations
N/AReaders
Get full text

Machine transliteration survey

80Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kupiec, J. (1993). An algorithm for finding noun phrase correspondences in bilingual corpora. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1993-June, pp. 17–22). Association for Computational Linguistics (ACL). https://doi.org/10.3115/981574.981577

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 27

55%

Researcher 11

22%

Professor / Associate Prof. 6

12%

Lecturer / Post doc 5

10%

Readers' Discipline

Tooltip

Computer Science 36

73%

Linguistics 9

18%

Social Sciences 2

4%

Business, Management and Accounting 2

4%

Save time finding and organizing research with Mendeley

Sign up for free