Query expansion for mining translation knowledge from comparable data

Lu Xiang; Yu Zhou; Jie Hao; Dakun Zhang

Journal Article

Query expansion for mining translation knowledge from comparable data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8801 200-211

DOI: 10.1007/978-3-319-12277-9_18

0Citations

3Readers

Get full text

Abstract

When mining parallel text from comparable corpora, we confront vast search space since parallel sentence or sub-sentential fragments can be scattered throughout the source and target corpus. To reduce the search space, most previous approaches have tried to use heuristics to mine comparable documents. However, these heuristics are only available in few cases. Instead, we go on a different direction and adopt the cross-language information retrieval (CLIR) framework to find translation candidates directly at sentence level from comparable corpus. What’s more, for the sake of better retrieval result, two simple but effective query expansion methods are proposed. Experimental results show that using our query expansion methods can help to improve the recall significantly and obtain candidates of sentence pairs with high quality. Thus, our methods can help to make good preparation for extracting both parallel sentences and fragments subsequently.

Author supplied keywords

Cite

CITATION STYLE

APA

Xiang, L., Zhou, Y., Hao, J., & Zhang, D. (2014). Query expansion for mining translation knowledge from comparable data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8801, 200–211. https://doi.org/10.1007/978-3-319-12277-9_18

Query expansion for mining translation knowledge from comparable data

Abstract

Author supplied keywords

Cite

Register to see more suggestions