Synonyms extraction using Web content focused crawling

Chien Hsing Chen; Chung Chian Hsu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 4993 LNCS 286-297

DOI: 10.1007/978-3-540-68636-1_28

Abstract

Documents and Web pages collected from the World Wide Web are considered one of the most important sources of information. Retrieving documents through search engines can harvest a large amount of information, including foreign information, which facilitates information exchange and knowledge sharing. However, to make them easier for local readers to understand, foreign words, such as English words, are often translated into the local language, such as Chinese. Because translators differ and no translation standard exists, translating foreign words is a notorious headache and yields different transliterations, particularly for proper nouns such as person names and geographical names. For example, Bin Laden is translated into the terms (binladeng) or (benladeng); both are valid synonymous transliterations. In this research, we propose an approach to determining synonymous transliterations by mining Web pages retrieved by a search engine. Experiments show that the proposed approach can effectively extract synonymous transliterations given an input transliteration. © 2008 Springer-Verlag Berlin Heidelberg.

Cite (APA)

Chen, C. H., & Hsu, C. C. (2008). Synonyms extraction using Web content focused crawling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4993 LNCS, pp. 286–297). https://doi.org/10.1007/978-3-540-68636-1_28
