In previous studies, Chinese text retrieval has often been dealt with on the character basis. This approach is not suited to deal with complex queries. We suggest that Chinese text retrieval should work with words instead of characters. The crucial problem is to segment originally continuous Chinese texts into words. In this paper, we first propose a hybrid segmentation approach which unifies the commonly used approaches. The system SMART is then adapted to index the segmented Chinese texts. Finally, we suggest that Chinese text retrieval should move further to include a thesaurus in order to cope with the rich vocabulary of Chinese.
CITATION STYLE
Nie, J. Y., Brisebois, M., & Ren, X. (1996). On Chinese text retrieval. In SIGIR Forum (ACM Special Interest Group on Information Retrieval) (pp. 225–234). https://doi.org/10.1145/243199.243270
Mendeley helps you to discover research relevant for your work.