It has become a challenging work to collect valuable information from fast text streams. In this work, we propose a method which gains useful information effectively and efficiently. Firstly, we maintain an analyzer based on the Trie structure and the dynamic N-Gram tokenizer; secondly, unlike the traditional search engine principle, we consider the documents as a query by building the indexes for the whole query base. The experimental results show that it has the strong adaption ability, low latency and high quality support for the complex query combination compared with the conventional methods.
CITATION STYLE
Qi, B., Ma, G., Shi, Z., & Wang, W. (2014). Collecting valuable information from fast text streams. In IFIP Advances in Information and Communication Technology (Vol. 432, pp. 96–105). Springer New York LLC. https://doi.org/10.1007/978-3-662-44980-6_11
Mendeley helps you to discover research relevant for your work.