Information retrieval from text or 'text mining' is the process of extracting interesting and non-trivial knowledge from unstructured text. With the ever increasing amounts of information stored on the web or archived within a computing system, high performance data processing architectures are required to process this data in real time. The aim of the work presented in this paper is the development of a hardware text mining IP-Core for use in FPGA based systems. In this paper we will describe the pre-processing engine we have developed for the PRESENCE II PCI card, to accelerate the identification of significant words within a document, logging their frequency and position. The performance of this system is then compared to an equivalent software implementation using the Lucene software package. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Freeman, M., & Jayasooriya, T. (2006). Hardware support for language aware information mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4253 LNAI-III, pp. 415–423). Springer Verlag. https://doi.org/10.1007/11893011_53
Mendeley helps you to discover research relevant for your work.