A novel approach using Hidden Markov Model (HMM) for the task of finding prices of products on internet sites is proposed in this paper. The proposed Information Extraction System based on HMM (IESHMM) utilizes HMM for its capability to process temporal information. The proposed IESHMM first processes web pages that are returned from search engines and then extracts specific fields such as prices, descriptions, locations, images of products, and other information of interest. The proposed IESHMM is evaluated with real-world problems and compared with a conventional method. The results show that the proposed IESHMM outperforms the other method by 22.9 % and 37.2% in terms of average recall and average precision, respectively. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Park, D. C., Huong, V. T. L., Woo, D. M., Hieu, D. N., & Ninh, S. T. H. (2009). Information extraction system based on hidden markov model. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5551 LNCS, pp. 52–59). https://doi.org/10.1007/978-3-642-01507-6_7
Mendeley helps you to discover research relevant for your work.