Researches on asynchronous communication-oriented page searching aim at solving the new problems for search engine brought about by the adoption of asynchronous communication technology. At present, a full text search engine crawler mostly adopts the algorithm based on a hyperlink analysis. The crawler searches only the contents of the HTML page and ignores the codes in the script region. But it is through the script codes that asynchronous communication is realized. Since a great number of hyperlinks are hidden in the script region, it is necessary to improve the present search engine crawler to search the codes in the script region and extract the hyperlinks hidden in the script region. This paper proposes an approach, which, with the help of script code operation environment, takes advantage of the Windows message mechanism, and employs simulation clicking script function to extract hyperlinks. Meanwhile, in view of the problem that a feedback webpage is not integrated resulting from the asynchronous communication technology, this paper adopts a method that loads in the source page where hyperlinks locate and uses partial refreshing mechanism to save the refreshed page to solve the problem that information cannot be directly stored. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Fei, Y., Wang, M., & Chen, W. (2008). Research on asynchronous communication-oriented page searching. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4993 LNCS, pp. 412–417). https://doi.org/10.1007/978-3-540-68636-1_40
Mendeley helps you to discover research relevant for your work.