Systematic investigation of keywords selection and processing strategy on search engine forecasting: a case of tourist volume in Beijing



Abstract

Search data are timely, precise, and inexpensive, which gives them great potential for forecasting tourist volume. However, the sheer volume of search data makes it difficult to extract information that is useful for decision-making, and for forecasting in particular. This study systematically investigates keyword selection and processing. Using Beijing tourist volume as an example, 11 feature extraction algorithms were selected and combined with long short-term memory (LSTM), random forest (RF), and fuzzy time series (FTS) models to forecast tourist volume. A total of 1612 keywords were retrieved from Baidu Index demand mapping using the direct word extraction method, the range word extraction method, and the empirical selection method. The remaining 813 keywords were subjected to feature extraction. In the short- and medium-term forecasts (1-day, 7-day, and 10-day horizons), kernel principal component analysis (KPCA) and locally linear embedding (LLE) give relatively stable results when the dimensionality is reduced to 5. t-distributed stochastic neighbor embedding (t-SNE), isometric mapping (IsoMap), LLE, locality preserving projections (LPP), and independent component analysis (ICA) give relatively stable results when the dimensionality is reduced to 10. Accurate forecasting across many factors (transportation, attraction, food, lodging, travel, tips, tickets, and weather) provides a solid foundation for tourism demand optimization and scientific management, and a resource for tourists' holistic vacation planning.
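To make the pipeline described in the abstract concrete, the following is a minimal sketch of one of the combinations it names: KPCA reducing the keyword features to 5 dimensions, followed by a random forest forecast at 1-, 7-, and 10-day horizons. It is not the authors' implementation. The data are synthetic, and the helper names (`make_horizon_pairs`, `fit_and_forecast`), the RBF kernel, the chronological split, and every hyperparameter are assumptions chosen for illustration, using scikit-learn.

```python
import numpy as np
from sklearn.decomposition import KernelPCA
from sklearn.ensemble import RandomForestRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-ins (assumption): 400 days x 813 keyword search indices
# and a daily tourist-volume series. Real inputs would be Baidu Index data.
n_days, n_keywords = 400, 813
X = rng.normal(size=(n_days, n_keywords))
y = X[:, :3].sum(axis=1) + 0.1 * rng.normal(size=n_days)


def make_horizon_pairs(X, y, horizon):
    """Pair search features on day t with tourist volume on day t + horizon."""
    return X[:-horizon], y[horizon:]


def fit_and_forecast(X, y, horizon, n_components=5, n_test=50):
    """Fit scaler -> KPCA -> random forest and forecast the held-out tail."""
    X_h, y_h = make_horizon_pairs(X, y, horizon)
    # Chronological split: the last n_test aligned days are held out.
    X_train, X_test = X_h[:-n_test], X_h[-n_test:]
    y_train, y_test = y_h[:-n_test], y_h[-n_test:]
    model = make_pipeline(
        StandardScaler(),
        KernelPCA(n_components=n_components, kernel="rbf"),  # kernel is an assumption
        RandomForestRegressor(n_estimators=50, random_state=0),
    )
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    mae = float(np.mean(np.abs(pred - y_test)))
    return pred, mae


# One model per forecasting horizon mentioned in the abstract.
results = {h: fit_and_forecast(X, y, h) for h in (1, 7, 10)}
for h, (pred, mae) in results.items():
    print(f"horizon={h:>2} days  MAE={mae:.3f}")
```

Because the synthetic features are independent noise across days, the printed errors carry no meaning; the sketch only shows how the reduction step and the forecaster fit together. Swapping `KernelPCA` for another reducer, or `n_components=5` for 10, would mirror the other configurations the abstract compares.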

Cite

Yuan, Z., & Jia, G. (2022). Systematic investigation of keywords selection and processing strategy on search engine forecasting: a case of tourist volume in Beijing. Information Technology and Tourism, 24(4), 547–580. https://doi.org/10.1007/s40558-022-00238-5
