With the increase in the number of domestic tourists and the popularity of digital upgrades in attractions, it is crucial to develop a question-answering(QA) system about the details of the attractions. However, there is little work on attractions QA, and the main bottleneck is the lack of available datasets. While previous QA datasets usually focus on news domain like CNN/DAILYMAIL and NewsQA, we present the first large-scale dataset for QA over attraction details. To ensure that the data we collected are useful, we only gather the data from public travel information website. Unlike other QA datasets like SQuAD, which is labeled manually, we formed the dataset by manual and question-answer pair generation(QAG) annotated model. Finally, we obtained a dataset covering 2,808 attractions with a total of 18,245 QA pairs, including seven types of attraction details.
CITATION STYLE
Huang, W., Xu, S., Yuhan, W., Fan, J., & Chang, Q. (2022). AttractionDetailsQA: An Attraction Details Focused on Chinese Question Answering Dataset. IEEE Access, 10, 86215–86221. https://doi.org/10.1109/ACCESS.2022.3181188
Mendeley helps you to discover research relevant for your work.