Text analysis and visualization research on the hetu dangse during the qing dynasty of china

1Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

In traditional historical research, interpreting historical documents subjectively and manually causes problems such as one-sided understanding, selective analysis, and one-way knowledge connection. In this study, we aim to use machine learning to automatically analyze and explore historical documents from a text analysis and visualization perspective. This technology solves the problem of large-scale historical data analysis that is difficult for humans to read and intuitively understand. In this study, we use the historical documents of the Qing Dynasty Hetu Dangse, preserved in the Archives of Liaoning Province, as data analysis samples. China's Hetu Dangse is the largest Qing Dynasty thematic archive with Manchu and Chinese characters in the world. Through word frequency analysis, correlation analysis, co-word clustering, word2vec model, and SVM (Support Vector Machines) algorithms, we visualize historical documents, reveal the relationships between functions of the government departments in the Shengjing area of the Qing Dynasty, achieve the automatic classification of historical archives, improve the efficient use of historical materials as well as build connections between historical knowledge. Through this, archivists can be guided practically in historical materials' management and compilation.

Cite

CITATION STYLE

APA

Wang, Z., Wu, J., Yu, G., & Song, Z. (2021, September 20). Text analysis and visualization research on the hetu dangse during the qing dynasty of china. Information Technology and Libraries. American Library Association. https://doi.org/10.6017/ital.v40i3.13279

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free