A Method of Annotating Disease Names in TCM Patents Based on Co-training

Na Deng; Xu Chen; Caiquan Xiong

Book Chapter

Springer, (2020), 389-398

DOI: 10.1007/978-3-030-33509-0_35

Citations: 1

Readers: 2

Abstract

In the era of big data, annotated text data is a scarce resource. Important annotated semantic information can serve as keywords in text analysis, mining, and intelligent retrieval, and as valuable training and test sets for machine learning. In the analysis, mining, and intelligent retrieval of Traditional Chinese Medicine (TCM) patents, disease names are an important annotation target, alongside Chinese herbal medicine names and medicine efficacy. Exploiting the characteristics of TCM patent texts and building on the co-training method from machine learning, this paper proposes a method for annotating disease names in TCM patent texts. Experiments show that the method is feasible and effective. It can also be extended to annotate other semantic information in TCM patents.

Cite

Deng, N., Chen, X., & Xiong, C. (2020). A Method of Annotating Disease Names in TCM Patents Based on Co-training. In Lecture Notes in Networks and Systems (Vol. 96, pp. 389–398). Springer. https://doi.org/10.1007/978-3-030-33509-0_35
