Matching Pattern Acquisition Approach for Ancient Chinese Treebank Construction

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Matching Pattern (MP) is a sequence of words or part-of-speech (POS), sampled from clauses, and MP acquisition is an effective approach for ancient Chinese treebank construction. This approach uses the typical characteristics of ancient Chinese short-clauses and strong-patterns, and lays down the syntactic annotation process of the treebank construction in three stages. These stages involve: (1) obtaining weighted MPs with a syntactic skeleton; (2) applying these MPs to match the clauses; and (3) generating syntactic structures of these clauses according to the syntactic skeleton of the MP. The syntactic skeletons are constructed based on the Sentence-based Grammar in our experiments. The MP-based parsing procedures are implemented on both clause and fragment units. Experiments on corpora extracted from Yili and Zuozhuan show that an integrated algorithm, involving both clause and fragment units, can achieve a performance of 99.07%/82.76% and 97.25%/77.77% for coverage/precision, respectively.

Cite

CITATION STYLE

APA

He, J., Song, T., Peng, W., & Song, J. (2018). Matching Pattern Acquisition Approach for Ancient Chinese Treebank Construction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10709 LNAI, pp. 489–498). Springer Verlag. https://doi.org/10.1007/978-3-319-73573-3_44

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free