TauJud: Test augmentation of machine learning in judicial documents

5Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The booming of big data makes the adoption of machine learning ubiquitous in the legal field. As we all know, a large amount of test data can better reflect the performance of the model, so the test data must be naturally expanded. In order to solve the high cost problem of labeling data in natural language processing, people in the industry have improved the performance of text classification tasks through simple data amplification techniques. However, the data amplification requirements in the judgment documents are interpretable and logical, as observed from CAIL2018 test data with over 200,000 judicial documents. Therefore, we have designed a test augmentation tool called TauJud specifically for generating more effective test data with uniform distribution over time and location for model evaluation and save time in marking data. The demo can be found at https://github.com/governormars/TauJud.

Cite

CITATION STYLE

APA

Guo, Z., Liu, J., He, T., Li, Z., & Zhangzhu, P. (2020). TauJud: Test augmentation of machine learning in judicial documents. In ISSTA 2020 - Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp. 549–552). Association for Computing Machinery, Inc. https://doi.org/10.1145/3395363.3404364

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free