Convolutional-neural-network-based multilabel text classification for automatic discrimination of legal documents

Ming Qiu; Yiru Zhang; Tianqi Ma; Qingfeng Wu; Fanzhu Jin

Journal ArticleOPEN ACCESS

Convolutional-neural-network-based multilabel text classification for automatic discrimination of legal documents

Sensors and Materials (2020) 32(8p2) 2659-2672

DOI: 10.18494/SAM.2020.2794

12Citations

23Readers

Abstract

Law courts spend too much time reading documents and judging the type of legal cases. This problem becomes more serious as a crime can be classified into several categories at the same time. Thus, legal documents need a multilabel classification. We propose a multilabel text classification model based on multilabel text convolutional neural network (MLTCNN). We scan legal documents and convert them to text data using optical character recognition (OCR) with a charge-coupled device (CCD) sensor. Then, we use Jieba, a word segmentation tool of Chinese letters, and TensorFlow VocabularyProcessor to generate vocabularies. Then, the case description after segmenting each word is mapped into a word index in the vocabularies. We use a word index vector as an input to the MLTCNN. Lastly, we adopt multiple sigmoid functions for multiple binary classifications. The result shows our method to be efficient in finding errors and deviations for similar cases among district courts. This study provides a new method to improve the legal service and to enable fairer law enforcement.

Author supplied keywords

Cite

CITATION STYLE

APA

Qiu, M., Zhang, Y., Ma, T., Wu, Q., & Jin, F. (2020). Convolutional-neural-network-based multilabel text classification for automatic discrimination of legal documents. Sensors and Materials, 32(8p2), 2659–2672. https://doi.org/10.18494/SAM.2020.2794

Convolutional-neural-network-based multilabel text classification for automatic discrimination of legal documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions