Recognizing fine-grained named entities, e.g., street and city instead of just the coarse type location, has been shown to increase task performance in several contexts. Fine-grained types, however, amplify the problem of data sparsity during training, which is why larger amounts of training data are needed. In this contribution, we address the scalability issues caused by these larger training sets. We distribute and parallelize feature extraction and parameter estimation in linear-chain conditional random fields, which are a popular choice for sequence labeling tasks such as named entity recognition (NER) and part-of-speech (POS) tagging. To this end, we employ the parallel stream processing framework Apache Flink, which supports in-memory distributed iterations. Thanks to this feature, and in contrast to prior approaches, our system becomes iteration-aware during gradient descent. We experimentally demonstrate the scalability of our approach and also validate the parameters learned during distributed training in a fine-grained NER task.
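The abstract only sketches the training setup, so the following is a hedged illustration rather than the authors' implementation. In a linear-chain CRF, the conditional log-likelihood and its gradient decompose into per-sentence terms (standard formulation, not taken from the paper):

```latex
\ell(\theta) = \sum_{i=1}^{N} \left( \sum_{t} \theta^{\top} f\!\left(y^{(i)}_{t-1}, y^{(i)}_{t}, x^{(i)}, t\right) - \log Z_{\theta}\!\left(x^{(i)}\right) \right)

\nabla \ell(\theta) = \sum_{i=1}^{N} \left( F\!\left(x^{(i)}, y^{(i)}\right) - \mathbb{E}_{p_{\theta}\left(y \mid x^{(i)}\right)}\!\left[ F\!\left(x^{(i)}, y\right) \right] \right)
```

where F(x, y) sums the feature vectors over all positions of a sentence and the expectation is computed per sentence with forward-backward. Because the gradient is a sum of independent per-sentence terms, it can be computed as a distributed map followed by a sum-reduce, with the current weights shipped to the workers in every iteration. The sketch below shows how such an iteration-aware loop could look with Flink's batch (DataSet) API and bulk iterations, following the broadcast-variable pattern of Flink's KMeans example; the Sentence type, the placeholder gradient, the feature dimension, and the learning rate are all assumptions.

```scala
import org.apache.flink.api.common.functions.RichMapFunction
import org.apache.flink.api.scala._

// Toy container for a feature-extracted sentence; real fine-grained NER
// features would be sparse and far higher-dimensional.
case class Sentence(features: Array[Array[Double]], labels: Array[Int])

// Per-sentence gradient of the CRF log-likelihood under the broadcast weights.
// The body is a placeholder (zeros); a real implementation would return
// observed minus expected feature counts computed via forward-backward.
class GradientMapper(dim: Int) extends RichMapFunction[Sentence, Array[Double]] {
  override def map(s: Sentence): Array[Double] = {
    val w = getRuntimeContext
      .getBroadcastVariable[Array[Double]]("weights").get(0)
    // w and s.features would feed the forward-backward pass here.
    Array.fill(dim)(0.0)
  }
}

// Applies one gradient-ascent step to the broadcast weight vector.
class UpdateMapper(eta: Double) extends RichMapFunction[Array[Double], Array[Double]] {
  override def map(g: Array[Double]): Array[Double] = {
    val w = getRuntimeContext
      .getBroadcastVariable[Array[Double]]("weights").get(0)
    w.zip(g).map { case (wi, gi) => wi + eta * gi }
  }
}

object DistributedCrfSketch {
  def main(args: Array[String]): Unit = {
    val env = ExecutionEnvironment.getExecutionEnvironment
    val dim = 2 // toy feature dimension

    // Feature extraction itself can be a distributed map over raw text;
    // here we start from two toy, already-extracted sentences.
    val sentences = env.fromElements(
      Sentence(Array(Array(1.0, 0.0)), Array(0)),
      Sentence(Array(Array(0.0, 1.0)), Array(1)))

    val initialWeights = env.fromElements(Array.fill(dim)(0.0))

    // Bulk iteration: the weight vector stays in memory across iterations
    // instead of being re-read from disk between passes over the data.
    val trained = initialWeights.iterate(50) { weights =>
      // Broadcast the current weights, compute per-sentence gradients in
      // parallel, and sum them into a single gradient vector.
      val gradient = sentences
        .map(new GradientMapper(dim)).withBroadcastSet(weights, "weights")
        .reduce((a, b) => a.zip(b).map { case (x, y) => x + y })

      // Take one gradient-ascent step on the broadcast weights.
      gradient
        .map(new UpdateMapper(eta = 0.1)).withBroadcastSet(weights, "weights")
    }

    trained.print()
  }
}
```

Keeping the weight vector inside a bulk iteration means it remains in Flink's managed memory between passes over the training data rather than being written out and re-read, which is the iteration-aware property the abstract refers to; the authors' actual feature extraction and gradient code will of course differ from this sketch.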
Schwarzenberg, R., Hennig, L., & Hemsen, H. (2018). In-memory distributed training of linear-chain conditional random fields with an application to fine-grained named entity recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10713 LNAI, pp. 155–167). Springer Verlag. https://doi.org/10.1007/978-3-319-73706-5_13