Structured regularizer for neural higher-order sequence models

Martin Ratajczak; Sebastian Tschiatschek; Franz Pernkopf

Conference ProceedingsOPEN ACCESS

Structured regularizer for neural higher-order sequence models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9284 168-183

DOI: 10.1007/978-3-319-23528-8_11

2Citations

9Readers

Abstract

We introduce both joint training of neural higher-order linear-chain conditional random fields (NHO-LC-CRFs) and a new structured regularizer for sequence modelling. We show that this regularizer can be derived as lower bound from a mixture of models sharing parts, e.g. neural sub-networks, and relate it to ensemble learning. Furthermore, it can be expressed explicitly as regularization term in the training objective. We exemplify its effectiveness by exploring the introduced NHO-LCCRFs for sequence labeling. Higher-order LC-CRFs with linear factors are well-established for that task, but they lack the ability to model non-linear dependencies. These non-linear dependencies, however, can be efficiently modeled by neural higher-order input-dependent factors. Experimental results for phoneme classification with NHO-LC-CRFs confirm this fact and we achieve state-of-the-art phoneme error rate of 16.7% on TIMIT using the new structured regularizer.

Author supplied keywords

Cite

CITATION STYLE

APA

Ratajczak, M., Tschiatschek, S., & Pernkopf, F. (2015). Structured regularizer for neural higher-order sequence models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9284, pp. 168–183). Springer Verlag. https://doi.org/10.1007/978-3-319-23528-8_11

Structured regularizer for neural higher-order sequence models

Abstract

Author supplied keywords

Cite

Register to see more suggestions