Mixture of expert/imitator networks: Scalable semi-supervised learning framework

Abstract

The current success of deep neural networks (DNNs) in an increasingly broad range of tasks involving artificial intelligence strongly depends on the quality and quantity of labeled training data. The scarcity of labeled data, often observed in natural language processing tasks, is therefore one of the most important issues to address. Semi-supervised learning (SSL) is a promising approach to overcoming this issue by incorporating a large amount of unlabeled data. In this paper, we propose a novel scalable SSL method for text classification tasks. The unique property of our method, Mixture of Expert/Imitator Networks, is that imitator networks learn to “imitate” the estimated label distribution of the expert network over the unlabeled data, and their outputs can then serve as a set of features for the classification. Our experiments demonstrate that the proposed method consistently improves the performance of several types of baseline DNNs. We also demonstrate that our method exhibits a “more data, better performance” property, with promising scalability to the amount of unlabeled data.
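
To make the imitation step concrete, the following is a minimal PyTorch sketch of how one imitator could be trained against the expert's estimated label distribution over unlabeled data. The Imitator architecture, the feature dimensions, and the choice of KL divergence as the imitation loss are illustrative assumptions rather than details taken from the paper, which should be consulted for the exact formulation (e.g., how the imitator outputs are combined as features for the final classifier).

import torch
import torch.nn as nn
import torch.nn.functional as F

class Imitator(nn.Module):
    """A lightweight network trained to reproduce the expert's
    predicted label distribution on unlabeled text features.
    (Architecture is an illustrative assumption.)"""
    def __init__(self, input_dim, num_classes):
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(input_dim, 128),
            nn.ReLU(),
            nn.Linear(128, num_classes),
        )

    def forward(self, x):
        return self.ff(x)  # logits over the label set

def imitation_loss(imitator, expert_logits, unlabeled_feats):
    """KL divergence between the expert's soft labels and the
    imitator's prediction, averaged over the batch. The loss
    choice is an assumption, not the paper's exact objective."""
    with torch.no_grad():
        target = F.softmax(expert_logits, dim=-1)  # expert's label distribution
    log_pred = F.log_softmax(imitator(unlabeled_feats), dim=-1)
    return F.kl_div(log_pred, target, reduction="batchmean")

# Toy usage: 8 unlabeled examples encoded as 300-dim features, 2 classes.
feats = torch.randn(8, 300)
expert_logits = torch.randn(8, 2)  # stand-in for the trained expert's output
imitator = Imitator(300, 2)
loss = imitation_loss(imitator, expert_logits, feats)
loss.backward()

Because the imitator only needs the expert's predictions (not labels), this step can in principle be run over arbitrarily large unlabeled corpora, which is the source of the scalability claim in the abstract.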

Citation (APA)

Kiyono, S., Suzuki, J., & Inui, K. (2019). Mixture of expert/imitator networks: Scalable semi-supervised learning framework. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019, and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 4073–4081). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33014073
