Abstract
Multi-task learning (MTL) is an open and challenging problem in many real-world applications. Typical approaches either establish a global parameter-sharing mechanism across all tasks or assign each task an individual set of parameters with cross-connections between tasks. However, most existing approaches make all tasks share all features, either fully or proportionally, without distinguishing which features are actually helpful to each task. As a result, a task can be disturbed by features that are useful only for other tasks, leading to undesired negative transfer between tasks. In this paper, we design a novel architecture named the Multiple-level Sparse Sharing Model (MSSM), which learns features selectively and shares knowledge across tasks efficiently. MSSM first employs a field-level sparse connection module (FSCM) that enables more expressive combinations of feature fields to be learned for generalization across tasks, while still allowing task-specific features to be customized for each task. Furthermore, a cell-level sparse sharing module (CSSM) recognizes the sharing pattern through a set of coding variables that selectively choose which cells to route for a given task. Extensive experimental results on several real-world datasets show that MSSM significantly outperforms SOTA models in terms of AUC and LogLoss.
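To make the cell-level sparse sharing idea concrete, the sketch below illustrates (in NumPy) how a set of per-task coding variables could select which cells a task routes through, so that tasks share some cells and skip others. This is a hypothetical illustration under our own assumptions, not the paper's implementation: the cell count, thresholding rule, and fixed (rather than learned) coding variables are all simplifications.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical CSSM-style routing: each "cell" is a small linear map,
# and each task owns a vector of coding variables. A cell contributes
# to a task's output only when its code passes a threshold.
N_CELLS, D_IN, D_OUT = 4, 8, 3
cells = [rng.normal(size=(D_OUT, D_IN)) for _ in range(N_CELLS)]

# Per-task coding variables (learned in the paper; fixed here for clarity).
codes = {
    "task_a": np.array([0.9, 0.1, 0.8, 0.2]),  # selects cells 0 and 2
    "task_b": np.array([0.1, 0.7, 0.8, 0.1]),  # selects cells 1 and 2
}

def route(x, task, threshold=0.5):
    """Sum the outputs of only the cells selected for `task`."""
    out = np.zeros(D_OUT)
    for cell, code in zip(cells, codes[task]):
        if code > threshold:
            out += cell @ x
    return out

x = rng.normal(size=D_IN)
ya, yb = route(x, "task_a"), route(x, "task_b")
```

Here cell 2 is shared by both tasks while cells 0 and 1 are task-specific, so the two task outputs differ even on the same input; learning the codes end-to-end (e.g. with a sparsity penalty) is what would let the model discover such a sharing pattern rather than having it fixed by hand.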
Citation
Ding, K., Dong, X., He, Y., Cheng, L., Fu, C., Huan, Z., … Mo, L. (2021). MSSM: A Multiple-level Sparse Sharing Model for Efficient Multi-Task Learning. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2237–2241). Association for Computing Machinery, Inc. https://doi.org/10.1145/3404835.3463022