TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification

Fei Zhao; Qing Ai; Xiangna Li; Wenhui Wang; Qingyun Gao; Yichun Liu

Journal ArticleOPEN ACCESS

TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification

Neural Processing Letters (2024) 56(1)

DOI: 10.1007/s11063-024-11460-z

2Citations

7Readers

Abstract

Extreme multi-label text classification (XMTC) annotates related labels for unknown text from large-scale label sets. Transformer-based methods have become the dominant approach for solving the XMTC task due to their effective text representation capabilities. However, the existing Transformer-based methods fail to effectively exploit the correlation between labels in the XMTC task. To address this shortcoming, we propose a novel model called TLC-XML, i.e., a Transformer with label correlation for extreme multi-label text classification. TLC-XML comprises three modules: Partition, Matcher and Ranker. In the Partition module, we exploit the semantic and co-occurrence information of labels to construct the label correlation graph, and further partition the strongly correlated labels into the same cluster. In the Matcher module, we propose cluster correlation learning, which uses the graph convolutional network (GCN) to extract the correlation between clusters. We then introduce these valuable correlations into the classifier to match related clusters. In the Ranker module, we propose label interaction learning, which aggregates the raw label prediction with the information of the neighboring labels. The experimental results on benchmark datasets show that TLC-XML significantly outperforms state-of-the-art XMTC methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhao, F., Ai, Q., Li, X., Wang, W., Gao, Q., & Liu, Y. (2024). TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification. Neural Processing Letters, 56(1). https://doi.org/10.1007/s11063-024-11460-z

TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions