Distant domain adaptation for text classification

Zhenlong Zhu; Yuhua Li; Ruixuan Li; Xiwu Gu

Conference Proceedings

Distant domain adaptation for text classification

Zhu Z
Li Y
Li R
et al.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11061 LNAI 55-66

DOI: 10.1007/978-3-319-99365-2_5

4Citations

4Readers

Get full text

Abstract

Text classification becomes a hot topic nowadays. In reality, the training data and the test data may come from different distributions, which causes the problem of domain adaptation. In this paper, we study a novel learning problem: Distant Domain Adaptation for Text classification (DDAT). In DDAT, the target domain can be very different from the source domain, where the traditional transfer learning methods do not work well because they assume that the source and target domains are similar. To solve this issue we propose a Selective Domain Adaptation Algorithm (SDAA). SDAA iteratively selects reliable instances from the source and intermediate domain to bridge the source and target domains. Extensive experiments show that SDAA has state-of-the-art classification accuracies on the test datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhu, Z., Li, Y., Li, R., & Gu, X. (2018). Distant domain adaptation for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11061 LNAI, pp. 55–66). Springer Verlag. https://doi.org/10.1007/978-3-319-99365-2_5

Distant domain adaptation for text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions