Learn The Big Picture: Representation Learning for Clustering

Sumanta Kashyapi; Laura Dietz

Conference Proceedings, Open Access

RepL4NLP 2021 - 6th Workshop on Representation Learning for NLP, Proceedings of the Workshop (2021) 141-151

DOI: 10.18653/v1/2021.repl4nlp-1.15

Abstract

Existing supervised models for text clustering find it difficult to directly optimize for clustering results. This is because clustering is a discrete process, and it is difficult to estimate a meaningful gradient of a discrete function that can drive gradient-based optimization algorithms. Existing supervised clustering algorithms therefore indirectly optimize a continuous function that approximates the clustering process. We propose a scalable training strategy that directly optimizes for a discrete clustering metric. We train a BERT-based embedding model using our method and evaluate it on two publicly available datasets. We show that our method outperforms another BERT-based embedding model trained with triplet loss, as well as unsupervised baselines. This suggests that optimizing directly for the clustering outcome indeed yields representations better suited for clustering.
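The gradient problem described in the abstract can be illustrated with a toy sketch. This is not the paper's training strategy. The 1-D points, the fixed centroids, the nearest-centroid assignment, and the use of the Rand index as the discrete clustering metric are all assumptions chosen for illustration. The sketch shows that a small shift in one point leaves the discrete metric unchanged (a zero finite-difference gradient), while a continuous surrogate such as triplet loss responds to the same shift.

```python
from itertools import combinations


def assign(points, centroids):
    """Discrete step: index of the nearest centroid for each 1-D point."""
    return [min(range(len(centroids)), key=lambda k: abs(p - centroids[k]))
            for p in points]


def rand_index(labels_true, labels_pred):
    """Fraction of point pairs on which the two labelings agree."""
    pairs = list(combinations(range(len(labels_true)), 2))
    agree = sum(
        (labels_true[i] == labels_true[j]) == (labels_pred[i] == labels_pred[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Continuous surrogate: hinge on the distance gap (1-D for simplicity)."""
    return max(0.0, abs(anchor - positive) - abs(anchor - negative) + margin)


# Hypothetical toy data: four 1-D "embeddings" with two true clusters.
points = [0.0, 0.2, 0.9, 1.0]
truth = [0, 0, 1, 1]
centroids = [0.0, 1.0]  # assumed fixed, to keep the sketch small


def metric(x):
    """Rand index after moving the second point to position x."""
    moved = points[:]
    moved[1] = x
    return rand_index(truth, assign(moved, centroids))


eps = 1e-3
# Finite-difference "gradient" of the discrete metric: no change in assignment.
fd_metric = (metric(0.2 + eps) - metric(0.2)) / eps
# The same shift applied to the positive in a triplet changes the loss.
fd_triplet = (triplet_loss(0.0, 0.2 + eps, 0.9) - triplet_loss(0.0, 0.2, 0.9)) / eps

print("metric finite difference:", fd_metric)
print("triplet finite difference:", fd_triplet)
print("metric after crossing the boundary:", metric(0.6))
```

The metric only changes when a point crosses an assignment boundary, and then it jumps, which is why it gives no usable gradient signal on its own.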

Citation (APA)

Kashyapi, S., & Dietz, L. (2021). Learn The Big Picture: Representation Learning for Clustering. In RepL4NLP 2021 - 6th Workshop on Representation Learning for NLP, Proceedings of the Workshop (pp. 141–151). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.repl4nlp-1.15
