Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Financial documents often contain rich domain information, such as named entities, which could be used to indicate the documents' classification categories. Existing classification methods either ignore such contained financial domain information, achieving less optimal performances, or train document representations in supervised ways, with expensive data labeling costs. In this paper, we propose to leverage domain information to improve classification performance for financial documents, via a graph representation learning model, namely G-MoCo, based on unsupervised graph momentum contrast. With G-MoCo, we could extract latent features from massive unlabeled raw data, and then further use the learned representations for document classification. Compared with the state-of-the-art baselines, representations learned by our method could improve performances by significant margins on a financial document dataset and three non-financial public graph datasets.

Cite

CITATION STYLE

APA

Luo, X., Cheng, D., Ma, H., Wang, J., Fan, M., & Luo, Y. (2021). Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast. In International Conference on Information and Knowledge Management, Proceedings (pp. 3298–3302). Association for Computing Machinery. https://doi.org/10.1145/3459637.3482133

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free