Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast

Xueni Luo; Dawei Cheng; Haorui Ma; Junhao Wang; Mengzhen Fan; Yifeng Luo

Conference ProceedingsOPEN ACCESS

Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast

International Conference on Information and Knowledge Management, Proceedings (2021) 3298-3302

DOI: 10.1145/3459637.3482133

1Citations

7Readers

Get full text

Abstract

Financial documents often contain rich domain information, such as named entities, which could be used to indicate the documents' classification categories. Existing classification methods either ignore such contained financial domain information, achieving less optimal performances, or train document representations in supervised ways, with expensive data labeling costs. In this paper, we propose to leverage domain information to improve classification performance for financial documents, via a graph representation learning model, namely G-MoCo, based on unsupervised graph momentum contrast. With G-MoCo, we could extract latent features from massive unlabeled raw data, and then further use the learned representations for document classification. Compared with the state-of-the-art baselines, representations learned by our method could improve performances by significant margins on a financial document dataset and three non-financial public graph datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Luo, X., Cheng, D., Ma, H., Wang, J., Fan, M., & Luo, Y. (2021). Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast. In International Conference on Information and Knowledge Management, Proceedings (pp. 3298–3302). Association for Computing Machinery. https://doi.org/10.1145/3459637.3482133

Leveraging Domain Information to Classify Financial Documents via Unsupervised Graph Momentum Contrast

Abstract

Author supplied keywords

Cite

Register to see more suggestions