A rationale is defined as a subset of input features that best explains or supports a machine learning model's prediction. Rationale identification has improved the generalizability and interpretability of neural networks on vision and language data. In graph applications such as molecule and polymer property prediction, identifying representative subgraph structures, called graph rationales, plays an essential role in the performance of graph neural networks. Existing graph pooling and/or distribution intervention methods suffer from a lack of examples from which to learn to identify optimal graph rationales. In this work, we introduce a new augmentation operation called environment replacement that automatically creates virtual data examples to improve rationale identification. We propose an efficient framework that performs rationale-environment separation and representation learning on the real and augmented examples in latent spaces, avoiding the high complexity of explicit graph decoding and encoding. Experiments on seven molecular and four polymer datasets demonstrate the effectiveness and efficiency of the proposed augmentation-based graph rationalization framework compared with recent techniques. Data and the implementation of the proposed framework are publicly available at https://github.com/liugangcode/GREA.
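To make the idea of environment replacement in latent space concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each graph already has node embeddings, that a soft per-node mask in [0, 1] marks rationale membership, that rationale and environment vectors are obtained by masked sum pooling, and that an augmented example is formed by adding the rationale vector of one graph to the environment vector of another. The function names `separate` and `environment_replacement`, the pooling choice, and the additive combination are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np


def separate(node_emb, mask):
    """Split one graph's node embeddings into two pooled vectors.

    node_emb: (n_nodes, d) array of node embeddings (assumed given).
    mask:     (n_nodes,) soft rationale scores in [0, 1] (assumed given).
    Returns (rationale, environment), each of shape (d,).
    """
    weights = mask[:, None]
    rationale = (weights * node_emb).sum(axis=0)
    environment = ((1.0 - weights) * node_emb).sum(axis=0)
    return rationale, environment


def environment_replacement(rationales, environments):
    """Pair every rationale i with every environment j by addition.

    rationales, environments: (B, d) arrays for a batch of B graphs.
    Returns a (B, B, d) array; entry [i, j] is a virtual example made of
    graph i's rationale and graph j's environment. No graph is decoded
    or re-encoded; everything stays in the embedding space.
    """
    return rationales[:, None, :] + environments[None, :, :]


# Toy batch: three graphs with different node counts, embedding size 4.
rng = np.random.default_rng(0)
graphs = [rng.normal(size=(n, 4)) for n in (5, 3, 6)]
masks = [rng.uniform(size=g.shape[0]) for g in graphs]

pairs = [separate(g, m) for g, m in zip(graphs, masks)]
rationales = np.stack([r for r, _ in pairs])
environments = np.stack([e for _, e in pairs])

augmented = environment_replacement(rationales, environments)
```

In this sketch the diagonal entries `augmented[i, i]` recover the plain sum-pooled representation of graph `i`, while the off-diagonal entries are the virtual examples; a batch of B real graphs therefore yields B × (B − 1) augmented representations without any explicit graph construction.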
CITATION STYLE
Liu, G., Zhao, T., Xu, J., Luo, T., & Jiang, M. (2022). Graph Rationalization with Environment-based Augmentations. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1069–1078). Association for Computing Machinery. https://doi.org/10.1145/3534678.3539347