Optimizing Large Language Models for ESG Activity Detection in Financial Texts

Mattia Birti; Andrea Maurino; Francesco Osborne

Conference ProceedingsOPEN ACCESS

Optimizing Large Language Models for ESG Activity Detection in Financial Texts

ICAIF 2025 - 6th ACM International Conference on AI in Finance (2025) 856-863

DOI: 10.1145/3768292.3770371

1Citations

54Readers

Get full text

Abstract

The integration of Environmental, Social, and Governance (ESG) factors into corporate decision-making is a fundamental aspect of sustainable finance. However, ensuring that business practices align with evolving regulatory frameworks remains a persistent challenge. AI-driven solutions for automatically assessing the alignment of sustainability reports and non-financial disclosures with specific ESG activities could greatly support this process. Yet, this task remains complex due to the limitations of general-purpose Large Language Models (LLMs) in domain-specific contexts and the scarcity of structured, high-quality datasets. In this paper, we investigate the ability of current-generation LLMs to identify text related to environmental activities. Furthermore, we demonstrate that their performance can be significantly enhanced through fine-tuning on a combination of original and synthetically generated data. To this end, we introduce ESG-Activities, a benchmark dataset containing 1,325 labeled text segments classified according to the EU ESG taxonomy. Our experimental results show that fine-tuning on ESG-Activities significantly enhances classification accuracy, with open models such as Llama_7B and Gemma_7B outperforming large proprietary solutions in specific configurations. These findings have important implications for financial analysts, policymakers, and AI researchers seeking to enhance ESG transparency and compliance through natural language processing techniques.

Author supplied keywords

Cite

CITATION STYLE

APA

Birti, M., Maurino, A., & Osborne, F. (2025). Optimizing Large Language Models for ESG Activity Detection in Financial Texts. In ICAIF 2025 - 6th ACM International Conference on AI in Finance (pp. 856–863). Association for Computing Machinery, Inc. https://doi.org/10.1145/3768292.3770371

Optimizing Large Language Models for ESG Activity Detection in Financial Texts

Abstract

Author supplied keywords

Cite

Register to see more suggestions