Financial statements provide a view of company’s financial status at a specific point in time including the quantitative as well as qualitative view. Besides the quantitative information, the paper asserts that the qualitative information present in the form of textual disclosures have high discriminating power to predict the financial default. Towards this, the paper presents a technique to capture comprehensive 360- ∘ features from qualitative textual data at multiple granularities. The paper proposes a new sentence embedding (SE) from large language models specifically built for financial domain to encode the textual data and presents three deep learning models built on SE for financial default prediction. To accommodate unstructured and non-standard financial statements from small and unlisted companies, the paper also presents a document processing pipeline to be inclusive of such companies in the financial text modelling. Finally, the paper presents comprehensive experimental results on two datasets demonstrating the discriminating power of textual features to predict financial defaults.
CITATION STYLE
Doshi, C., Shrotiya, H., Bhiogade, R., Bhatt, H. S., & Jha, A. (2023). Analyzing Textual Information from Financial Statements for Default Prediction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 14189 LNCS, pp. 48–65). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-41682-8_4
Mendeley helps you to discover research relevant for your work.