In this paper, we propose a method of extracting causal information from PDF files of the summary of financial statements of companies, e.g., ”The sales of smart phones was expanded continually”. Cause information is useful for investors in selecting companies to invest. We downloaded 106,885 PDF files of the summary of financial statements of companies fromWeb pages of the companies automatically. Our method extracts causal information from the PDF files by using clue expressions (e.g., ”was expanded”) and keywords relevant to a company. The clue expressions are extracted from the PDF files of the summary of financial statements of companies and articles concerning business performance of companies automatically. We developed the search system which is able to retrieve causal informations extracted by our method. The search system shows causal information containing a keyword inputted by users, and the summary of financial statements containing the retrieved causal information. We evaluated our method and it attained 83.91% precision and 55.04% recall, respectively. Moreover, we compared our method with Sakai et al’s method originally proposed for extracting causal information from financial articles concerning business performance of companies and experimental results showed that our method outperforms Sakai et al’s method.
CITATION STYLE
Sakai, H., Nishizawa, H., Matsunami, S., & Sakaji, H. (2015). Extraction of causal information from PDF files of the summary of financial statements of companies. Transactions of the Japanese Society for Artificial Intelligence, 30(1), 172–182. https://doi.org/10.1527/tjsai.30.172
Mendeley helps you to discover research relevant for your work.