Trillion Dollar Words: A New Financial Dataset, Task & Market Analysis

22Citations
Citations of this article
40Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Monetary policy pronouncements by Federal Open Market Committee (FOMC) are a major driver of financial market returns. We construct the largest tokenized and annotated dataset of FOMC speeches, meeting minutes, and press conference transcripts in order to understand how monetary policy influences financial markets. In this study, we develop a novel task of hawkish-dovish classification and benchmark various pre-trained language models on the proposed dataset. Using the best-performing model (RoBERTa-large), we construct a measure of monetary policy stance for the FOMC document release days. To evaluate the constructed measure, we study its impact on the treasury market, stock market, and macroeconomic indicators. Our dataset, models, and code are publicly available on Huggingface and GitHub under CC BY-NC 4.0 license.

Cite

CITATION STYLE

APA

Shah, A., Paturi, S., & Chava, S. (2023). Trillion Dollar Words: A New Financial Dataset, Task & Market Analysis. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 6664–6679). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.368

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free