Integrating low-latency data streaming into data warehouse architectures has become an important enhancement to support modern data warehousing applications. In these architectures, heterogeneous workloads with data ingestion and analytical queries must be executed with strict performance guarantees. Furthermore, the data warehouse may consists of multiple different types of storage engines (a.k.a., polystores or multi-stores). A paramount problem is data placement; different workload scenarios call for different data placement designs. Moreover, workload conditions change frequently. In this paper, we provide evidence that a dynamic, workload-driven approach is needed for data placement in polystores with low-latency data ingestion support. We study the problem based on the characteristics of the TPC-DI benchmark in the context of an abbreviated polystore that consists of S-Store and Postgres.
CITATION STYLE
Du, J., Meehan, J., Tatbul, N., & Zdonik, S. (2019). Towards Dynamic Data Placement for Polystore Ingestion. In Lecture Notes in Business Information Processing (Vol. 337, pp. 211–228). Springer. https://doi.org/10.1007/978-3-030-24124-7_13
Mendeley helps you to discover research relevant for your work.