Data integration tasks on heterogeneous systems using OpenCL

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

In the era of big data, many new algorithms are developed to try and find the most efficient way to perform computations with massive amounts of data. However, what is often overlooked is the preprocessing step for many of these applications. The Data Integration Benchmark Suite (DIBS) [1] was designed to understand the characteristics of dataset transformations in a hardware agnostic way. While on the surface these applications have a high amount of data parallelism, there are caveats in their specification that can potentially affect this characteristic. Even still, OpenCL can be an effective deployment environment for these applications. In this work we take a subset of the data transformations from each category presented in DIBS and implement them in OpenCL to evaluate their performance for heterogeneous systems. For targeting heterogeneous systems, we take a common application and attempt to deploy it to three platforms targetable by OpenCL (CPU, GPU, and FPGA). The applications are evaluated by their average transformation data rate (see Figure 1). We illustrate the advantages of each compute device in the data integration space along with different communications schemes allowed for host/device communication in the OpenCL platform.

Cite

CITATION STYLE

APA

Faber, C. J., Cabrera, A. M., Booker, O., Maayan, G., & Chamberlain, R. D. (2019). Data integration tasks on heterogeneous systems using OpenCL. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3318170.3318187

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free