Making big data small

Wenfei Fan

ArticleOPEN ACCESS

Making big data small

Fan W

Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences

DOI: 10.1098/rspa.2019.0034

1Citations

14Readers

Abstract

Big data analytics is often prohibitively costly and is typically conducted by parallel processing with a cluster of machines. Is big data analytics beyond the reach of small companies that can only afford limited resources? This paper tackles this question by presenting Boundedly EvAlable SQL (BEAS), a system for querying big relations with constrained resources. The idea is to make big data small. To answer a query posed on a dataset, it often suffices to access a small fraction of the data no matter how big the dataset is. In the light of this, BEAS answers queries on big data by identifying and fetching a small set of the data needed. Under available resources, it computes exact answers whenever possible and otherwise approximate answers with accuracy guarantees. Underlying BEAS are principled approaches of bounded evaluation and data-driven approximation, the focus of this paper.

Author supplied keywords

Cite

CITATION STYLE

APA

Fan, W. (2019, May 1). Making big data small. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. Royal Society Publishing. https://doi.org/10.1098/rspa.2019.0034

Making big data small

Abstract

Author supplied keywords

Cite

Register to see more suggestions