Retaining data from streams of social platforms with minimal regret

Nguyen Thanh Tam; Matthias Weidlich; Duong Chi Thang; Hongzhi Yin; Nguyen Quoc Viet Hung

Conference Proceedings

IJCAI International Joint Conference on Artificial Intelligence (2017), Vol. 0, pp. 2850–2856

DOI: 10.24963/ijcai.2017/397

37 Citations

29 Readers

Abstract

Today's social platforms, such as Twitter and Facebook, continuously generate massive volumes of data. The resulting data streams exceed any reasonable limit for permanent storage, especially since the data is often redundant, overlapping, sparse, and generally of low value. This calls for means to retain only a small fraction of the data in an online manner. In this paper, we propose techniques to effectively decide which data to retain, such that the induced loss of information, the regret of neglecting certain data, is minimized. These techniques not only enable efficient processing of massive streaming data but are also adaptive and address the dynamic nature of social media. Experiments on large-scale real-world datasets illustrate the feasibility of our approach in terms of both runtime and information quality.

Cite

Tam, N. T., Weidlich, M., Thang, D. C., Yin, H., & Hung, N. Q. V. (2017). Retaining data from streams of social platforms with minimal regret. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 2850–2856). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/397
