Communication-efficient surrogate quantile regression for non-randomly distributed system

56Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Distributed system has been widely used to solve massive data analysis tasks. This article targets on quantile regression on distributed system with non-randomly distributed massive data, and proposes a new communication-efficient surrogate quantile regression. Specifically, based on a small size random pilot sample collected from different worker machines, we approximate the global quantile regression as a surrogate one on the master machine, which relates to the local datasets only through their gradient vectors, and can overcome the non-randomly distributed nature. Then the resulting estimator can be obtained on the master, and the communication cost is greatly reduced, since the pilot sample and local gradients can be transferred conveniently. In theory, without any restrictive assumption about randomness, the established asymptotical results show that the proposed method works beautifully just as the data were stored on one single machine. Synthetic data and real world data evaluations are also used to illustrate the proposed method.

Cite

CITATION STYLE

APA

Wang, K., Zhang, B., Alenezi, F., & Li, S. (2022). Communication-efficient surrogate quantile regression for non-randomly distributed system. Information Sciences, 588, 425–441. https://doi.org/10.1016/j.ins.2021.12.078

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free