A faster subquadratic algorithm for finding outlier correlations

21Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We study the problem of detecting outlier pairs of strongly correlated variables among a collection of n variables with otherwise weak pairwise correlations. After normalization, this task amounts to the geometric task where we are given as input a set of n vectors with unit Euclidean norm and dimension d, and we are asked to find all the outlier pairs of vectors whose inner product is at least p in absolute value, subject to the promise that all but at most q pairs of vectors have inner product at most r in absolute value for some constants 0 < t < p < 1. Improving on an algorithm of G. Valiant [FOCS 2012; J. ACM 2015], we present a randomized algorithm that for Boolean inputs ({-1, l}-valued data normalized to unit Euclidean length) runs in time Q(nmax{l-γ, +M(δγ γ), M(l-γ, 2δ γ)} +qdnγ) where 0 < γ < 1 is a constant tradeoff parameter and M(μ, v ) is the exponent to multiply an [nμ] × [nv] matrix with an [nv]×[nμ] matrix and A = 1/(1-logTp). As corollaries we obtain randomized algorithms that run in time O(2Omega;/n3-logTp+qdn 2(1-logTp)/3-logTp and in time O(4/n2+α(1-logTp+qdn 2α(1-logTp)/2+α(1-logTp) where 2 < ω < 2.38 is the exponent for square matrix multiplication and 0.3 < α< 1 is the exponent for rectangular matrix multiplication. We present further corollaries for the light bulb problem and for learning sparse Boolean functions. (The notation O(-) hides polylogarithmic factors in n and d whose degree may depend on p and t.)

Cite

CITATION STYLE

APA

Karppa, M., Kaski, P., & Kohonen, J. (2016). A faster subquadratic algorithm for finding outlier correlations. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (Vol. 2, pp. 1288–1305). Association for Computing Machinery. https://doi.org/10.1137/1.9781611974331.ch90

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free