Improved Guarantees and a Multiple-descent Curve for Column Subset Selection and the Nyström Method (Extended Abstract)

0Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

The Column Subset Selection Problem (CSSP) and the Nyström method are among the leading tools for constructing interpretable low-rank approximations of large datasets by selecting a small but representative set of features or instances. A fundamental question in this area is: what is the cost of this interpretability, i.e., how well can a data subset of size k compete with the best rank k approximation? We develop techniques which exploit spectral properties of the data matrix to obtain improved approximation guarantees which go beyond the standard worst-case analysis. Our approach leads to significantly better bounds for datasets with known rates of singular value decay, e.g., polynomial or exponential decay. Our analysis also reveals an intriguing phenomenon: the cost of interpretability as a function of k may exhibit multiple peaks and valleys, which we call a multiple-descent curve. A lower bound we establish shows that this behavior is not an artifact of our analysis, but rather it is an inherent property of the CSSP and Nyström tasks. Finally, using the example of a radial basis function (RBF) kernel, we show that both our improved bounds and the multiple-descent curve can be observed on real datasets simply by varying the RBF parameter.

References Powered by Scopus

LIBSVM: A Library for support vector machines

28165Citations
N/AReaders
Get full text

CUR matrix decompositions for improved data analysis

588Citations
N/AReaders
Get full text

COINCIDENCE APPROACH TO STOCHASTIC POINT PROCESS.

365Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Derezinski, M., Khanna, R., & Mahoney, M. W. (2021). Improved Guarantees and a Multiple-descent Curve for Column Subset Selection and the Nyström Method (Extended Abstract). In IJCAI International Joint Conference on Artificial Intelligence (pp. 4765–4769). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2021/647

Readers over time

‘20‘21‘22‘23‘24036912

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 7

50%

Researcher 5

36%

Professor / Associate Prof. 1

7%

Lecturer / Post doc 1

7%

Readers' Discipline

Tooltip

Computer Science 9

69%

Engineering 2

15%

Agricultural and Biological Sciences 1

8%

Economics, Econometrics and Finance 1

8%

Save time finding and organizing research with Mendeley

Sign up for free
0