lrSVD: An efficient imputation algorithm for incomplete high-throughput compositional data

Javier Palarea-Albaladejo; Josep Antoni Martín-Fernández; Anne Ruiz-Gazen; Christine Thomas-Agnan

Journal ArticleOPEN ACCESS

lrSVD: An efficient imputation algorithm for incomplete high-throughput compositional data

Journal of Chemometrics (2022) 36(12)

DOI: 10.1002/cem.3459

6Citations

8Readers

Abstract

Compositional methods have been successfully integrated into the chemometric toolkit to analyse and model different types of data generated by modern high-throughput technologies. Within this compositional framework, the focus is put on the relative information conveyed in the data by using log-ratio coordinate representations. However, log-ratios cannot be computed when the data matrix is not complete. A new computationally efficient data imputation algorithm based on compositional principles and aimed at high-throughput continuous-valued compositions is introduced that relies on a constrained low-rank matrix approximation of the data. Simulation and real metabolomics data are used to demonstrate its performance and ability to deal with different forms of incomplete data: zeros, nondetects, missing values or a combination of them. The computer routines lrSVD and lrSVDplus are implemented in the R package zCompositions to facilitate its use by practitioners.

Author supplied keywords

Cite

CITATION STYLE

APA

Palarea-Albaladejo, J., Antoni Martín-Fernández, J., Ruiz-Gazen, A., & Thomas-Agnan, C. (2022). lrSVD: An efficient imputation algorithm for incomplete high-throughput compositional data. Journal of Chemometrics, 36(12). https://doi.org/10.1002/cem.3459

lrSVD: An efficient imputation algorithm for incomplete high-throughput compositional data

Abstract

Author supplied keywords

Cite

Register to see more suggestions