TY - JOUR
T1 - Multi-reference factor analysis
T2 - Low-rank covariance estimation under unknown translations
AU - Landa, Boris
AU - Shkolnisky, Yoel
N1 - Publisher Copyright:
© 2020 The Author(s) 2020. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.
PY - 2021/9/1
Y1 - 2021/9/1
N2 - We consider the problem of estimating the covariance matrix of a random signal observed through unknown translations (modeled by cyclic shifts) and corrupted by noise. Solving this problem allows to discover low-rank structures masked by the existence of translations (which act as nuisance parameters), with direct application to principal components analysis. We assume that the underlying signal is of length $L$ and follows a standard factor model with mean zero and $r$ normally distributed factors. To recover the covariance matrix in this case, we propose to employ the second- and fourth-order shift-invariant moments of the signal known as the power spectrum and the trispectrum. We prove that they are sufficient for recovering the covariance matrix (under a certain technical condition) when $r<\sqrt{L}$. Correspondingly, we provide a polynomial-time procedure for estimating the covariance matrix from many (translated and noisy) observations, where no explicit knowledge of $r$ is required, and prove the procedure's statistical consistency. While our results establish that covariance estimation is possible from the power spectrum and the trispectrum for low-rank covariance matrices, we prove that this is not the case for full-rank covariance matrices. We conduct numerical experiments that corroborate our theoretical findings and demonstrate the favourable performance of our algorithms in various settings, including in high levels of noise.
AB - We consider the problem of estimating the covariance matrix of a random signal observed through unknown translations (modeled by cyclic shifts) and corrupted by noise. Solving this problem allows to discover low-rank structures masked by the existence of translations (which act as nuisance parameters), with direct application to principal components analysis. We assume that the underlying signal is of length $L$ and follows a standard factor model with mean zero and $r$ normally distributed factors. To recover the covariance matrix in this case, we propose to employ the second- and fourth-order shift-invariant moments of the signal known as the power spectrum and the trispectrum. We prove that they are sufficient for recovering the covariance matrix (under a certain technical condition) when $r<\sqrt{L}$. Correspondingly, we provide a polynomial-time procedure for estimating the covariance matrix from many (translated and noisy) observations, where no explicit knowledge of $r$ is required, and prove the procedure's statistical consistency. While our results establish that covariance estimation is possible from the power spectrum and the trispectrum for low-rank covariance matrices, we prove that this is not the case for full-rank covariance matrices. We conduct numerical experiments that corroborate our theoretical findings and demonstrate the favourable performance of our algorithms in various settings, including in high levels of noise.
KW - covariance estimation
KW - invariant moments
KW - method of moments
KW - multi-reference alignment
KW - principal component analysis
KW - shift invariance
KW - trispectrum
UR - http://www.scopus.com/inward/record.url?scp=85116355382&partnerID=8YFLogxK
U2 - 10.1093/imaiai/iaaa019
DO - 10.1093/imaiai/iaaa019
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85116355382
SN - 2049-8772
VL - 10
SP - 773
EP - 812
JO - Information and Inference
JF - Information and Inference
IS - 3
ER -