TY - GEN
T1 - Efficient Probability Mass Function Estimation from Partially Observed Data
AU - Chege, Joseph K.
AU - Grasis, Mikus J.
AU - Manina, Alla
AU - Yeredor, Arie
AU - Haardt, Martin
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Estimating the joint probability mass function (PMF) of a set of random variables from partially observed data is a crucial part of statistical learning and data analysis, with applications in areas such as recommender systems and data classification. Recently, it has been proposed to estimate the joint PMF based on the maximum likelihood (ML) of the data, fitted to a low-rank canonical polyadic decomposition (CPD) model of the joint PMF. To this end, a hybrid alternating-directions expectation-maximization (AD-EM) algorithm was proposed to solve the ML optimization problem, consisting of computationally expensive AD iterations followed by an EM refinement stage. It is well known that the convergence rate of EM decreases as the fraction of missing data increases. In this paper, we address the slow convergence of the EM algorithm. By adapting the squared iterative methods (SQUAREM) acceleration scheme to the context of PMF estimation, we propose the SQUAREM-PMF algorithm to speed up the convergence of the EM algorithm. Moreover, we demonstrate that running the computationally cheaper EM algorithm alone after an appropriate initialization is sufficient. Numerical results on both synthetic and real data in the context of movie recommendation show that our algorithm outperforms state-of-the-art PMF estimation algorithms.
AB - Estimating the joint probability mass function (PMF) of a set of random variables from partially observed data is a crucial part of statistical learning and data analysis, with applications in areas such as recommender systems and data classification. Recently, it has been proposed to estimate the joint PMF based on the maximum likelihood (ML) of the data, fitted to a low-rank canonical polyadic decomposition (CPD) model of the joint PMF. To this end, a hybrid alternating-directions expectation-maximization (AD-EM) algorithm was proposed to solve the ML optimization problem, consisting of computationally expensive AD iterations followed by an EM refinement stage. It is well known that the convergence rate of EM decreases as the fraction of missing data increases. In this paper, we address the slow convergence of the EM algorithm. By adapting the squared iterative methods (SQUAREM) acceleration scheme to the context of PMF estimation, we propose the SQUAREM-PMF algorithm to speed up the convergence of the EM algorithm. Moreover, we demonstrate that running the computationally cheaper EM algorithm alone after an appropriate initialization is sufficient. Numerical results on both synthetic and real data in the context of movie recommendation show that our algorithm outperforms state-of-the-art PMF estimation algorithms.
KW - Joint PMF estimation
KW - expectation-maximization (EM)
KW - maximum likelihood (ML)
KW - rec-ommendation systems
KW - tensor decomposition
UR - http://www.scopus.com/inward/record.url?scp=85150197993&partnerID=8YFLogxK
U2 - 10.1109/IEEECONF56349.2022.10052047
DO - 10.1109/IEEECONF56349.2022.10052047
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85150197993
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 256
EP - 262
BT - 56th Asilomar Conference on Signals, Systems and Computers, ACSSC 2022
A2 - Matthews, Michael B.
PB - IEEE Computer Society
T2 - 56th Asilomar Conference on Signals, Systems and Computers, ACSSC 2022
Y2 - 31 October 2022 through 2 November 2022
ER -