Efficient Probability Mass Function Estimation from Partially Observed Data

Joseph K. Chege, Mikus J. Grasis, Alla Manina, Arie Yeredor, Martin Haardt

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Estimating the joint probability mass function (PMF) of a set of random variables from partially observed data is a crucial part of statistical learning and data analysis, with applications in areas such as recommender systems and data classification. Recently, it has been proposed to estimate the joint PMF based on the maximum likelihood (ML) of the data, fitted to a low-rank canonical polyadic decomposition (CPD) model of the joint PMF. To this end, a hybrid alternating-directions expectation-maximization (AD-EM) algorithm was proposed to solve the ML optimization problem, consisting of computationally expensive AD iterations followed by an EM refinement stage. It is well known that the convergence rate of EM decreases as the fraction of missing data increases. In this paper, we address the slow convergence of the EM algorithm. By adapting the squared iterative methods (SQUAREM) acceleration scheme to the context of PMF estimation, we propose the SQUAREM-PMF algorithm to speed up the convergence of the EM algorithm. Moreover, we demonstrate that running the computationally cheaper EM algorithm alone after an appropriate initialization is sufficient. Numerical results on both synthetic and real data in the context of movie recommendation show that our algorithm outperforms state-of-the-art PMF estimation algorithms.

Original languageEnglish
Title of host publication56th Asilomar Conference on Signals, Systems and Computers, ACSSC 2022
EditorsMichael B. Matthews
PublisherIEEE Computer Society
Pages256-262
Number of pages7
ISBN (Electronic)9781665459068
DOIs
StatePublished - 2022
Event56th Asilomar Conference on Signals, Systems and Computers, ACSSC 2022 - Virtual, Online, United States
Duration: 31 Oct 20222 Nov 2022

Publication series

NameConference Record - Asilomar Conference on Signals, Systems and Computers
Volume2022-October
ISSN (Print)1058-6393

Conference

Conference56th Asilomar Conference on Signals, Systems and Computers, ACSSC 2022
Country/TerritoryUnited States
CityVirtual, Online
Period31/10/222/11/22

Funding

FundersFunder number
Deutsche ForschungsgemeinschaftHA 2239/16-1

    Keywords

    • Joint PMF estimation
    • expectation-maximization (EM)
    • maximum likelihood (ML)
    • rec-ommendation systems
    • tensor decomposition

    Fingerprint

    Dive into the research topics of 'Efficient Probability Mass Function Estimation from Partially Observed Data'. Together they form a unique fingerprint.

    Cite this