Generalization Error Bounds for Multiclass Sparse Linear Classifiers

Tomer Levy, Felix Abramovich

Research output: Contribution to journal › Article › peer-review


Abstract

We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with different structural assumptions on the regression coefficients matrix. We propose a computationally feasible feature selection procedure based on penalized maximum likelihood with convex penalties capturing a specific type of sparsity at hand. In particular, we consider global row-wise sparsity, double row-wise sparsity, and low-rank sparsity, and show that with the properly chosen tuning parameters the derived plug-in classifiers attain the minimax generalization error bounds (in terms of misclassification excess risk) within the corresponding classes of multiclass sparse linear classifiers. The developed approach is general and can be adapted to other types of sparsity as well.
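To make the row-wise sparsity notion concrete, below is a minimal, illustrative sketch (not the authors' implementation) of penalized maximum likelihood for multinomial logistic regression with a convex group l2,1 penalty on the rows of the coefficient matrix, fitted by proximal gradient descent. The specific penalty, the solver, the step-size rule, and all function names and parameters here are assumptions chosen for illustration of the "global row-wise sparsity" case only.

```python
# Illustrative sketch: row-wise sparse multinomial logistic regression via
# penalized MLE with a group (l2,1) penalty, fit by proximal gradient descent.
# All modeling choices below are assumptions for illustration, not the paper's code.
import numpy as np

def softmax(Z):
    Z = Z - Z.max(axis=1, keepdims=True)  # shift for numerical stability
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)

def fit_row_sparse_multinomial(X, y, lam=0.1, n_iter=500):
    """Minimize average negative log-likelihood + lam * sum_j ||B[j, :]||_2.

    X : (n, d) features; y : (n,) integer labels in {0, ..., L-1}.
    Returns a (d, L) coefficient matrix B with (approximately) sparse rows.
    """
    n, d = X.shape
    L = int(y.max()) + 1
    Y = np.eye(L)[y]                          # one-hot encoding of labels
    B = np.zeros((d, L))
    step = n / (np.linalg.norm(X, 2) ** 2)    # conservative step from the NLL smoothness
    for _ in range(n_iter):
        P = softmax(X @ B)
        grad = X.T @ (P - Y) / n              # gradient of the average NLL
        B = B - step * grad
        # proximal step: row-wise soft-thresholding (group lasso over rows)
        norms = np.linalg.norm(B, axis=1, keepdims=True)
        shrink = np.maximum(0.0, 1.0 - step * lam / np.maximum(norms, 1e-12))
        B = B * shrink
    return B

# Toy usage: only the first 3 of 20 features carry signal.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 20))
true_B = np.zeros((20, 3))
true_B[:3] = 3.0 * rng.standard_normal((3, 3))
y = np.array([rng.choice(3, p=p) for p in softmax(X @ true_B)])
B_hat = fit_row_sparse_multinomial(X, y, lam=0.05)
print("selected rows:", np.flatnonzero(np.linalg.norm(B_hat, axis=1) > 1e-8))
```

The plug-in classifier studied in the paper would then assign a new point x to the class maximizing the fitted linear score; the other sparsity structures mentioned in the abstract (double row-wise, low-rank) correspond to different convex penalties in place of the group l2,1 term.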

Original language: English
Article number: 151
Journal: Journal of Machine Learning Research
Volume: 24
State: Published - 2023

Funding

Funders and funder numbers:
Israel Science Foundation: ISF-1095/22, ISF-589/18

Keywords

• Feature selection
• high-dimensionality
• minimaxity
• misclassification excess risk
• sparsity
