Multiclass classification by sparse multinomial logistic regression

Felix Abramovich*, Vadim Grinshtein, Tomer Levy

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

In this paper we consider high-dimensional multiclass classification by sparse multinomial logistic regression. We propose first a feature selection procedure based on penalized maximum likelihood with a complexity penalty on the model size and derive the nonasymptotic bounds for misclassification excess risk of the resulting classifier. We establish also their tightness by deriving the corresponding minimax lower bounds. In particular, we show that there is a phase transition between small and large number of classes. The bounds can be reduced under the additional low noise condition. To find a penalized maximum likelihood solution with a complexity penalty requires, however, a combinatorial search over all possible models. To design a feature selection procedure computationally feasible for high-dimensional data, we propose multinomial logistic group Lasso and Slope classifiers and show that they also achieve the minimax order.

Original languageEnglish
Article number9410597
Pages (from-to)4637-4646
Number of pages10
JournalIEEE Transactions on Information Theory
Volume67
Issue number7
DOIs
StatePublished - Jul 2021

Funding

FundersFunder number
Israel Science FoundationISF-589/18

    Keywords

    • Complexity penalty
    • Convex relaxation
    • Feature selection
    • High-dimensionality
    • Minimaxity
    • Misclassification excess risk
    • Sparsity

    Fingerprint

    Dive into the research topics of 'Multiclass classification by sparse multinomial logistic regression'. Together they form a unique fingerprint.

    Cite this