TY - CHAP
T1 - Learning Gaussian Mixtures with Generalised Linear Models
T2 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
AU - Loureiro, Bruno
AU - Sicuro, Gabriele
AU - Gerbelot, Cédric
AU - Pacco, Alessandro
AU - Krzakala, Florent
AU - Zdeborová, Lenka
N1 - Funding Information:
We thank Raphaël Berthier and Francesca Mignacco for discussions. We acknowledge funding from the ERC under the European Union’s Horizon 2020 Research and Innovation Programme, Grant Agreement 714608-SMiLe, and from the French National Research Agency grant ANR-17-CE23-0023-01 PAIL. GS is grateful to EPFL for its generous hospitality during the finalisation of the project.
Publisher Copyright:
© 2021 Neural Information Processing Systems Foundation. All rights reserved.
PY - 2021
Y1 - 2021
AB - Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning. In this manuscript, we characterise the learning of a mixture of K Gaussians with generic means and covariances via empirical risk minimisation (ERM) with any convex loss and regularisation. In particular, we prove exact asymptotics characterising the ERM estimator in high dimensions, extending several previous results about Gaussian mixture classification in the literature. We exemplify our result in two tasks of interest in statistical learning: a) classification for a mixture with sparse means, where we study the efficiency of the ℓ1 penalty with respect to ℓ2; b) max-margin multi-class classification, where we characterise the phase transition in the existence of the multi-class logistic maximum likelihood estimator for K > 2. Finally, we discuss how our theory can be applied beyond the scope of synthetic data, showing that in a variety of cases Gaussian mixtures closely capture the learning curves of classification tasks on real data sets.
UR - http://www.scopus.com/inward/record.url?scp=85131558172&partnerID=8YFLogxK
M3 - Conference paper
AN - SCOPUS:85131558172
T3 - Advances in Neural Information Processing Systems
SP - 10144
EP - 10157
BT - Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
A2 - Ranzato, Marc'Aurelio
A2 - Beygelzimer, Alina
A2 - Dauphin, Yann
A2 - Liang, Percy S.
A2 - Wortman Vaughan, Jenn
PB - Neural Information Processing Systems Foundation
Y2 - 6 December 2021 through 14 December 2021
ER -