GT Celeste
Architectural bias in a transport-based generative model : an asymptotic perspective
12
déc. 2024
-
12
déc. 2024
logo_team
Intervenant : Hugo Cui
Institution : Harvard University
Heure : 11h00 - 12h00

The model bias of transport-based generative models can help (or hinder) them in learning distributions in high dimensions.  We consider the problem of learning a generative model parametrized by a two-layer auto-encoder, and trained with online stochastic gradient descent, to sample from a high-dimensional data distribution with an underlying low-dimensional structure. We provide a tight asymptotic characterization of low-dimensional projections of the resulting generated density, and evidence how mode(l) collapse can arise.  On the other hand, we discuss how in a case where the architectural bias is suited to the target density, these simple models can efficiently learn to sample from a binary Gaussian mixture target distribution. Based on joint works with Yue M Lu, Lenka Zdeborová, Florent Krzakala and Eric Vanden-Eijnden.

Voir tous les événements