- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- BIRS Workshop Lecture Videos /
- Approximation of flows near target sets and two-layers...
Open Collections
BIRS Workshop Lecture Videos
BIRS Workshop Lecture Videos
Approximation of flows near target sets and two-layers neural networks training dynamics Gerbelot, Cedric
Description
Based on joint work with Jean-Christophe Mourrat.
We study the gradient-based learning dynamics of a wide two-layers neural network on a misspecified single index regression task, in a two-timescale regime where the second layer weights are learned faster than the first layer weights.
Conjectures for the solution to the dynamical system describing the training dynamics were obtained recently by R. Berthier, A. Montanari and K. Zhou; using the relative training speed between the two layers as a perturbative parameter and matched asymptotic expansions arguments. We provide rigorous counterparts to these predictions. Our proofs are based on a quantitative approximation result for dynamical systems evolving near target sets defined by integral constraints involving the empirical measure of the weights. When the latter is a point mass, the auxiliary system can be viewed as obtained from a skewed projection on the tangent space to the manifold defined by the constraint functions.
Item Metadata
| Title |
Approximation of flows near target sets and two-layers neural networks training dynamics
|
| Creator | |
| Publisher |
Banff International Research Station for Mathematical Innovation and Discovery
|
| Date Issued |
2026-02-05
|
| Description |
Based on joint work with Jean-Christophe Mourrat.
We study the gradient-based learning dynamics of a wide two-layers neural network on a misspecified single index regression task, in a two-timescale regime where the second layer weights are learned faster than the first layer weights.
Conjectures for the solution to the dynamical system describing the training dynamics were obtained recently by R. Berthier, A. Montanari and K. Zhou; using the relative training speed between the two layers as a perturbative parameter and matched asymptotic expansions arguments. We provide rigorous counterparts to these predictions. Our proofs are based on a quantitative approximation result for dynamical systems evolving near target sets defined by integral constraints involving the empirical measure of the weights. When the latter is a point mass, the auxiliary system can be viewed as obtained from a skewed projection on the tangent space to the manifold defined by the constraint functions.
|
| Extent |
41.0 minutes
|
| Subject | |
| Type | |
| File Format |
video/mp4
|
| Language |
eng
|
| Notes |
Author affiliation: ENS Lyon
|
| Series | |
| Date Available |
2026-02-09
|
| Provider |
Vancouver : University of British Columbia Library
|
| Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
| DOI |
10.14288/1.0451476
|
| URI | |
| Affiliation | |
| Peer Review Status |
Unreviewed
|
| Scholarly Level |
Researcher
|
| Rights URI | |
| Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International