- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- BIRS Workshop Lecture Videos /
- Explanation of variability through optimal transport
Open Collections
BIRS Workshop Lecture Videos
BIRS Workshop Lecture Videos
Explanation of variability through optimal transport Tabak, Esteban
Description
An optimal-transport based methodology is proposed for the explanation of variability in data. The central idea is to estimate and simulate conditional distributions $\rho(x|z)$ by mapping them optimally to their Wasserstein barycenter $\mu(y)$, thus removing all $z$-dependence. The barycenter problem needs to be formulated and solved in a data-driven format, as the distributions are only known through samples.
A particular implementation is developed, ``attributable components’’, in which the maps are restricted to $z$-dependent, non-parametric rigid translations. It is shown that this proposal encompasses standard methodologies, such as least-square regression, k-means clustering and principal components, and extends them broadly to explain accurately and robustly variability driven by complex sets of explicit and latent covariates in a computationally effective way.
Applications are shown to climate science, medicine, risk estimation and variations of the Netflix problem.
Item Metadata
| Title |
Explanation of variability through optimal transport
|
| Creator | |
| Publisher |
Banff International Research Station for Mathematical Innovation and Discovery
|
| Date Issued |
2017-05-01T12:00
|
| Description |
An optimal-transport based methodology is proposed for the explanation of variability in data. The central idea is to estimate and simulate conditional distributions $\rho(x|z)$ by mapping them optimally to their Wasserstein barycenter $\mu(y)$, thus removing all $z$-dependence. The barycenter problem needs to be formulated and solved in a data-driven format, as the distributions are only known through samples.
A particular implementation is developed, ``attributable components’’, in which the maps are restricted to $z$-dependent, non-parametric rigid translations. It is shown that this proposal encompasses standard methodologies, such as least-square regression, k-means clustering and principal components, and extends them broadly to explain accurately and robustly variability driven by complex sets of explicit and latent covariates in a computationally effective way.
Applications are shown to climate science, medicine, risk estimation and variations of the Netflix problem.
|
| Extent |
53 minutes
|
| Subject | |
| Type | |
| File Format |
video/mp4
|
| Language |
eng
|
| Notes |
Author affiliation: Courant Institute
|
| Series | |
| Date Available |
2017-10-28
|
| Provider |
Vancouver : University of British Columbia Library
|
| Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
| DOI |
10.14288/1.0357376
|
| URI | |
| Affiliation | |
| Peer Review Status |
Unreviewed
|
| Scholarly Level |
Faculty
|
| Rights URI | |
| Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International