Factoring over probability simplices : from theory to applications

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Factoring over probability simplices : from theory to applications Graham, Naomi

Abstract

Non-negative matrix factorization (NMF) is a fundamental technique in modern data analysis. Its distinguishing characteristic—requiring non-negative factors—ensures that it models only additive components, producing interpretable parts-based representations. When augmented with sum-to-one constraints, yielding stochastic matrix factorization (SMF), this interpretability becomes mathematically precise through an exact correspondence with latent variable models. This thesis develops a comprehensive framework extending these ideas to continuous multivariate probability densities via non-negative simplex-constrained tensor factorizations. We demonstrate the framework’s versatility through two novel applications: sediment provenance tracing in geology and spatial transcriptomics denoising in biology. Algorithmically, we develop a novel rescaled block coordinate descent method that efficiently handles simplex constraints while maintaining iterate bound- edness—a key challenge in constrained NMF. By exploiting the semialgebraic structure of our objectives through the Kurdyka-Łojasiewicz (KL) property, we establish subsequential and full-sequence convergence guarantees. This analysis framework proves remarkably versatile: we successfully apply it to both our probabilistic tensor factorization models and to an adaptive graph-regularized denoising method for spatial transcriptomics, where the regularization evolves with the solution. Theoretically, we investigate why NMF methods work well despite non-convexity. Drawing inspiration from the benign geometry of low-rank matri- ces, we analyze the structure of SMF feasible sets through novel reparameter- izations and boundary characterizations. Our analysis reveals that spurious stationary points primarily occur at boundaries, suggesting new algorithmic strategies for avoiding local minima. These geometric insights, combined with our convergence guarantees and practical applications, provide a unified view of constrained factorization methods that bridges theory and practice.

Item Metadata

Title	Factoring over probability simplices : from theory to applications
Creator	Graham, Naomi
Supervisor	Friedlander, Michael Paul
Publisher	University of British Columbia
Date Issued	2025
Description	Non-negative matrix factorization (NMF) is a fundamental technique in modern data analysis. Its distinguishing characteristic—requiring non-negative factors—ensures that it models only additive components, producing interpretable parts-based representations. When augmented with sum-to-one constraints, yielding stochastic matrix factorization (SMF), this interpretability becomes mathematically precise through an exact correspondence with latent variable models. This thesis develops a comprehensive framework extending these ideas to continuous multivariate probability densities via non-negative simplex-constrained tensor factorizations. We demonstrate the framework’s versatility through two novel applications: sediment provenance tracing in geology and spatial transcriptomics denoising in biology. Algorithmically, we develop a novel rescaled block coordinate descent method that efficiently handles simplex constraints while maintaining iterate bound- edness—a key challenge in constrained NMF. By exploiting the semialgebraic structure of our objectives through the Kurdyka-Łojasiewicz (KL) property, we establish subsequential and full-sequence convergence guarantees. This analysis framework proves remarkably versatile: we successfully apply it to both our probabilistic tensor factorization models and to an adaptive graph-regularized denoising method for spatial transcriptomics, where the regularization evolves with the solution. Theoretically, we investigate why NMF methods work well despite non-convexity. Drawing inspiration from the benign geometry of low-rank matri- ces, we analyze the structure of SMF feasible sets through novel reparameter- izations and boundary characterizations. Our analysis reveals that spurious stationary points primarily occur at boundaries, suggesting new algorithmic strategies for avoiding local minima. These geometric insights, combined with our convergence guarantees and practical applications, provide a unified view of constrained factorization methods that bridges theory and practice.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2025-08-27
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0449899
URI	http://hdl.handle.net/2429/92099
Degree (Theses)	Doctor of Philosophy - PhD
Program (Theses)	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2025-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Factoring over probability simplices : from theory to applications Graham, Naomi

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights