- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Graph-augmented deep learning using literature-informed...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Graph-augmented deep learning using literature-informed biological priors for predicting perturbations in single-cell RNA sequencing Tu, Lin Shuan (Wilson)
Abstract
Single-cell perturbation experiments promise to revolutionize drug discovery by revealing how gene-level interventions change cellular states. However, modelling these complex, out-of-distribution responses remains challenging. In this thesis, we introduce Transcriptomic Perturbation Predictor (TxPert), a graph-based deep learning model that leverages literature-informed biological priors to improve generalization. TxPert embeds curated gene–gene interaction networks and pathway annotations into its architecture, combining a basal state encoder (mapping control gene expression to a latent state) with a perturbation encoder (mapping one or more gene knockouts into a change vector). The combined latent representation is decoded into predicted gene expression. By training TxPert on diverse single-cell RNA-seq perturbation datasets, we ensure it handles three key out-of-distribution tasks: unseen single-gene perturbations, transfer to novel cell types, and multi-gene perturbations. Empirically, TxPert achieves state-of-the-art accuracy on one-gene perturbation prediction and remains competitive on two-gene cases, substantially outperforming prior models. Crucially, we demonstrate that incorporating biological priors yields meaningful gains in robustness and accuracy across datasets through various ablation studies. We contribute further by examining commonly used metrics and datasets, providing a unifying framework upon which to improve. These findings suggest that graph-augmented models like TxPert can serve as building blocks for “virtual cells,” enabling rapid in silico hypothesis testing. By accurately predicting cell responses to genetic changes, such models could accelerate experiment design and therapeutic discovery.
Item Metadata
Title |
Graph-augmented deep learning using literature-informed biological priors for predicting perturbations in single-cell RNA sequencing
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2025
|
Description |
Single-cell perturbation experiments promise to revolutionize drug discovery by revealing how gene-level interventions change cellular states. However, modelling these complex, out-of-distribution responses remains challenging. In this thesis, we introduce Transcriptomic Perturbation Predictor (TxPert), a graph-based deep learning model that leverages literature-informed biological priors to improve generalization. TxPert embeds curated gene–gene interaction networks and pathway annotations into its architecture, combining a basal state encoder (mapping control gene expression to a latent state) with a perturbation encoder (mapping one or more gene knockouts into a change vector). The combined latent representation is decoded into predicted gene expression. By training TxPert on diverse single-cell RNA-seq perturbation datasets, we ensure it handles three key out-of-distribution tasks: unseen single-gene perturbations, transfer to novel cell types, and multi-gene perturbations. Empirically, TxPert achieves state-of-the-art accuracy on one-gene perturbation prediction and remains competitive on two-gene cases, substantially outperforming prior models. Crucially, we demonstrate that incorporating biological priors yields meaningful gains in robustness and accuracy across datasets through various ablation studies. We contribute further by examining commonly used metrics and datasets, providing a unifying framework upon which to improve. These findings suggest that graph-augmented models like TxPert can serve as building blocks for “virtual cells,” enabling rapid in silico hypothesis testing. By accurately predicting cell responses to genetic changes, such models could accelerate experiment design and therapeutic discovery.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2025-07-04
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0449273
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2025-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International