Estimating cell type proportions in human cord blood samples from DNAm arrays

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Estimating cell type proportions in human cord blood samples from DNAm arrays Dinh, Louie

Abstract

Epigenome-wide association studies are used to link patterns in the epigenome to human phenotypes and disease. These studies continue to increase in num- ber, driven by improving technologies and decreasing costs. However, results from population-scale association studies are often difficult to interpret. One major chal- lenge to interpretation is separating biologically relevant epigenetic changes from changes to the underlying cell type composition. This thesis focuses on computa- tional methods for correcting cell type composition in epigenome-wide association studies measuring DNAm in blood. Specifically, we focus on a class of methods, called reference-based methods, that rely on measurements of DNAm from puri- fied constituent cell types. Currently, reference-based correction methods perform poorly on human cord blood. This is unusual because adult blood, a closely related tissue, is a case-study in successful computational correction. Several previous attempts at improving cord blood estimation were only partially successful. We demonstrate how reference-based estimation methods that rely on for cord blood can be improved. First, we validated that existing methods perform poorly on cord blood, especially in minor cell types. Then, we demonstrated how this low per- formance stems from missing cell type references, data normalization and violated assumptions in signature construction. Resolving these issues improved estimates in a validation set with experimentally generated ground truth. Finally, we com- pared our reference-based estimates against reference-free techniques, an alterna- tive class of computational correction methods. Going forward, this thesis provides a template for extending reference-based estimation to other heterogeneous tissues.

Item Metadata

Title	Estimating cell type proportions in human cord blood samples from DNAm arrays
Creator	Dinh, Louie
Publisher	University of British Columbia
Date Issued	2017
Description	Epigenome-wide association studies are used to link patterns in the epigenome to human phenotypes and disease. These studies continue to increase in num- ber, driven by improving technologies and decreasing costs. However, results from population-scale association studies are often difficult to interpret. One major chal- lenge to interpretation is separating biologically relevant epigenetic changes from changes to the underlying cell type composition. This thesis focuses on computa- tional methods for correcting cell type composition in epigenome-wide association studies measuring DNAm in blood. Specifically, we focus on a class of methods, called reference-based methods, that rely on measurements of DNAm from puri- fied constituent cell types. Currently, reference-based correction methods perform poorly on human cord blood. This is unusual because adult blood, a closely related tissue, is a case-study in successful computational correction. Several previous attempts at improving cord blood estimation were only partially successful. We demonstrate how reference-based estimation methods that rely on for cord blood can be improved. First, we validated that existing methods perform poorly on cord blood, especially in minor cell types. Then, we demonstrated how this low per- formance stems from missing cell type references, data normalization and violated assumptions in signature construction. Resolving these issues improved estimates in a validation set with experimentally generated ground truth. Finally, we com- pared our reference-based estimates against reference-free techniques, an alterna- tive class of computational correction methods. Going forward, this thesis provides a template for extending reference-based estimation to other heterogeneous tissues.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2017-10-10
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0356611
URI	http://hdl.handle.net/2429/63224
Degree (Theses)	Master of Science - MSc
Program (Theses)	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2017-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Estimating cell type proportions in human cord blood samples from DNAm arrays Dinh, Louie

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights