Molecular interpretation of genome-wide association studies using multiomics analysis

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Molecular interpretation of genome-wide association studies using multiomics analysis Farhadi Hassan Kiadeh, Farnush

Abstract

Genome-wide association studies have found thousands of single-nucleotide polymorphisms associated with various human traits. Recently, a powerful statistical approach called MetaXcan has been proposed for interpreting genome-wide associations at the gene level. We extended MetaXcan to a multiomics application, using a brain cortex reference dataset that includes gene expression, DNA methylation, and histone acetylation data from approximately 400 individuals. Our approach, Multi-MetaXcan, consists of three steps. In the first step, we use regularized regression to build models that predict gene expression and variation in epigenomic modifications from single-nucleotide polymorphisms. We call these models genotype-based imputation models. In the second step, we apply these models to map genome-wide associations to gene-level and epigenomic-level associations. Finally, in the third step, our model summarizes all molecular-level associations at the gene level by building epigenome-based imputation models that predict gene expression levels from nearby epigenomic marks like CpG sites and transcriptionally active regions. In summary, Multi-MetaXcan identifies trait-associated genes whose expression levels are impacted by single-nucleotide polymorphisms and their influence on intermediate molecular traits such as DNA methylation and histone acetylation. We applied Multi-MetaXcan to a major depressive disorder genome-wide association study. As the result, we discovered 12 genes, 25 transcriptionally active regions, and 163 CpG sites associated with major depressive disorder corresponding to 74 genes in total. 26 of these genes fall within or close to previously identified major depressive disorder-associated genomic regions. Importantly, the inclusion of epigenomic data resulted in an additional 62 genes that were not identified by gene expression imputation model alone.

Item Metadata

Title	Molecular interpretation of genome-wide association studies using multiomics analysis
Creator	Farhadi Hassan Kiadeh, Farnush
Publisher	University of British Columbia
Date Issued	2018
Description	Genome-wide association studies have found thousands of single-nucleotide polymorphisms associated with various human traits. Recently, a powerful statistical approach called MetaXcan has been proposed for interpreting genome-wide associations at the gene level. We extended MetaXcan to a multiomics application, using a brain cortex reference dataset that includes gene expression, DNA methylation, and histone acetylation data from approximately 400 individuals. Our approach, Multi-MetaXcan, consists of three steps. In the first step, we use regularized regression to build models that predict gene expression and variation in epigenomic modifications from single-nucleotide polymorphisms. We call these models genotype-based imputation models. In the second step, we apply these models to map genome-wide associations to gene-level and epigenomic-level associations. Finally, in the third step, our model summarizes all molecular-level associations at the gene level by building epigenome-based imputation models that predict gene expression levels from nearby epigenomic marks like CpG sites and transcriptionally active regions. In summary, Multi-MetaXcan identifies trait-associated genes whose expression levels are impacted by single-nucleotide polymorphisms and their influence on intermediate molecular traits such as DNA methylation and histone acetylation. We applied Multi-MetaXcan to a major depressive disorder genome-wide association study. As the result, we discovered 12 genes, 25 transcriptionally active regions, and 163 CpG sites associated with major depressive disorder corresponding to 74 genes in total. 26 of these genes fall within or close to previously identified major depressive disorder-associated genomic regions. Importantly, the inclusion of epigenomic data resulted in an additional 62 genes that were not identified by gene expression imputation model alone.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2018-06-07
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0368592
URI	http://hdl.handle.net/2429/66261
Degree (Theses)	Master of Science - MSc
Program (Theses)	Bioinformatics
Affiliation	Science, Faculty of
Degree Grantor	University of British Columbia
Graduation Date	2018-09
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Molecular interpretation of genome-wide association studies using multiomics analysis Farhadi Hassan Kiadeh, Farnush

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights