Comparative study of statistical methods for finding biomarkers in longitudinal data

UBC Theses and Dissertations

Hollander, Zsuzsanna

Abstract

Solid organ transplantation is a common procedure for end-stage organ failure. After transplantation, the new organ may be rejected because the patient's immune system tries to eliminate the foreign object. To prevent rejection, a painful and costly monthly procedure is needed, which involves taking a biopsy of the allograft. The purpose of our project is to find biomarkers based on blood samples, so that rejection can be diagnosed or predicted from a simple blood or urine sample. Up to eight blood samples are taken from rejection and non-rejection patients, and a microarray is created for each sample. The microarray data is longitudinal and contains 54,613 genes. The analysis consists of normalization, pre-filtering, filtering, testing the candidate biomarkers for diagnosis/prediction, pathway analysis, and biomarker validation. For our type of data, filtering is the bottleneck and the most understudied step. We focused our research on finding possible filtering methods and tested them against the questions our biologists wanted answered. We generated a data set, based on real data, to find the strengths and weaknesses of the filtering methods we proposed to use. We also tested which filtering method provides the most precise answer to each group of questions by creating synthetic data sets with a number of biomarkers planted in them. Our conclusion is that no single statistical method, or group of methods, can provide a perfect answer to all of our biological questions. We therefore created a table matching our questions to the methods that, based on our experiments, give the best results. We also provide advice on which methods perform better under specific conditions.
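As a rough illustration of the planted-biomarker evaluation mentioned in the abstract, the following Python sketch builds a small synthetic longitudinal data set, plants a few biomarkers in the rejection group, and checks how many of them a simple filter recovers. Everything here is an assumption made for illustration: the data sizes, the effect size, the Gaussian noise model, and the choice of filter (a Welch t-statistic on each patient's time-averaged expression). The abstract does not name the filtering methods the thesis compares, so this should not be read as the author's method.

```python
import random
import statistics


def make_synthetic(n_genes=200, n_patients=10, n_times=4, n_planted=10,
                   effect=3.0, seed=0):
    """Build toy data: data[group][gene][patient] is a list of values over time.

    Planted genes get a constant upward shift in the rejection group only.
    All sizes and the effect size are illustrative assumptions.
    """
    rng = random.Random(seed)
    planted = set(rng.sample(range(n_genes), n_planted))
    data = {}
    for group in ("rejection", "non_rejection"):
        genes = []
        for g in range(n_genes):
            shift = effect if (group == "rejection" and g in planted) else 0.0
            genes.append([[rng.gauss(shift, 1.0) for _ in range(n_times)]
                          for _ in range(n_patients)])
        data[group] = genes
    return data, planted


def welch_t(a, b):
    """Welch's t-statistic for two independent samples."""
    se = (statistics.variance(a) / len(a) + statistics.variance(b) / len(b)) ** 0.5
    return (statistics.mean(a) - statistics.mean(b)) / se


def filter_top_k(data, k):
    """Rank genes by |t| computed on per-patient time averages; keep the top k."""
    scores = []
    for g in range(len(data["rejection"])):
        rej = [statistics.mean(series) for series in data["rejection"][g]]
        non = [statistics.mean(series) for series in data["non_rejection"][g]]
        scores.append((abs(welch_t(rej, non)), g))
    scores.sort(reverse=True)
    return {g for _, g in scores[:k]}


def recall(selected, planted):
    """Fraction of planted biomarkers that the filter selected."""
    return len(selected & planted) / len(planted)


data, planted = make_synthetic()
selected = filter_top_k(data, k=len(planted))
print(f"recovered {recall(selected, planted):.0%} of planted biomarkers")
```

Averaging over time discards the shape of each trajectory, so a filter like this one can only answer questions about overall level differences between groups. Questions about timing or trends would need a different statistic, which is consistent with the abstract's conclusion that no one method suits every biological question.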

Item Metadata

Title	Comparative study of statistical methods for finding biomarkers in longitudinal data
Creator	Hollander, Zsuzsanna
Publisher	University of British Columbia
Date Issued	2006
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2010-01-05
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0051174
URI	http://hdl.handle.net/2429/17587
Degree (Theses)	Master of Science - MSc
Program (Theses)	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2006-05
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

ubc_2006-0053.pdf -- 4.37MB
