UBC Theses and Dissertations
Automated analysis of single cell leukemia data O'Neill, Kieran
Acute myeloid leukemia (AML) is a high grade malignancy of non-lymphoid cells of the hematopoietic system. AML is a heterogeneous disease, and numerous attempts have been made to risk-stratify AML so that appropriate treatment can be offered. Single cell analysis methods could provide insights into the biology of AML leading to risk-stratified and functionally tailored treatments and hence improved outcomes. Recent advances in flow cytometry allow the simultaneous measurement of up to 17 antibody markers per cell for up to millions of cells, and it is performed routinely during AML clinical workup. However, despite vast amounts of flow cytometry data being gathered, comprehensive, objective and automated studies of this data have not been undertaken. Another method, strand-seq, elucidates template strand inheritance in single cells, with a range of potential applications, none of which had been automated when this thesis work commenced. I have developed bioinformatic methods enabling research into AML using both these types of data. I present flowBin, a method for faithfully recombining multitube flow cytometry data. I present flowType-DP, a new version of flowType, able to process flow cytometry and other single cell data having more than 12 markers (including flowBin output). I demonstrate the application of flowBin to AML data, for digitally isolating abnormal cells, and classifying AML patients. I also use flowBin in conjunction with flowType to find cell types associated with clinically relevant gene mutations in AML. I present BAIT, a software package for accurately detecting sister chromatid exchanges in strand-seq data. I present functionality to place unbridged contigs in late-build genomes into their correct location, and have, with collaborators, published the corrected locations of more than half the unplaced contigs in the current build of the mouse genome. I present contiBAIT, a software package for assembling early-build genomes which consist entirely of unanchored, unbridged contigs. ContiBAIT has the potential to dramatically improve the quality of many model organism genomes at low cost. These developments enable rapid, automated, objective and reproducible deep profiling of AML flow cytometry data, subclonal cell analysis of AML cytogenetics, and improvements to model organisms used in AML research.
Item Citations and Data
Attribution 2.5 Canada