Statistical analysis of RNA-seq data at different scales
Leek, Jeff


RNA-seq is now the primary technology used to measure transcriptional abundance. RNA-seq data can be analyzed at multiple levels (genes, regions, or transcripts) and at multiple scales (small experiments or large population cohorts). I will discuss statistical challenges in developing and applying software for the analysis of RNA-seq data at multiple scales, including reproducibility, statistical power, trust in genomic annotations, and the detection and removal of artifacts. These issues are critical in the analysis of data from genomic experiments in general, but they are particularly acute in the analysis of dynamic data from transcriptomes.

