UBC Research Data

Data from: Genomic prediction accuracies in space and time for height and wood density of Douglas-fir using exome capture as the genotyping platform Thistlethwaite, Frances R.; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; Stoehr, Michael U.; El-Kassaby, Yousry A.

Description

<b>Abstract</b><br/>Background Genomic selection (GS) can offer unprecedented gains, in terms of cost efficiency and generation turnover, to forest tree selective breeding; especially for late expressing and low heritability traits. Here, we used: 1) exome capture as a genotyping platform for 1372 Douglas-fir trees representing 37 full-sib families growing on three sites in British Columbia, Canada and 2) height growth and wood density (EBVs), and deregressed estimated breeding values (DEBVs) as phenotypes. Representing models with (EBVs) and without (DEBVs) pedigree structure. Ridge regression best linear unbiased predictor (RR-BLUP) and generalized ridge regression (GRR) were used to assess their predictive accuracies over space (within site, cross-sites, multi-site, and multi-site to single site) and time (age-age/ trait-trait). Results The RR-BLUP and GRR models produced similar predictive accuracies across the studied traits. Within-site GS prediction accuracies with models trained on EBVs were high (RR-BLUP: 0.79–0.91 and GRR: 0.80–0.91), and were generally similar to the multi-site (RR-BLUP: 0.83–0.91, GRR: 0.83–0.91) and multi-site to single-site predictive accuracies (RR-BLUP: 0.79–0.92, GRR: 0.79–0.92). Cross-site predictions were surprisingly high, with predictive accuracies within a similar range (RR-BLUP: 0.79–0.92, GRR: 0.78–0.91). Height at 12 years was deemed the earliest acceptable age at which accurate predictions can be made concerning future height (age-age) and wood density (trait-trait). Using DEBVs reduced the accuracies of all cross-validation procedures dramatically, indicating that the models were tracking pedigree (family means), rather than marker-QTL LD. Conclusions While GS models’ prediction accuracies were high, the main driving force was the pedigree tracking rather than LD. It is likely that many more markers are needed to increase the chance of capturing the LD between causal genes and markers.; <b>Usage notes</b><br /><div class="o-metadata__file-usage-entry"><h4 class="o-heading__level3-file-title">Douglas-fir exomic SNP file</h4><div class="o-metadata__file-description">Exomic genotype file for Douglas-fir produced by RAPiD Genomics© , containing 74199 biallelic SNPs with less than 40% missing data and minQ=10.</div><div class="o-metadata__file-name">FiltStep1_minQ10_Gen.txt</br></div></div><div class="o-metadata__file-usage-entry"><h4 class="o-heading__level3-file-title">Douglas-fir phenotypes</h4><div class="o-metadata__file-description">Phenotypic measurements from Douglas-fir trial over 3 sites in British Columbia, Canada. Courtesy of Forests, Lands and Natural Resource Operations, BC, Canada.</div><div class="o-metadata__file-name">DF_phenotypes_dryad.txt</br></div></div>

Item Media

Item Citations and Data

Licence

This dataset is made available under a Creative Commons CC0 license with the following additional/modified terms and conditions: CC0 Waiver