- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Research Data /
- High-density genetic linkage mapping in Sitka spruce...
Open Collections
UBC Research Data
High-density genetic linkage mapping in Sitka spruce advances the integration of genomic resources in conifers Tumas, Hayley; Ilska, Joanna J.; Gérardi, Sebastien; Laroche, Jerome; A'Hara, Stuart; Boyle, Brian; Janes, Mateja; McLean, Paul; Lopez, Gustavo; Lee, Steve J.; Cottrell, Joan; Gorjanc, Gregor; Bousquet, Jean; Wolliams, John A.; Mackay, John J.
Description
<b>Abstract</b><br/><p class="paragraph" style="line-height:150%;vertical-align:baseline;"><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">In species with large and complex genomes such as conifers, dense linkage maps are a useful for supporting genome assembly and laying the genomic groundwork at the structural, populational and functional levels. However, most of the 600+ extant conifer species still lack extensive genotyping resources, which hampers the development of high-density linkage maps. In this study, </span><span class="normaltextrun"><span lang="EN-GB" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">we developed a linkage map relying on 21,570 SNP makers in </span></span><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">Sitka spruce (<em>Picea sitchensis</em> [Bong.] Carr.)</span><span class="normaltextrun"><em><span lang="EN-GB" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">, </span></em></span><span class="normaltextrun"><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">a long-lived conifer from western North America that is widely planted for productive forestry in the British Isles. </span></span><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">We used a single-step mapping approach to efficiently combine RAD-Seq and genotyping array SNP data for 528 individuals from two full-sib families. As expected for spruce taxa, the saturated map contained 12 linkages groups with a total length of 2,142 cM. The positioning of 5,414 unique gene coding sequences allowed us to compare our map with that of other Pinaceae species, which provided evidence for high levels of synteny and gene order conservation in this family. We then developed an integrated map for <em>P. sitchensis</em> and <em>P. glauca</em> based on 27,052 makers and 11,609 gene sequences. Altogether, these two linkage maps, the accompanying catalog of 286,159 SNPs and the genotyping chip developed herein opens new perspectives for a variety of fundamental and more applied research objectives, such as for the improvement of spruce genome assemblies, or for marker-assisted sustainable management of genetic resources in Sitka spruce and related species.</span></p>; <b>Methods</b><br /><p class="MsoNormal">The data included in this dataset is genotypic data for two full-sib families of Sitka spruce (<em>Picea sitchensis</em>) in the United Kingdom and resulting linkage maps for the species. Samples for DNA extraction and genotyping were collected from two full-sib genetic field trials as described in the accompanying publication. A SNP Chip array was developed for this work using exome capture. A subset of the samples had been genotyped using RAD Seq from a previous project (Fuentes-Utrilla et al 2017). The dataset includes information on the SNP array developed for the project and genotype data that has been filtered for missingness and minor allele frequency. Final results are in the form of linkage maps stored in csv files. Further information on collection methods and processing are detailed in the accompanying manuscript and scripts for data processing are available on GitHub (<a href="https://github.com/HayleyTumas/SitkaLinkageMap">https://github.com/HayleyTumas/SitkaLinkageMap</a>).</p>; <b>Usage notes</b><br /><p class="MsoNormal">All files should be able to be opened using open access, freely available software. All tabular data are CSV or text files for the larger genotype data files. Code files are stored as bash script and can be opened using any text editor or in .R files that can be opened using the freely available R software.</p>
Item Metadata
Title |
High-density genetic linkage mapping in Sitka spruce advances the integration of genomic resources in conifers
|
Creator | |
Date Issued |
2024-01-25
|
Description |
<b>Abstract</b><br/><p class="paragraph" style="line-height:150%;vertical-align:baseline;"><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">In species with large and complex genomes such as conifers, dense linkage maps are a useful for supporting genome assembly and laying the genomic groundwork at the structural, populational and functional levels. However, most of the 600+ extant conifer species still lack extensive genotyping resources, which hampers the development of high-density linkage maps. In this study, </span><span class="normaltextrun"><span lang="EN-GB" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">we developed a linkage map relying on 21,570 SNP makers in </span></span><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">Sitka spruce (<em>Picea sitchensis</em> [Bong.] Carr.)</span><span class="normaltextrun"><em><span lang="EN-GB" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">, </span></em></span><span class="normaltextrun"><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">a long-lived conifer from western North America that is widely planted for productive forestry in the British Isles. </span></span><span lang="EN-US" style="font-size:11.0pt;line-height:150%;font-family:'Calibri' , sans-serif;">We used a single-step mapping approach to efficiently combine RAD-Seq and genotyping array SNP data for 528 individuals from two full-sib families. As expected for spruce taxa, the saturated map contained 12 linkages groups with a total length of 2,142 cM. The positioning of 5,414 unique gene coding sequences allowed us to compare our map with that of other Pinaceae species, which provided evidence for high levels of synteny and gene order conservation in this family. We then developed an integrated map for <em>P. sitchensis</em> and <em>P. glauca</em> based on 27,052 makers and 11,609 gene sequences. Altogether, these two linkage maps, the accompanying catalog of 286,159 SNPs and the genotyping chip developed herein opens new perspectives for a variety of fundamental and more applied research objectives, such as for the improvement of spruce genome assemblies, or for marker-assisted sustainable management of genetic resources in Sitka spruce and related species.</span></p>; <b>Methods</b><br /><p class="MsoNormal">The data included in this dataset is genotypic data for two full-sib families of Sitka spruce (<em>Picea sitchensis</em>) in the United Kingdom and resulting linkage maps for the species. Samples for DNA extraction and genotyping were collected from two full-sib genetic field trials as described in the accompanying publication. A SNP Chip array was developed for this work using exome capture. A subset of the samples had been genotyped using RAD Seq from a previous project (Fuentes-Utrilla et al 2017). The dataset includes information on the SNP array developed for the project and genotype data that has been filtered for missingness and minor allele frequency. Final results are in the form of linkage maps stored in csv files. Further information on collection methods and processing are detailed in the accompanying manuscript and scripts for data processing are available on GitHub (<a href="https://github.com/HayleyTumas/SitkaLinkageMap">https://github.com/HayleyTumas/SitkaLinkageMap</a>).</p>; <b>Usage notes</b><br /><p class="MsoNormal">All files should be able to be opened using open access, freely available software. All tabular data are CSV or text files for the larger genotype data files. Code files are stored as bash script and can be opened using any text editor or in .R files that can be opened using the freely available R software.</p>
|
Subject | |
Type | |
Notes |
Dryad version number: 7</p> Version status: submitted</p> Dryad curation status: Published</p> Sharing link: http://datadryad.org/stash/dataset/doi:10.5061/dryad.ghx3ffbv6</p> Storage size: 149206418</p> Visibility: public</p> |
Date Available |
2024-01-20
|
Provider |
University of British Columbia Library
|
License |
CC0 1.0
|
DOI |
10.14288/1.0438904
|
URI | |
Publisher DOI | |
Grant Funding Agency |
Biotechnology and Biological Sciences Research Council; Genome Canada
|
Aggregated Source Repository |
Dataverse
|
Item Media
Item Citations and Data
Licence
CC0 1.0