UBC Research Data

Data from: Recent non-hybrid origin of sunflower ecotypes in a novel habitat Rieseberg, Loren H.; Andrew, Rose L.; Kane, Nolan C.; Baute, Greg J.; Grassa, Christopher J.

Description

<b>Abstract</b><br/>The genomics of local adaptation is an increasingly active field, providing insights into the forces driving ecological speciation and the repeatability of evolution. Demography and gene flow play an important role in determining the paths by which parallel evolution occurs and the genomic signatures of adaptation. In the annual sunflowers, hybridization between species has repeatedly led to the colonization of extreme habitats, such as sand dunes. In a new case of adaptation to sand dunes that occurs in populations of H. petiolaris growing at Great Sand Dunes National Park and Preserve (Colorado), we wished to determine the age and long-term migration patterns of the system, as well as its ancestry. We addressed these questions with restriction-associated DNA (RAD) sequence data, aligned to a reference transcriptome. In an isolation with migration model using RAD sequences, coalescent analysis showed that the dune ecotype originated since the last ice age, which is very recent compared with the hybrid dune species, H. anomalus. Large effective population sizes and substantial numbers of gene migrants per generation between dune and nondune ecotypes explained the highly heterogeneous divergence observed among loci. Analysis of RAD-derived SNPs identified heterogeneous divergence between the dune and nondune ecotypes, as well as identifying its nearest relative. Our results did not support the hypothesis that the dune ecotype has hybrid ancestry, suggesting that adaptation of sunflowers to dunes has occurred by multiple mechanisms. The ancestry and long-term history of gene flow between incipient sunflower species provides valuable context for our understanding of ecological speciation and parallel adaptation.; <b>Usage notes</b><br /><div class="o-metadata__file-usage-entry"><h4 class="o-heading__level3-file-title">GSD_RAD_SNPs</h4><div class="o-metadata__file-description">GSD_RAD_SNPs contains SNP data, formatted as codominant nucleotide genotypes (e.g. "AT", "CC" etc.) and with missing data marked as "-"). The first two columns give the reference contig where each SNP aligned and its position. The final column gives Nei's Fst between the dune and non-dune ecotypes, estimated by the 'basic.stats' function in the HIERFSTAT package of the R statistical software. The data are sorted on this column. The data columns are as follows: 50 dune individuals, 15 intermediate, 35 non-dune, 10 H. petiolaris and 10 H. annuus.</div><div class="o-metadata__file-name"></div></div><div class="o-metadata__file-usage-entry"><h4 class="o-heading__level3-file-title">gsd_rad_structure_data</h4><div class="o-metadata__file-description">Zip file containing files used for STRUCTURE analyses presented in Figures 4, S1, S2, S3 and S4.</div><div class="o-metadata__file-name"></div><div class="o-metadata__file-name"></div></div><div class="o-metadata__file-usage-entry"><h4 class="o-heading__level3-file-title">gsd75_sample100_3</h4><div class="o-metadata__file-description">Data file for IMa2 analysis.</div><div class="o-metadata__file-name"></div><div class="o-metadata__file-name"></div></div>

Item Media