UBC Research Data

Data from: Genomics of Compositae crops: reference transcriptome assemblies, and evidence of hybridization with wild relatives

Hodgins, Kathryn A.; Lai, Zhao; Oliveira, Luiz O.; Still, David W.; Scascitelli, Moira; Barker, Michael S.; Kane, Nolan C.; Dempewolf, Hannes; Kozik, Alex; Kesseli, Richard V.; et al.

Description

Abstract

Although the Compositae harbours only two major food crops, sunflower and lettuce, many other species in this family are utilized by humans and have experienced various levels of domestication. Here we have used next-generation sequencing technology to develop 15 reference transcriptome assemblies for Compositae crops or their wild relatives. These data allow us to gain insight into the evolutionary and genomic consequences of plant domestication. Specifically, we performed Illumina sequencing of Cichorium endivia, Cichorium intybus, Echinacea angustifolia, Iva annua, Helianthus tuberosus, Dahlia hybrida, Leontodon taraxacoides and Glebionis segetum, as well as 454 sequencing of Guizotia scabra, Stevia rebaudiana, Parthenium argentatum and Smallanthus sonchifolius. Illumina reads were assembled using Trinity, and 454 reads were assembled using MIRA and CAP3. We evaluated the coverage of the transcriptomes using BLASTX analysis of a set of ultra-conserved orthologs (UCOs) and recovered most of these genes (88-98%). We found a correlation between contig length and read length for the 454 assemblies, and greater contig lengths for the 454 compared to the Illumina assemblies. This suggests that longer reads can aid in the assembly of more complete transcripts. Finally, we compared the divergence of orthologs at synonymous sites (Ks) between Compositae crops and their wild relatives and found greater divergence when the progenitors were self-incompatible. We also found greater divergence between pairs of taxa that had some evidence of post-zygotic isolation. For several more distantly related congeners, such as chicory and endive, we identified a signature of introgression in the distribution of Ks values.

Usage notes

The 15 reference transcriptome assemblies of Compositae crops and their wild relatives
dryad_submission.tar

Licence

This dataset is made available under a Creative Commons CC0 license with the following additional/modified terms and conditions: CC0 Waiver