Open Collections
UBC Research Data
Data from: Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta, and Cryptista
Burki, Fabien; Kaplan, Maia; Tikhonenkov, Denis V.; Zlatogursky, Vasily; Minh, Bui Quang; Radaykina, Liudmila V.; Smirnov, Alexey; Mylnikov, Alexander P.; Keeling, Patrick J.
Description
Abstract
Assembling the global eukaryotic tree of life has long been a major effort of Biology. In recent years, pushed by the new availability of genome-scale data for microbial eukaryotes, it has become possible to revisit many evolutionary enigmas. However, some of the most ancient nodes, which are essential for inferring a stable tree, have remained highly controversial. Among other reasons, the lack of adequate genomic datasets for key taxa has prevented the robust reconstruction of early diversification events. In this context, the centrohelid heliozoans are particularly relevant for reconstructing the tree of eukaryotes because they represent one of the last substantial groups that was missing large and diverse genomic data. Here, we filled this gap by sequencing high-quality transcriptomes for four centrohelid lineages, each corresponding to a different family. Combining these new data with a broad eukaryotic sampling, we produced a gene-rich taxon-rich phylogenomic dataset that enabled us to refine the structure of the tree. Specifically, we show that (i) centrohelids relate to haptophytes, confirming Haptista; (ii) Haptista relates to SAR; (iii) Cryptista share strong affinity with Archaeplastida; and (iv) Haptista + SAR is sister to Cryptista + Archaeplastida. The implications of this topology are discussed in the broader context of plastid evolution.

Usage notes

Transcriptome assembly of Amastigomonas sp.
RNA-seq assembly of Amastigomonas sp. from GenBank SRA accession SRR2170627. Read quality was assessed with FastQC before and after quality trimming and SMART adaptors removal, which was performed with FastqMcf. Cleaned reads were assembled into contigs with Trinity r20140717 using default parameters.
File: Amastigomonas_sp_transcriptome.fasta.zip

Transcriptome assembly of Raineriophrys erinaceoides
RNA-seq assembly of Raineriophrys erinaceoides from GenBank SRA accession SRR2170634. Read quality was assessed with FastQC before and after quality trimming and SMART adaptors removal, which was performed with FastqMcf. Cleaned reads were assembled into contigs with Trinity r20140717 using default parameters.
File: Raineriophrys_erinaceoides_transcriptome.fasta.zip

Transcriptome assembly of Choanocystis sp.
RNA-seq assembly of Choanocystis sp. from GenBank SRA accession SRR2170626. Read quality was assessed with FastQC before and after quality trimming and SMART adaptors removal, which was performed with FastqMcf. Cleaned reads were assembled into contigs with Trinity r20140717 using default parameters.
File: Choanocystis_sp_transcriptome.fasta.zip

Transcriptome assembly of Acanthocystis sp.
RNA-seq assembly of Acanthocystis sp. from GenBank SRA accession SRR2170625. Read quality was assessed with FastQC before and after quality trimming and SMART adaptors removal, which was performed with FastqMcf. Cleaned reads were assembled into contigs with Trinity r20140717 using default parameters.
File: Acanthocystis_sp_transcriptome.fasta.zip

Transcriptome assembly of Raphidiophrys heterophryoidea
RNA-seq assembly of Raphidiophrys heterophryoidea from GenBank SRA accession SRR2170621. Read quality was assessed with FastQC before and after quality trimming and SMART adaptors removal, which was performed with FastqMcf. Cleaned reads were assembled into contigs with Trinity r20140717 using default parameters.
File: Raphidiophrys_heterophryoidea_transcriptome.fasta.zip

Trimmed alignment
Trimmed alignment of all 250 genes. BMGE was used for trimming, following MAFFT-LINSI for automatic alignment.
File: fasta_trimmed.zip

Untrimmed sequences
Fasta files of all 250 genes containing untrimmed sequences.
File: fasta_untrimmed.zip

Single-gene phylogenetic trees
RAxML phylogenetic trees of all 250 genes.
File: trees.zip
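The per-gene FASTA archives described above can be inspected without unpacking them. The following is a minimal Python sketch, not part of the deposited dataset: it assumes each archive member is a plain-text FASTA file (the internal layout of the archives is not documented in this record), and the function names `parse_fasta` and `summarize_archive` are hypothetical helpers introduced only for illustration.

```python
import io
import zipfile
from typing import Dict, Iterator, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize_archive(archive) -> Dict[str, Tuple[int, int, bool]]:
    """Map each FASTA member to (n_sequences, max_length, all_same_length).

    `archive` is a path or a binary file-like object. Members that
    contain no FASTA records are skipped (assumption: such members
    are not sequence files).
    """
    summary = {}
    with zipfile.ZipFile(archive) as zf:
        for name in sorted(zf.namelist()):
            if name.endswith("/"):
                continue  # directory entry
            text = zf.read(name).decode("utf-8", errors="replace")
            lengths = [len(seq) for _, seq in parse_fasta(text)]
            if not lengths:
                continue
            same = len(set(lengths)) == 1
            summary[name] = (len(lengths), max(lengths), same)
    return summary
```

Calling `summarize_archive("fasta_trimmed.zip")` on a downloaded copy would report, per gene file, how many taxa are present and whether all sequences share one length. Equal lengths are what one would expect of an aligned and trimmed file, whereas the untrimmed sequences in `fasta_untrimmed.zip` would probably differ in length.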
Item Metadata
Title |
Data from: Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta, and Cryptista
|
Creator | |
Date Issued |
2021-05-19
|
Subject | |
Type | |
Notes |
Dryad version number: 1
Version status: submitted
Dryad curation status: Published
Sharing link: https://datadryad.org/stash/share/6K39Xsr7oVuH9nZwC3hfL9o8lGzjc074sYfSb4_L7Fw
Storage size: 52602197
Visibility: public
|
Date Available |
2020-06-24
|
Provider |
University of British Columbia Library
|
License |
CC0 1.0
|
DOI |
10.14288/1.0397977
|
URI | |
Publisher DOI | |
Rights URI | |
Aggregated Source Repository |
Dataverse
|