- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Research Data /
- Data from: Phylogenomics from whole genome sequences...
Open Collections
UBC Research Data
Data from: Phylogenomics from whole genome sequences using aTRAM Allen, Julie M.; Boyd, Bret; Nguyen, Nam-Phuong; Vachaspati, Pranjal; Warnow, Tandy; Huang, Daisie I.; Grady, Patrick G. S.; Bell, Kayce C.; Cronk, Quentin C.B.; Mugisha, Lawrence; Pittendrigh, Barry R.; Soledad Leonardi, M.; Reed, David L.; Johnson, Kevin P.
Description
Abstract
Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (<1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10−30×). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice (Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close- to distantly related taxa at high to low levels of coverage.
Usage notes
Concatenated alignment and tree
Item Metadata
| Title |
Data from: Phylogenomics from whole genome sequences using aTRAM
|
| Creator | |
| Date Issued |
2021-05-19
|
| Description |
Abstract
Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (<1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10−30×). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice (Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close- to distantly related taxa at high to low levels of coverage.; Usage notes Concatenated alignment and tree |
| Subject |
Other; Bureelia antiqua; Osborniella crotophagae; Pedicinus badii; gene assembly; Haematopinus eurysternus; Degeeriella rufa; Pthirus gorillae; Neohaematopinus pacificus; Pedicinus badii; Genome sequencing; Linognathus spicatus; Pthirus pubis; Pediculus humanus; Pediculus schaeffi; Proechinopthirus fluctus; aTRAM; Hoplopleura arboricola; Haematopinus eurysternus; Stimulopalpus japonicus; present day; Antarctopthirus microchir; Bothriometopus macrocnemus; Genome sequencing; Holocene
|
| Geographic Location | |
| Type | |
| Notes |
Dryad version number: 1 Version status: submitted Dryad curation status: Published Sharing link: https://datadryad.org/stash/share/PR_oEYNZ-85sEFxTV9xmFYeC22Nk8_UTs1QalxMiACs</p> Storage size: 8736762 Visibility: public |
| Date Available |
2020-06-24
|
| Provider |
University of British Columbia Library
|
| License |
CC0 1.0
|
| DOI |
10.14288/1.0397756
|
| URI | |
| Publisher DOI | |
| Rights URI | |
| Aggregated Source Repository |
Dataverse
|
Item Media
Item Citations and Data
License
CC0 1.0