- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Faculty Research and Publications /
- Structural characterization of genomes by large scale...
Open Collections
UBC Faculty Research and Publications
Structural characterization of genomes by large scale sequence-structure threading Cherkasov, Artem; Jones, Steven J. M.
Abstract
Background: Using sequence-structure threading we have conducted structural characterization of complete proteomes of 37 archaeal, bacterial and eukaryotic organisms (including worm, fly, mouse and human) totaling 167,888 genes. Results The reported data represent first rather general evaluation of performance of full sequence-structure threading on multiple genomes providing opportunity to evaluate its general applicability for large scale studies. According to the estimated results the sequence-structure threading has assigned protein folds to more then 60% of eukaryotic, 68% of archaeal and 70% of bacterial proteomes. The repertoires of protein classes, architectures, topologies and homologous superfamilies (according to the CATH 2.4 classification) have been established for distant organisms and superkingdoms. It has been found that the average abundance of CATH classes decreases from "alpha and beta" to "mainly beta", followed by "mainly alpha" and "few secondary structures". 3-Layer (aba) Sandwich has been characterized as the most abundant protein architecture and Rossman fold as the most common topology. Conclusion The analysis of genomic occurrences of CATH 2.4 protein homologous superfamilies and topologies has revealed the power-law character of their distributions. The corresponding double logarithmic "frequency – genomic occurrence" dependences characteristic of scale-free systems have been established for individual organisms and for three superkingdoms. Supplementary materials to this works are available at [1].
Item Metadata
Title |
Structural characterization of genomes by large scale sequence-structure threading
|
Creator | |
Publisher |
BioMed Central
|
Date Issued |
2004-04-03
|
Description |
Background:
Using sequence-structure threading we have conducted structural characterization of complete proteomes of 37 archaeal, bacterial and eukaryotic organisms (including worm, fly, mouse and human) totaling 167,888 genes.
Results
The reported data represent first rather general evaluation of performance of full sequence-structure threading on multiple genomes providing opportunity to evaluate its general applicability for large scale studies.
According to the estimated results the sequence-structure threading has assigned protein folds to more then 60% of eukaryotic, 68% of archaeal and 70% of bacterial proteomes.
The repertoires of protein classes, architectures, topologies and homologous superfamilies (according to the CATH 2.4 classification) have been established for distant organisms and superkingdoms. It has been found that the average abundance of CATH classes decreases from "alpha and beta" to "mainly beta", followed by "mainly alpha" and "few secondary structures".
3-Layer (aba) Sandwich has been characterized as the most abundant protein architecture and Rossman fold as the most common topology.
Conclusion
The analysis of genomic occurrences of CATH 2.4 protein homologous superfamilies and topologies has revealed the power-law character of their distributions. The corresponding double logarithmic "frequency – genomic occurrence" dependences characteristic of scale-free systems have been established for individual organisms and for three superkingdoms.
Supplementary materials to this works are available at [1].
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2015-11-05
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution 4.0 International (CC BY 4.0)
|
DOI |
10.14288/1.0220537
|
URI | |
Affiliation | |
Citation |
BMC Bioinformatics. 2004 Apr 03;5(1):37
|
Publisher DOI |
10.1186/1471-2105-5-37
|
Peer Review Status |
Reviewed
|
Scholarly Level |
Faculty
|
Copyright Holder |
Cherkasov and Jones
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution 4.0 International (CC BY 4.0)