Abstract
Viruses with large double-stranded DNA genomes appear to have captured the majority of their genes from their hosts at different stages of evolution. The origin of many virus genes is readily detected through highly significant sequence similarity with cellular homologs. This is the case, in particular, for virus enzymes, such as DNA and RNA polymerases or nucleotide kinases, that retain their catalytic activity after capture by an ancestral virus. However, a large fraction of virus genes have no readily detectable cellular homologs, so their origin remains enigmatic. We sought to explore potential origins of proteins of unknown provenance encoded in the genomes of orthopoxviruses, a thoroughly studied virus genus that includes major human pathogens. To this end, we used AlphaFold2 to predict the structures of all 214 proteins encoded by orthopoxviruses. Among the proteins of unknown provenance, structure prediction yielded a clear indication of origin for 14 and validated several inferences previously made by sequence analysis. The major trend that emerges from these findings is the exaptation of enzymes from cellular organisms for non-enzymatic, structural roles in virus reproduction, accompanied by disruption of catalytic sites and by overall drastic divergence that precludes detection of homology at the sequence level. The 16 orthopoxvirus proteins found to be inactivated enzyme derivatives include the poxvirus replication processivity factor A20, an inactivated derivative of bacterial NAD-dependent DNA ligase; the major core protein A3, an inactivated deubiquitinase; F11, an inactivated prolyl hydroxylase; and other similar cases. However, for nearly one third of the orthopoxvirus virion proteins, no significantly similar structures were identified, suggesting exaptation with subsequent major structural rearrangement, yielding novel protein folds.
Pascal Mutz, Wolfgang Resch, Guilhem Faure, Tatiana G. Senkevich, Eugene V. Koonin, Bernard Moss (2022). Exaptation of inactivated host enzymes for structural roles in orthopoxviruses and novel protein folds revealed by protein structure modeling. DOI: https://doi.org/10.1101/2022.11.22.517515
Type: Preprint
Year: 2022
Authors: 6
Datasets: 0
Total Files: 0
Language: en
DOI: https://doi.org/10.1101/2022.11.22.517515