期刊
MOLECULAR BIOLOGY AND EVOLUTION
卷 38, 期 12, 页码 5211-5224出版社
OXFORD UNIV PRESS
DOI: 10.1093/molbev/msab246
关键词
SARS-CoV-2; COVID-19; Sequence Read Archive; phylogenetics; forensic bioinformatics
资金
- NIH Office of Research Infrastructure Programs [S10OD028685]
The study reveals that a data set containing early Wuhan epidemic SARS-CoV-2 sequences deleted from the NIH's Sequence Read Archive has been recovered and analyzed, suggesting that the sequences from Huanan Seafood Market do not fully represent the early viruses in Wuhan. It is suggested that the progenitor of known SARS-CoV-2 sequences likely had three mutations making it more similar to bat coronavirus relatives than the market viruses.
The origin and early spread of SARS-CoV-2 remains shrouded in mystery. Here, I identify a data set containing SARS-CoV-2 sequences from early in the Wuhan epidemic that has been deleted from the NIH's Sequence Read Archive. I recover the deleted files from the Google Cloud and reconstruct partial sequences of 13 early epidemic viruses. Phylogenetic analysis of these sequences in the context of carefully annotated existing data further supports the idea that the Huanan Seafood Market sequences are not fully representative of the viruses in Wuhan early in the epidemic. Instead, the progenitor of currently known SARS-CoV-2 sequences likely contained three mutations relative to the market viruses that made it more similar to SARS-CoV-2's bat coronavirus relatives.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据