4.6 Article

Updated HIV-1 Consensus Sequences Change but Stay Within Similar Distance From Worldwide Samples

期刊

FRONTIERS IN MICROBIOLOGY
卷 12, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2021.828765

关键词

HIV; subtypes; consensus sequences; evolution; molecular epidemiology; pandemic

资金

  1. NIH/NIAID [R01AI087520]

向作者/读者索取更多资源

This study reconstructed 90 new HIV-1 subtype and CRF consensus sequences and compared them with worldwide HIV-1 genome sequences. The 2021 consensus sequences were found to be shorter than the 2002 sequences, but closer to the global genome sequences. The results suggest that the 2021 consensus sequences are likely good representations of the typical subtype/CRF genome nucleotide states.
HIV consensus sequences are used in various bioinformatic, evolutionary, and vaccine related research. Since the previous HIV-1 subtype and CRF consensus sequences were constructed in 2002, the number of publicly available HIV-1 sequences have grown exponentially, especially from non-EU and US countries. Here, we reconstruct 90 new HIV-1 subtype and CRF consensus sequences from 3,470 high-quality, representative, full genome sequences in the LANL HIV database. While subtypes and CRFs are unevenly spread across the world, in total 89 countries were represented. For consensus sequences that were based on at least 20 genomes, we found that on average 2.3% (range 0.8-10%) of the consensus genome site states changed from 2002 to 2021, of which about half were nucleotide state differences and the rest insertions and deletions. Interestingly, the 2021 consensus sequences were shorter than in 2002, and compared to 4,674 HIV-1 worldwide genome sequences, the 2021 consensuses were somewhat closer to the worldwide genome sequences, i.e., showing on average fewer nucleotide state differences. Some subtypes/CRFs have had limited geographical spread, and thus sampling of subtypes/CRFs is uneven, at least in part, due to the epidemiological dynamics. Thus, taken as a whole, the 2021 consensus sequences likely are good representations of the typical subtype/CRF genome nucleotide states. The new consensus sequences are available at the LANL HIV database.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据