4.6 Article

Assessment of Inter-Laboratory Differences in SARS-CoV-2 Consensus Genome Assemblies between Public Health Laboratories in Australia

期刊

VIRUSES-BASEL
卷 14, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/v14020185

关键词

SARS-CoV-2; whole-genome sequencing; Pango lineage; bioinformatics

类别

资金

  1. UNSW COVID-19 Rapid Response Research Initiative
  2. MRFF Investigator Grant [APP1173594]
  3. Cancer Institute NSW Early Career Fellowship [2018/ECF013]
  4. Juvenile Diabetes Research Foundation Postdoctoral Fellowship [3-PDF-2020-940-A-N]

向作者/读者索取更多资源

Whole-genome sequencing of SARS-CoV-2-positive specimens from different laboratories in Sydney, Australia showed predominantly concordant matched genome sequences with a median pairwise identity of 99.997%. Differences were mainly driven by ambiguous site content. Ignoring these differences may lead to significant variations in the number of defined clusters in epidemiological inference based on single nucleotide variant distances.
Whole-genome sequencing of viral isolates is critical for informing transmission patterns and for the ongoing evolution of pathogens, especially during a pandemic. However, when genomes have low variability in the early stages of a pandemic, the impact of technical and/or sequencing errors increases. We quantitatively assessed inter-laboratory differences in consensus genome assemblies of 72 matched SARS-CoV-2-positive specimens sequenced at different laboratories in Sydney, Australia. Raw sequence data were assembled using two different bioinformatics pipelines in parallel, and resulting consensus genomes were compared to detect laboratory-specific differences. Matched genome sequences were predominantly concordant, with a median pairwise identity of 99.997%. Identified differences were predominantly driven by ambiguous site content. Ignoring these produced differences in only 2.3% (5/216) of pairwise comparisons, each differing by a single nucleotide. Matched samples were assigned the same Pango lineage in 98.2% (212/216) of pairwise comparisons, and were mostly assigned to the same phylogenetic clade. However, epidemiological inference based only on single nucleotide variant distances may lead to significant differences in the number of defined clusters if variant allele frequency thresholds for consensus genome generation differ between laboratories. These results underscore the need for a unified, best-practices approach to bioinformatics between laboratories working on a common outbreak problem.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据