4.6 Article

DNAscan: personal computer compatible NGS analysis, annotation and visualisation

期刊

BMC BIOINFORMATICS
卷 20, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12859-019-2791-8

关键词

Bioinformatics; Variant calling; Viral detection; Repeat expansion; Structural variants; Annotation; Next generation sequencing

资金

  1. EU Joint Programme -Neurodegenerative Disease Research (JPND) project
  2. Medical Research Council [MR/L501529/1, MR/R024804/1]
  3. Economic and Social Research Council [ES/L008238/1]
  4. Motor Neurone Disease Association - National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London
  5. European Community [633413]
  6. MND Association
  7. Wellcome Trust
  8. ESRC [ES/L008238/1] Funding Source: UKRI
  9. MRC [MC_PC_17214, MR/R024804/1, G0600974] Funding Source: UKRI
  10. Economic and Social Research Council [ES/L008238/1] Funding Source: researchfish
  11. Medical Research Council [MR/R024804/1, MR/L501529/1, G0600974] Funding Source: researchfish
  12. Motor Neurone Disease Association [Al-Chalabi/Apr15/844-791] Funding Source: researchfish

向作者/读者索取更多资源

BackgroundNext Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tools for a wide range of uses exist, therefore it is challenging to design an analysis pipeline. Secondly, NGS analysis is computationally intensive, requiring expensive infrastructure, and many medical and research centres do not have adequate high performance computing facilities and cloud computing is not always an option due to privacy and ownership issues. Finally, the interpretation of the results is not trivial and most available pipelines lack the utilities to favour this crucial step.ResultsWe have therefore developed a fast and efficient bioinformatics pipeline that allows for the analysis of DNA sequencing data, while requiring little computational effort and memory usage. DNAscan can analyse a whole exome sequencing sample in 1h and a 40x whole genome sequencing sample in 13h, on a midrange computer. The pipeline can look for single nucleotide variants, small indels, structural variants, repeat expansions and viral genetic material (or any other organism). Its results are annotated using a customisable variety of databases and are available for an on-the-fly visualisation with a local deployment of the gene.iobio platform. DNAscan is implemented in Python. Its code and documentation are available on GitHub: https://github.com/KHP-Informatics/DNAscan. Instructions for an easy and fast deployment with Docker and Singularity are also provided on GitHub.ConclusionsDNAscan is an extremely fast and computationally efficient pipeline for analysis, visualization and interpretation of NGS data. It is designed to provide a powerful and easy-to-use tool for applications in biomedical research and diagnostic medicine, at minimal computational cost. Its comprehensive approach will maximise the potential audience of users, bringing such analyses within the reach of non-specialist laboratories, and those from centres with limited funding available.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据