gofasta: command-line utilities for genomic epidemiology research

Journal

BIOINFORMATICS
Volume 38, Issue 16, Pages 4033-4035

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac424

Funding

  1. Medical Research Council (MRC) part of UK Research & Innovation (UKRI)
  2. National Institute for Health Research (NIHR) [MC_PC_19027]
  3. Genome Research Limited
  4. Wellcome Sanger Institute

gofasta is a set of command-line utilities designed for handling short assembled genomes in the context of genomic epidemiology. It was specifically developed for processing closely related SARS-CoV-2 viral genomes and can also be applied to other densely sampled pathogen genomic datasets. It offers functions to convert sam-format pairwise alignments to fasta format, annotate mutations in multiple sequence alignments, and extract sets of sequences based on genetic distance measures for outbreak investigations.
Summary: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic datasets. It provides functions to convert sam-format pairwise alignments between assembled genomes to fasta format; to annotate mutations in multiple sequence alignments; and to extract sets of sequences by genetic distance measures for use in outbreak investigations.
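
To make the idea of extracting sequences by genetic distance concrete, the following is a minimal Python sketch, not gofasta's own implementation or interface. It assumes sequences are already aligned to a common length, counts a difference only where both sequences carry an unambiguous base (A, C, G or T), and uses hypothetical names throughout (`snp_distance`, `within_distance`, and the sample identifiers).

```python
def snp_distance(a: str, b: str) -> int:
    """Count sites where two aligned sequences differ.

    Assumption for this sketch: only sites where both sequences have an
    unambiguous base (A, C, G, T) are compared; gaps and N are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    return sum(
        1
        for x, y in zip(a.upper(), b.upper())
        if x in valid and y in valid and x != y
    )


def within_distance(query: str, alignment: dict[str, str], max_dist: int) -> list[str]:
    """Return names of sequences within max_dist differences of the query, closest first."""
    hits = [(snp_distance(query, seq), name) for name, seq in alignment.items()]
    return [name for dist, name in sorted(hits) if dist <= max_dist]


# Hypothetical toy alignment; real genomes would be read from a fasta file.
alignment = {
    "sample_a": "ACGTACGTAC",
    "sample_b": "ACGTACGTAT",
    "sample_c": "ACNTAC-TAC",
    "sample_d": "TTGTACGAAT",
}
query = "ACGTACGTAC"
neighbours = within_distance(query, alignment, max_dist=1)
```

In an outbreak investigation, a filter of this kind would narrow a large alignment down to the genomes closest to a case of interest. Skipping ambiguous sites is one possible convention among several, and the choice changes the resulting distances.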
