4.7 Article

Non-standard bioinformatics characterization of SARS-CoV-2

Journal

COMPUTERS IN BIOLOGY AND MEDICINE
Volume 131, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2021.104247

Keywords

Alignment-free methods; Moments of inertia; Similarity; dissimilarity analysis of DNA; RNA; sequences

Ask authors/readers for more resources

The non-standard bioinformatics method, 4D-Dynamic Representation of DNA/RNA Sequences, successfully analyzes genome sequences, such as the 2019 novel coronavirus, and suggests the origin of SARS-CoV-2 from bats and pangolins through distribution of points in 3D classification maps. This approach also proves applicable to describing the time evolution of virus genome sequences, like Zika virus.
A non-standard bioinformatics method, 4D-Dynamic Representation of DNA/RNA Sequences, aiming at an analysis of the information available in nucleotide databases, has been formulated. The sequences are represented by sets of ?material points? in a 4D space -4D-dynamic graphs. The graphs representing the sequences are treated as ?rigid bodies? and characterized by values analogous to the ones used in the classical dynamics. As the graphical representations of the sequences, the projections of the graphs into 2D and 3D spaces are used. The method has been applied to an analysis of the complete genome sequences of the 2019 novel coronavirus. As a result, 2D and 3D classification maps are obtained. The coordinate axes in the maps correspond to the values derived from the exact formulas characterizing the graphs: the coordinates of the centers of mass and the 4D moments of inertia. The points in the maps represent sequences and their coordinates are used as the classifiers. The main result of this work has been derived from the 3D classification maps. The distribution of clusters of points which emerged in these maps, supports the hypothesis that SARS-CoV-2 may have originated in bat and in pangolin. Pilot calculations for Zika virus sequence data prove that the proposed approach is also applicable to a description of time evolution of genome sequences of viruses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available