4.7 Article

G4detector: Convolutional Neural Network to Predict DNA G-Quadruplexes

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2021.3073595

Keywords

Genomics; Bioinformatics; DNA; Task analysis; RNA; Benchmark testing; Protocols; Bioinformatics; deep learning; G-quadruplexes; convolutional neural networks

Funding

  1. Israeli Council for Higher Education (CHE) via the Data Science Research Center, Ben-Gurion University of the Negev, Israel

Ask authors/readers for more resources

This article introduces G4detector, a method based on convolutional neural network, to predict G4 structures in DNA sequences. The method improves prediction accuracy by incorporating RNA secondary structure information and has been shown to outperform existing methods on benchmark datasets.
G-quadruplexes (G4s) are nucleic acid secondary structures that form within guanine-rich DNA or RNA sequences. G4 formation can affect chromatin architecture and gene regulation, and has been associated with genomic instability, genetic diseases, and cancer progression. The experimental data produced by the G4-seq experiment provides unprecedented details on G4 formation in the genome. Still, running the experimental protocol on a whole genome is an expensive and time-consuming process. Thus, it is highly desirable to have a computational method to predict G4 formation in new DNA sequences or whole genomes. Here, we present G4detector, a new method based on a convolutional neural network to predict G4s from DNA sequences. On top of the sequence information, we improved prediction accuracy by the addition of RNA secondary structure information. To train and test G4detector, we compiled novel high-throughput benchmarks over multiple species genomes measured by the G4-seq protocol. We show that G4detector outperforms extant methods for the same task on all benchmark datasets, can detect G4s genome-wide with high accuracy, and is able to extrapolate human-trained measurements to various non-human species. The code and benchmarks are publicly available on github.com/OrensteinLab/G4detector.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available