4.8 Article

DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences

Journal

NUCLEIC ACIDS RESEARCH
Volume 44, Issue 11, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkw226

Keywords

-

Funding

  1. National Institute of Biomedical Imaging and Bioengineering
  2. University of California, National Research Service Award [EB009418]
  3. Irvine, Center for Complex Biological Systems
  4. National Science Foundation Graduate Research Fellowship [DGE-1321846]
  5. National Institute of Health [HG006870]
  6. National Science Foundation

Ask authors/readers for more resources

Modeling the properties and functions of DNA sequences is an important, but challenging task in the broad field of genomics. This task is particularly difficult for non-coding DNA, the vast majority of which is still poorly understood in terms of function. A powerful predictive model for the function of non-coding DNA can have enormous benefit for both basic science and translational research because over 98% of the human genome is non-coding and 93% of disease-associated variants lie in these regions. To address this need, we propose DanQ, a novel hybrid convolutional and bi-directional long short-term memory recurrent neural network framework for predicting non-coding function de novo from sequence. In the DanQ model, the convolution layer captures regulatory motifs, while the recurrent layer captures long-term dependencies between the motifs in order to learn a regulatory 'grammar' to improve predictions. DanQ improves considerably upon other models across several metrics. For some regulatory markers, DanQ can achieve over a 50% relative improvement in the area under the precision-recall curve metric compared to related models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available