4.7 Review

Predicting 3D chromatin interactions from DNA sequence using Deep Learning

Journal

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL
Volume 20, Issue -, Pages 3439-3448

Publisher

ELSEVIER
DOI: 10.1016/j.csbj.2022.06.0472001-0370

Keywords

3D Chromatin Interaction; Deep Learning; Epigenetics; Genome folding; Chromosome conformation capture (3C)

Funding

  1. Technical University of Munich Institute - German Excellence Initiative
  2. European Seventh Framework Programme [291763]
  3. SFB Sonderforschungsbereich924 of the Deutsche Forschungsgemeinschaft (DFG)
  4. DFG

Ask authors/readers for more resources

This article explores the application of Deep Learning models in gene regulation, analyzing their model accuracy, training strategies, and data preparation steps. It finds that transfer learning combined with functionally curated interactions is the most promising approach for learning cell-type specific sequence features in the future.
Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal cis -regulatory elements. Dissecting the sequence features that mediate such distal interactions is key to understanding their underlying biology. Deep Learning (DL) models coupled with genome-wide 3C -based sequencing data have emerged as powerful tools to infer the DNA sequence grammar underlying such distal interactions. In this review we show that most DL models have remarkably high prediction accuracy, which indicates that DNA sequence features are important determinants of chromatin looping. However, DL model training has so far been limited to a small set of human cell lines, raising questions about the generalization of these predictions to other tissue-types and species. Furthermore, we find that the model architecture seems less relevant for model performance than the training strategy and the data preparation step. Transfer learning, coupled with functionally curated interactions, appear to be the most promising approach to learn cell-type specific and possibly species-specific sequence features in future applications.(c) 2022 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creative-commons.org/licenses/by-nc-nd/4.0/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available