4.8 Article

Expanding the repertoire of DNA shape features for genome-scale studies of transcription factor binding

期刊

NUCLEIC ACIDS RESEARCH
卷 45, 期 22, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkx1145

关键词

-

资金

  1. Andrew J. Viterbi Fellowship
  2. USC Graduate School, Research Enhancement Fellowship and Manning Endowed Fellowship
  3. National Institutes of Health [R01GM106056, U01GM103804, R01HG003008]
  4. Alfred P. Sloan Foundation
  5. European Union [676556]

向作者/读者索取更多资源

Uncovering the mechanisms that affect the binding specificity of transcription factors (TFs) is critical for understanding the principles of gene regulation. Although sequence-based models have been used successfully to predict TF binding specificities, we found that including DNA shape information in these models improved their accuracy and interpretability. Previously, we developed a method for modeling DNA binding specificities based on DNA shape features extracted from Monte Carlo (MC) simulations. Prediction accuracies of our models, however, have not yet been compared to accuracies of models incorporating DNA shape information extracted from X-ray crystallography (XRC) data or Molecular Dynamics (MD) simulations. Here, we integrated DNA shape information extracted from MC or MD simulations and XRC data into predictive models of TF binding and compared their performance. Models that incorporated structural information consistently showed improved performance over sequence-based models regardless of data source. Furthermore, we derived and validated nine additional DNA shape features beyond our original set of four features. The expanded repertoire of 13 distinct DNA shape features, including six intra-base pair and six inter-base pair parameters and minor groove width, is available in our R/Bioconductor package DNAshapeR and enables a comprehensive structural description of the double helix on a genome-wide scale.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据