4.7 Article

iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework

Journal

BIOINFORMATICS
Volume 32, Issue 16, Pages 2411-2418

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btw186

Keywords

-

Funding

  1. National Natural Science Foundation of China [61300112, 61573118, 61272383]
  2. Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry
  3. Natural Science Foundation of Guangdong Province [2014A030313695]
  4. Shenzhen Foundational Research Funding [JCYJ20150626110425228]
  5. Development Program of China (863 Program) [2015AA015405]

Ask authors/readers for more resources

Motivation: Regulatory DNA elements are associated with DNase I hypersensitive sites (DHSs). Accordingly, identification of DHSs will provide useful insights for in-depth investigation into the function of noncoding genomic regions. Results: In this study, using the strategy of ensemble learning framework, we proposed a new predictor called iDHS-EL for identifying the location of DHS in human genome. It was formed by fusing three individual Random Forest (RF) classifiers into an ensemble predictor. The three RF operators were respectively based on the three special modes of the general pseudo nucleotide composition (PseKNC): (i) kmer, (ii) reverse complement kmer and (iii) pseudo dinucleotide composition. It has been demonstrated that the new predictor remarkably outperforms the relevant state-of-the-art methods in both accuracy and stability.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available