4.6 Article

SD-MSAEs: Promoter recognition in human genome based on deep feature extraction

期刊

JOURNAL OF BIOMEDICAL INFORMATICS
卷 61, 期 -, 页码 55-62

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2016.03.018

关键词

Context features; Promoter recognition; Sparse autoencoder; Statistical divergence; Support vector machine

资金

  1. National Natural Science Foundation of China [61373093, 61402310]
  2. Natural Science Foundation of Jiangsu Province of China [BK20140008]
  3. Natural Science Foundation of the Jiangsu Higher Education Institutions of China [13KJA520001]
  4. Qing Lan Project
  5. National College Students Innovation and entrepreneurship training program of China [201410285032]
  6. Undergraduate Science Research Foundation of Soochow University of China [KY2015544B]
  7. 3I Project of Soochow University of China [29]

向作者/读者索取更多资源

The prediction and recognition of promoter in human genome play an important role in DNA sequence analysis. Entropy, in Shannon sense, of information theory is a multiple utility in bioinformatic details analysis. The relative entropy estimator methods based on statistical divergence (SD) are used to extract meaningful features to distinguish different regions of DNA sequences. In this paper, we choose context feature and use a set of methods of SD to select the most effective n-mers distinguishing promoter regions from other DNA regions in human genome. Extracted from the total possible combinations of n-mers, we can get four sparse distributions based on promoter and non-promoters training samples. The informative n-mers are selected by optimizing the differentiating extents of these distributions. Specially, we combine the advantage of statistical divergence and multiple sparse auto-encoders (MSAEs) in deep learning to extract deep feature for promoter recognition. And then we apply multiple SVMs and a decision model to construct a human promoter recognition method called SD-MSAE5. Framework is flexible that it can integrate new feature extraction or new classification models freely. Experimental results show that our method has high sensitivity and specificity. (C) 2016 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据