4.7 Article

Generalized Query-Based Active Learning to Identify Differentially Methylated Regions in DNA

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2013.38

Keywords

Active learning; generalized query; DNA methylation; bioinformatics

Funding

  1. NIA NIH HHS [R25 AG046114] Funding Source: Medline
  2. NIEHS NIH HHS [R01 ES012974] Funding Source: Medline

Abstract

Active learning is a supervised learning technique that reduces the number of examples required to build a successful classifier, because the learner can choose the data it learns from. This technique holds promise for many biological domains in which classified examples are expensive and time-consuming to obtain. Most traditional active learning methods pose very specific queries: they ask the Oracle (e.g., a human expert) to label a single unlabeled example. The example may consist of numerous features, many of which are irrelevant. Removing such features creates a shorter query that contains only relevant features and is easier for the Oracle to answer. We propose a generalized query-based active learning (GQAL) approach that constructs generalized queries based on multiple instances. By constructing appropriately generalized queries, we can achieve higher accuracy than traditional active learning methods. We apply our active learning method to find differentially methylated regions (DMRs) in DNA. DMRs are locations in the genome that are known to be involved in tissue differentiation, epigenetic regulation, and disease. We also apply our method to 13 other data sets and show that it performs better than another popular active learning technique.
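
The idea of a generalized query can be illustrated with a small sketch. The code below is not the GQAL algorithm from the paper, whose details the abstract does not give. It is a toy pool-based loop under stated assumptions: features are binary, the set of relevant feature indices (`relevant`) is already known, uncertainty is a simple nearest-neighbor margin, and one Oracle answer is applied to every pool instance that matches the query. All function names are hypothetical.

```python
from typing import Callable, List, Optional, Set, Tuple

Instance = Tuple[int, ...]
Query = Tuple[Optional[int], ...]  # None marks a "don't care" feature
Labeled = List[Tuple[Instance, int]]


def hamming(a: Instance, b: Instance) -> int:
    """Number of positions at which two instances differ."""
    return sum(x != y for x, y in zip(a, b))


def margin(x: Instance, labeled: Labeled) -> int:
    """Toy uncertainty score: gap between the distance to the nearest
    positive and the nearest negative labeled example (small = uncertain).
    Assumes the labeled set contains at least one example of each class."""
    d_pos = min(hamming(x, z) for z, y in labeled if y == 1)
    d_neg = min(hamming(x, z) for z, y in labeled if y == 0)
    return abs(d_pos - d_neg)


def generalize(x: Instance, relevant: Set[int]) -> Query:
    """Keep relevant feature values; replace the rest with None."""
    return tuple(v if i in relevant else None for i, v in enumerate(x))


def matches(query: Query, x: Instance) -> bool:
    """True if x agrees with the query on every non-wildcard feature."""
    return all(q is None or q == v for q, v in zip(query, x))


def generalized_query_loop(
    pool: List[Instance],
    seed: Labeled,
    relevant: Set[int],          # assumed known here; an illustration only
    oracle: Callable[[Query], int],
    budget: int,
) -> Labeled:
    """Ask up to `budget` generalized queries and return the labeled set."""
    pool, labeled = list(pool), list(seed)
    for _ in range(budget):
        if not pool:
            break
        pick = min(pool, key=lambda x: margin(x, labeled))  # most uncertain
        query = generalize(pick, relevant)
        answer = oracle(query)
        # One answer labels every pool instance the query covers.
        labeled.extend((x, answer) for x in pool if matches(query, x))
        pool = [x for x in pool if not matches(query, x)]
    return labeled


# Toy usage: three binary features, only feature 0 determines the label.
seed = [((0, 0, 0), 0), ((1, 1, 1), 1)]
pool = [(0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1), (1, 1, 0)]
result = generalized_query_loop(pool, seed, {0}, lambda q: q[0], budget=2)
```

In this toy setting two generalized queries label all six pool instances, whereas instance-level queries would need one question per instance. In practice the relevant features are not known in advance and would have to be estimated, which is the harder part of the problem.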
