期刊
MOLECULAR BIOLOGY AND EVOLUTION
卷 30, 期 5, 页码 1159-1171出版社
OXFORD UNIV PRESS
DOI: 10.1093/molbev/mst019
关键词
molecular evolution; population genetics; noncoding DNA; regulatory sequences; probabilistic graphical models
资金
- David and Lucile Packard Fellowship for Science and Engineering
- National Science Foundation [DBI-0644111]
- National Institutes of Health (NIGMS) [GM102192]
- Cornell Center for Vertebrate Genomics
- NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM102192] Funding Source: NIH RePORTER
Complete genome sequences contain valuable information about natural selection, but this information is difficult to access for short, widely scattered noncoding elements such as transcription factor binding sites or small noncoding RNAs. Here, we introduce a new computational method, called Inference of Natural Selection from Interspersed Genomically coHerent elemenTs (INSIGHT), for measuring the influence of natural selection on such elements. INSIGHT uses a generative probabilistic model to contrast patterns of polymorphism and divergence in the elements of interest with those in flanking neutral sites, pooling weak information from many short elements in a manner that accounts for variation among loci in mutation rates and coalescent times. The method is able to disentangle the contributions of weak negative, strong negative, and positive selection based on their distinct effects on patterns of polymorphism and divergence. It obtains information about divergence from multiple outgroup genomes using a general statistical phylogenetic approach. The INSIGHT model is efficiently fitted to genome-wide data using an approximate expectation maximization algorithm. Using simulations, we show that the method can accurately estimate the parameters of interest even in complex demographic scenarios, and that it significantly improves on methods based on summary statistics describing polymorphism and divergence. To demonstrate the usefulness of INSIGHT, we apply it to several classes of human noncoding RNAs and to GATA2-binding sites in the human genome.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据