Article

RAREsim: A simulation method for very rare genetic variants

Journal

AMERICAN JOURNAL OF HUMAN GENETICS
Volume 109, Issue 4, Pages 680-691

Publisher

CELL PRESS
DOI: 10.1016/j.ajhg.2022.02.009

Funding

  1. National Human Genome Research Institute [R35HG011293, U01HG009080, U01HG009080-05S1]
  2. [42614]

Identification of rare-variant associations is crucial for understanding the genetic architecture of complex traits and diseases. Existing simulation methods have limitations in using real-variant annotation and accurately estimating the number of rare variants. This study presents a flexible and accurate rare-variant simulation algorithm, RAREsim, that can simulate the expected variant distribution and provide real-variant annotations.
Identification of rare-variant associations is crucial to full characterization of the genetic architecture of complex traits and diseases. Essential in this process is the evaluation of novel methods in simulated data that mirror the distribution of rare variants and haplotype structure in real data. Additionally, importing real-variant annotation enables in silico comparison of methods, such as rare-variant association tests and polygenic scoring methods, that focus on putative causal variants. Existing simulation methods are either unable to employ real-variant annotation or severely under- or overestimate the number of singletons and doubletons, thereby reducing the ability to generalize simulation results to real studies. We present RAREsim, a flexible and accurate rare-variant simulation algorithm. Using parameters and haplotypes derived from real sequencing data, RAREsim efficiently simulates the expected variant distribution and enables real-variant annotations. We highlight RAREsim's utility across various genetic regions, sample sizes, ancestries, and variant classes.
