Journal
HUMAN GENOME VARIATION
Volume 6, Issue -, Pages -Publisher
SPRINGERNATURE
DOI: 10.1038/s41439-019-0057-7
Keywords
-
Categories
Funding
- Reconstruction Agency
- Ministry of Education, Culture, Sports, Science and Technology (MEXT)
- Japan Agency for Medical Research and Development (AMED) [JP18km0405205, JP18km0105001, JP18km0105002]
- Center of Innovation Program from the Japan Science and Technology Agency (JST)
Ask authors/readers for more resources
In recent genome analyses, population-specific reference panels have indicated important. However, reference panels based on short-read sequencing data do not sufficiently cover long insertions. Therefore, the nature of long insertions has not been well documented. Here, we assembled a Japanese genome using single-molecule real-time sequencing data and characterized insertions found in the assembled genome. We identified 3691 insertions ranging from 100 bps to similar to 10,000 bps in the assembled genome relative to the international reference sequence (GRCh38). To validate and characterize these insertions, we mapped short-reads from 1070 Japanese individuals and 728 individuals from eight other populations to insertions integrated into GRCh38. With this result, we constructed JRGv1 (Japanese Reference Genome version 1) by integrating the 903 verified insertions, totaling 1,086,173 bases, shared by at least two Japanese individuals into GRCh38. We also constructed decoyJRGv1 by concatenating 3559 verified insertions, totaling 2,536,870 bases, shared by at least two Japanese individuals or by six other assemblies. This assembly improved the alignment ratio by 0.4% on average. These results demonstrate the importance of refining the reference assembly and creating a population-specific reference genome. JRGv1 and decoyJRGv1 are available at the JRG website.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available