4.7 Article

ECOGEMS: efficient compression and retrieve of SNP data of 2058 rice accessions with integer sparse matrices

Journal

BIOINFORMATICS
Volume 35, Issue 20, Pages 4181-4183

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz186

Keywords

-

Funding

  1. Key Grant Science and Technique Foundation of Henan Province [161100110500-0102]
  2. Research Start-Up Fund to Topnotch Talents of Henan Agricultural University [30500581]

Ask authors/readers for more resources

We proposed to store large-scale genotype data as integer sparse matrices, which consumed much fewer computing resources for storage and analysis than traditional approaches. In addition, the raw genotype data could be readily recovered from integer sparse matrices. Utilizing this approach, we stored the genotype data of 1612 Asian cultivated rice accessions and 446 Asian wild rice accessions across 8 584 244 SNP sites in the ECOGEMS database with 310 MB of disk usage. Graphical interface for visualization, analysis and download of SNP data were implemented in ECOGEMS, which made it a valuable resource for rice functional genomic studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available