Article

Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt

Journal

NATURE PROTOCOLS
Volume 4, Issue 8, Pages 1184-1191

Publisher

NATURE PUBLISHING GROUP
DOI: 10.1038/nprot.2009.97

Funding

  1. NCI NIH HHS [U24 CA126551, U24 CA126551-01] Funding Source: Medline
  2. NATIONAL CANCER INSTITUTE [U24CA126551] Funding Source: NIH RePORTER

Genomic experiments produce multiple views of biological systems, among them DNA sequence and copy number variation, and mRNA and protein abundance. Understanding these systems requires integrated bioinformatic analysis. Public databases such as Ensembl provide relationships and mappings between the relevant sets of probe and target molecules. However, the relationships can be biologically complex, and the content of the databases is dynamic. We demonstrate how to use the computational environment R to integrate and jointly analyze experimental datasets, employing BioMart web services to provide the molecule mappings. We also discuss typical problems encountered when making gene-to-transcript-to-protein mappings. The approach provides a flexible, programmable and reproducible basis for state-of-the-art bioinformatic data integration.
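To make the gene-to-transcript-to-protein mapping problem concrete, the following is a minimal, language-neutral sketch in Python. It is not the protocol's R/biomaRt code and does not query BioMart. The lookup tables, all identifiers, and the function name `map_genes_to_proteins` are hypothetical and exist only to illustrate one-to-many and missing mappings.

```python
# Hypothetical toy lookup tables. The identifiers are invented for
# illustration and do not come from Ensembl or any other database.
GENE_TO_TRANSCRIPTS = {
    "geneA": ["txA1", "txA2"],  # one gene, two transcripts (one-to-many)
    "geneB": ["txB1"],          # one transcript, but it is non-coding
    "geneC": [],                # gene known, no transcripts recorded
}
TRANSCRIPT_TO_PROTEIN = {
    "txA1": "protA1",
    "txA2": "protA2",
    "txB1": None,               # non-coding transcript: no protein product
}


def map_genes_to_proteins(gene_ids):
    """Chain gene -> transcript -> protein lookups.

    Returns a dict of gene -> list of proteins, plus a list of the
    genes for which no protein could be found.
    """
    mapping = {}
    unmapped = []
    for gene in gene_ids:
        transcripts = GENE_TO_TRANSCRIPTS.get(gene, [])
        proteins = [TRANSCRIPT_TO_PROTEIN.get(tx) for tx in transcripts]
        proteins = [p for p in proteins if p is not None]
        if proteins:
            mapping[gene] = proteins
        else:
            unmapped.append(gene)
    return mapping, unmapped


mapping, unmapped = map_genes_to_proteins(["geneA", "geneB", "geneC", "geneD"])
print(mapping)
print(unmapped)
```

Even in this toy case, a single gene can map to several proteins, and some identifiers map to none. An integrated analysis has to decide explicitly how to handle both situations rather than assume a one-to-one mapping.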
