Article

BioDB extractor: customized data extraction system for commonly used bioinformatics databases

Journal

BIODATA MINING
Volume 8

Publisher

BMC
DOI: 10.1186/s13040-015-0067-z

Keywords

Bioinformatics; Biological databases; Customized data retrieval; Data integration; Database cross-linking

Funding

  1. DeitY, MCIT, Govt. of India under the Center of Excellence (CoE) grant
  2. Department of Biotechnology, Govt. of India, under the Center of Excellence grant
  3. DeitY, MCIT, Govt. of India

Background: Diverse types of biological data, primary as well as derived, are available in various formats and are stored in heterogeneous resources. Database-specific as well as integrated search engines are available for carrying out efficient searches of databases. These search engines, however, do not support extraction of subsets of data with the same level of granularity that exists in typical database entries. To extract fine-grained subsets of data, users are required to download complete or partial database entries and write scripts for parsing and extraction.

Results: BioDBExtractor (BDE) has been developed to provide 26 customized data extraction utilities for some of the commonly used databases, such as ENA (EMBL-Bank), UniProtKB, PDB, and KEGG. BDE eliminates the need for downloading entries and writing scripts. BDE has a simple web interface in which users enter a query as accession numbers or ID codes, choose a utility, and select the fields and subfields of data to extract.

Conclusions: BDE thus provides a common data extraction platform for multiple databases and is useful to both novice and expert users. BDE, however, is not a substitute for basic keyword-based database searches. Desired subsets of data, compiled using BDE, can subsequently be used for downstream processing, analyses, and knowledge discovery.
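For context, the kind of parsing script that the Background says users otherwise have to write might look like the minimal Python sketch below. It is not BDE's implementation. The sample record, its accession and identifiers, and the function names parse_entry, extract, and cross_references are all hypothetical, and the two-letter line-code layout is assumed from UniProtKB/ENA-style flat files.

```python
from collections import defaultdict

# Hypothetical, hand-written record in a two-letter line-code flat-file layout
# (assumed from UniProtKB/ENA-style entries); this is NOT real database content.
SAMPLE_ENTRY = """\
ID   EXAMPLE_ENTRY           Reviewed;         120 AA.
AC   X00000;
DE   RecName: Full=Hypothetical example protein;
OS   Examplus hypotheticus.
DR   PDB; 0XXX; X-ray; 2.00 A; A=1-120.
DR   KEGG; xxx:00000; -.
//
"""


def parse_entry(text):
    """Group the lines of one flat-file entry by their two-letter line code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        if line.startswith("//"):  # end-of-entry marker
            break
        code, _, value = line.partition(" ")
        if len(code) == 2 and value.strip():
            fields[code].append(value.strip())
    return dict(fields)


def extract(fields, wanted):
    """Return only the requested line codes, in the order asked for."""
    return {code: fields.get(code, []) for code in wanted}


def cross_references(fields, database):
    """Collect identifiers from DR (cross-reference) lines for one database."""
    hits = []
    for value in fields.get("DR", []):
        parts = [p.strip() for p in value.rstrip(".").split(";")]
        if len(parts) > 1 and parts[0] == database:
            hits.append(parts[1])
    return hits


fields = parse_entry(SAMPLE_ENTRY)
subset = extract(fields, ["AC", "OS"])      # field-level selection
pdb_ids = cross_references(fields, "PDB")   # subfield-level selection
print(subset)
print(pdb_ids)
```

Each database has its own entry format, so a script like this has to be rewritten per resource; that per-format effort is what the abstract describes BDE as removing.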
