4.6 Article

MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks

Journal

PEERJ COMPUTER SCIENCE
Volume -, Issue -, Pages -

Publisher

PEERJ INC
DOI: 10.7717/peerj-cs.363

Keywords

Gene regulatory networks; GRN inference; large-scale GRN; Systems biology; Network biology

Funding

  1. Teacher Fellowship of University Grants Commission, Ministry of Human Resources Development, Govt. of India [27-(TF-45)/2015]

Ask authors/readers for more resources

The study introduces a gene regulatory network inference method based on multiple kernel learning, which can run on multi-processor machines and learn large-scale GRNs from multiple heterogeneous datasets. This approach demonstrates superior classification accuracy and enhanced speedup potential compared to other state-of-the-art methods.
High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available