4.6 Article

Learning Asynchronous Boolean Networks From Single-Cell Data Using Multiobjective Cooperative Genetic Programming

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 52, Issue 5, Pages 2916-2930

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.3022430

Keywords

Boolean functions; Computational modeling; Data models; Gene expression; Genetic programming; Regulators; Sun; Asynchronous Boolean networks (BNs); cooperative coevolution (CC); gene regulatory networks (GRNs); genetic programming (GP); single-cell data

Funding

  1. National Science and Technology Major Special Project on New Drug Innovation of China [2012ZX09503-001-003]
  2. National Key Research and Development Program of China [2018AAA0100300]
  3. Fundamental Research Funds for the Central Universities of China [DUT20YG121]

Ask authors/readers for more resources

This study introduces a novel framework called SgpNet which uses an asynchronous update scheme to infer asynchronous Boolean networks (BNs) from single-cell data while maintaining the sparsity of gene regulatory networks (GRNs). By encoding Boolean functions as trees in genetic programming and evolving them simultaneously through cooperative coevolution, SgpNet efficiently searches the vast solution space. Additionally, a regulator preselection strategy is developed to enhance learning efficiency, with promising scalability to large networks through straightforward parallelization.
Recent advances in high-throughput single-cell technologies provide new opportunities for computational modeling of gene regulatory networks (GRNs) with an unprecedented amount of gene expression data. Current studies on the Boolean network (BN) modeling of GRNs mostly depend on bulk time-series data and focus on the synchronous update scheme due to its computational simplicity and tractability. However, such synchrony is a strong and rarely biologically realistic assumption. In this study, we adopt the asynchronous update scheme instead and propose a novel framework called SgpNet to infer asynchronous BNs from single-cell data by formulating it into a multiobjective optimization problem. SgpNet aims to find BNs that can match the asynchronous state transition graph (STG) extracted from single-cell data and retain the sparsity of GRNs. To search the huge solution space efficiently, we encode each Boolean function as a tree in genetic programming and evolve all functions of a network simultaneously via cooperative coevolution. Besides, we develop a regulator preselection strategy in view of GRN sparsity to further enhance learning efficiency. An error threshold estimation heuristic is also proposed to ease tedious parameter tuning. SgpNet is compared with the state-of-the-art method on both synthetic data and experimental single-cell data. Results show that SgpNet achieves comparable inference accuracy, while it has far fewer parameters and eliminates artificial restrictions on the Boolean function structures. Furthermore, SgpNet can potentially scale to large networks via straightforward parallelization on multiple cores.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available