4.6 Article

Learning Asynchronous Boolean Networks From Single-Cell Data Using Multiobjective Cooperative Genetic Programming

期刊

IEEE TRANSACTIONS ON CYBERNETICS
卷 52, 期 5, 页码 2916-2930

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.3022430

关键词

Boolean functions; Computational modeling; Data models; Gene expression; Genetic programming; Regulators; Sun; Asynchronous Boolean networks (BNs); cooperative coevolution (CC); gene regulatory networks (GRNs); genetic programming (GP); single-cell data

资金

  1. National Science and Technology Major Special Project on New Drug Innovation of China [2012ZX09503-001-003]
  2. National Key Research and Development Program of China [2018AAA0100300]
  3. Fundamental Research Funds for the Central Universities of China [DUT20YG121]

向作者/读者索取更多资源

This study introduces a novel framework called SgpNet which uses an asynchronous update scheme to infer asynchronous Boolean networks (BNs) from single-cell data while maintaining the sparsity of gene regulatory networks (GRNs). By encoding Boolean functions as trees in genetic programming and evolving them simultaneously through cooperative coevolution, SgpNet efficiently searches the vast solution space. Additionally, a regulator preselection strategy is developed to enhance learning efficiency, with promising scalability to large networks through straightforward parallelization.
Recent advances in high-throughput single-cell technologies provide new opportunities for computational modeling of gene regulatory networks (GRNs) with an unprecedented amount of gene expression data. Current studies on the Boolean network (BN) modeling of GRNs mostly depend on bulk time-series data and focus on the synchronous update scheme due to its computational simplicity and tractability. However, such synchrony is a strong and rarely biologically realistic assumption. In this study, we adopt the asynchronous update scheme instead and propose a novel framework called SgpNet to infer asynchronous BNs from single-cell data by formulating it into a multiobjective optimization problem. SgpNet aims to find BNs that can match the asynchronous state transition graph (STG) extracted from single-cell data and retain the sparsity of GRNs. To search the huge solution space efficiently, we encode each Boolean function as a tree in genetic programming and evolve all functions of a network simultaneously via cooperative coevolution. Besides, we develop a regulator preselection strategy in view of GRN sparsity to further enhance learning efficiency. An error threshold estimation heuristic is also proposed to ease tedious parameter tuning. SgpNet is compared with the state-of-the-art method on both synthetic data and experimental single-cell data. Results show that SgpNet achieves comparable inference accuracy, while it has far fewer parameters and eliminates artificial restrictions on the Boolean function structures. Furthermore, SgpNet can potentially scale to large networks via straightforward parallelization on multiple cores.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据