4.7 Article

A Low-Latency Communication Design for Brain Simulations

期刊

IEEE NETWORK
卷 36, 期 2, 页码 8-15

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/MNET.008.2100447

关键词

-

资金

  1. National Key Research and Development Program of China [2019YFA0709502]
  2. National Natural Science Foundation of China [61873309, 92146002, 92046024]
  3. Shanghai Science and Technology Innovation Action Plan Project [19510710500, 18510732000]

向作者/读者索取更多资源

This article discusses brain simulation and its application in large-scale computing, focusing on addressing low-latency communication issues in multi-GPU architectures. By proposing a partitioning algorithm and routing method, experimental results demonstrate the effectiveness of this approach in improving communication performance. The article also identifies research directions for low-latency communication design in brain simulations.
Brain simulation, as one of the latest advances in artificial intelligence, facilitates better understanding about how information is represented and processed in the brain. The extreme complexity of the human brain makes brain simulations only feasible on high-performance computing platforms. Supercomputers with a large number of interconnected graphical processing units (GPUs) are currently employed for supporting brain simulations. Therefore, high-throughput low-latency inter-GPU communications in super-computers play a crucial role in meeting the performance requirements of brain simulation as a highly time-sensitive application. In this article, we first provide an overview of the current parallelizing technologies for brain simulations using multi-GPU architectures. Then we analyze the challenges to communications for brain simulation and summarize guidelines for communication design to address such challenges. Furthermore, we propose a partitioning algorithm arid a two-level routing method to achieve efficient low-latency communications in multi-GPU architecture for brain simulation. We report experiment results obtained on a supercomputer with 2000 GPUs for simulating a brain model with 10 billion neurons (digital twin brain, DTB) to show that our approach can significantly improve communication performance. We also discuss open issues and identify some research directions for low-latency communication design for brain simulations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据