4.5 Article

High-throughput Ant Colony Optimization on graphics processing units

期刊

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jpdc.2017.12.002

关键词

Agnostic vectorization; ACO; TSP; GPUs; Atomic operations

资金

  1. EU FP7 NoE HiPEAC [IST-217068]
  2. European Network of Excellence on High Performance and Embedded Architecture and Compilation
  3. Fundacion Seneca (Agenda Regional de Ciencia y Tecnologia, Region de Murcia) [18946/JLI/13]
  4. Spanish MEC [TIN2016-78799-P]
  5. European Commission FEDER [TIN2016-78799-P]

向作者/读者索取更多资源

Nowadays, computer researchers can face ever-complex scientific problems by using a hardware and software co-design. One successful approach is exploring novel massively-parallel Natural-inspired algorithms, such as the Ant Colony Optimization (ACO) algorithm, through the exploitation of high-throughput accelerators such as GPUs, which are designed to provide high levels of parallelism and low Energy per instruction (EP) cost through heavy vectorization. In this paper, we demonstrate how to take advantage of contemporary hardware-based CUDA vectorization to optimize the ACO algorithm when applied to the Traveling Salesman Problem (TSP). Several parallel designs are proposed and analyzed on two different CUDA architectures. Our results reveal that our vectorization approaches can obtain good performance on these architectures. Moreover, atomic operations are under study showing good benefits on latest generations of CUDA architectures. This work lays the groundwork for future developments of ACO algorithm on high-performance platforms. (C) 2017 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据