☆ 4.7 Article

Decoupling Algorithms from Schedules for Easy Optimization of Image Processing Pipelines

ACM TRANSACTIONS ON GRAPHICS (2012)

Journal

ACM TRANSACTIONS ON GRAPHICS

Volume 31, Issue 4, Pages -

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/2185520.2185528

Keywords

Image Processing; Compilers; Performance

Funding

Quanta T-Party
NSF [0964004, 0964218, 0832997]
DOE [DE-SC0005288]
U.S. Department of Energy (DOE) [DE-SC0005288] Funding Source: U.S. Department of Energy (DOE)
Direct For Computer & Info Scie & Enginr
Div Of Information & Intelligent Systems [964004] Funding Source: National Science Foundation
Division of Computing and Communication Foundations
Direct For Computer & Info Scie & Enginr [0832997] Funding Source: National Science Foundation
Div Of Information & Intelligent Systems
Direct For Computer & Info Scie & Enginr [0964218] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Using existing programming tools, writing high-performance image processing code requires sacrificing readability, portability, and modularity. We argue that this is a consequence of conflating what computations define the algorithm, with decisions about storage and the order of computation. We refer to these latter two concerns as the schedule, including choices of tiling, fusion, recomputation vs. storage, vectorization, and parallelism. We propose a representation for feed-forward imaging pipelines that separates the algorithm from its schedule, enabling high-performance without sacrificing code clarity. This decoupling simplifies the algorithm specification: images and intermediate buffers become functions over an infinite integer domain, with no explicit storage or boundary conditions. Imaging pipelines are compositions of functions. Programmers separately specify scheduling strategies for the various functions composing the algorithm, which allows them to efficiently explore different optimizations without changing the algorithmic code. We demonstrate the power of this representation by expressing a range of recent image processing applications in an embedded domain specific language called Halide, and compiling them for ARM, x86, and GPUs. Our compiler targets SIMD units, multiple cores, and complex memory hierarchies. We demonstrate that it can handle algorithms such as a camera raw pipeline, the bilateral grid, fast local Laplacian filtering, and image segmentation. The algorithms expressed in our language are both shorter and faster than state-of-the-art implementations.

Decoupling Algorithms from Schedules for Easy Optimization of Image Processing Pipelines

Journal

ACM TRANSACTIONS ON GRAPHICS

Publisher

ASSOC COMPUTING MACHINERY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Decoupling Algorithms from Schedules for Easy Optimization of Image Processing Pipelines

Journal

ACM TRANSACTIONS ON GRAPHICS

Publisher

ASSOC COMPUTING MACHINERY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper