Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift

Stoltzfus, Larisa; Hagedorn, Bastian; Steuwer, Michel; Gorlatch, Sergei; Dubach, Christophe

Published in

Association for Computing Machinery (ACM), ACM Transactions on Architecture and Code Optimization, 4(16), p. 1-25, 2019

DOI: 10.1145/3368858

Journal article published in 2019 by Larisa Stoltzfus, Bastian Hagedorn, Michel Steuwer, Sergei Gorlatch, Christophe Dubach

This paper is made freely available by the publisher.

Abstract

Stencil computations are a widely used type of algorithm, found in applications from physical simulations to machine learning. Stencils are embarrassingly parallel and therefore map well onto modern hardware such as Graphics Processing Units. Although stencil computations have been extensively studied, optimizing them for increasingly diverse hardware remains challenging. Domain-specific Languages (DSLs) have raised the programming abstraction and offer good performance; however, this approach places the burden on DSL implementers to write almost full-fledged parallelizing compilers and optimizers. Lift has recently emerged as a promising approach to achieve performance portability by using a small set of reusable parallel primitives that DSL or library writers utilize. Lift's key novelty is its encoding of optimizations as a system of extensible rewrite rules, which are used to explore the optimization space. This article demonstrates how complex multi-dimensional stencil code and optimizations are expressed using compositions of simple 1D Lift primitives and rewrite rules. We introduce two optimizations that provide high performance for stencils in particular: classical overlapped tiling for multi-dimensional stencils and 2.5D tiling specifically for 3D stencils. We provide an in-depth analysis of how the tiling optimizations affect stencils of different shapes and sizes across different applications. Our experimental results show that our approach outperforms existing compiler approaches and hand-tuned codes.
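To make the idea of overlapped tiling concrete, the following is a minimal 1D sketch in plain Python. It is not Lift code and is not taken from the article. The helper names `pad`, `slide`, `stencil`, and `tiled_stencil`, the clamping boundary handling, and the 3-point sum are all assumptions chosen for illustration. The sketch also assumes the input length is a multiple of the tile size. It shows that cutting the padded input into tiles that overlap by `size - 1` elements yields the same result as the untiled stencil.

```python
def pad(left, right, xs):
    """Clamp-pad: repeat the boundary element on each side (assumed boundary rule)."""
    return [xs[0]] * left + list(xs) + [xs[-1]] * right


def slide(size, step, xs):
    """Return every window of length `size`, advancing by `step` elements."""
    return [xs[i:i + size] for i in range(0, len(xs) - size + 1, step)]


def stencil(xs, size=3):
    """Untiled stencil: pad, form neighbourhoods, then sum each neighbourhood."""
    half = size // 2
    return [sum(w) for w in slide(size, 1, pad(half, half, xs))]


def tiled_stencil(xs, tile=4, size=3):
    """Overlapped tiling: each tile holds tile + size - 1 inputs and yields `tile` outputs.

    Neighbouring tiles share size - 1 input elements, so no neighbourhood
    ever straddles a tile boundary. Assumes len(xs) is a multiple of `tile`.
    """
    half = size // 2
    padded = pad(half, half, xs)
    tiles = slide(tile + size - 1, tile, padded)
    out = []
    for t in tiles:
        # Apply the same 1D stencil independently inside each tile.
        out.extend(sum(w) for w in slide(size, 1, t))
    return out


data = list(range(8))
print(stencil(data))
print(tiled_stencil(data))
```

Both calls print the same list. On a GPU, the point of such a decomposition would be that each tile can be staged in fast on-chip memory and processed by one work-group. The sketch only demonstrates the index arithmetic, not that mapping.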
