4.3 Article

Sparse triangular solves for ILU revisited: data layout crucial to better performance

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/1094342010389857

Keywords

sparse triangular solve; ILU factorization; matrix-vector product; data access pattern; data layout

Funding

  1. Office of Advanced Scientific Computing Research, Office of Science, US Department of Energy [DE-AC02-06CH11357]

Ask authors/readers for more resources

A key to good processor utilization for sparse matrix computations is storing the data in the format that is most conducive to fast access by the memory system. In particular, for sparse matrix triangular solves the traditional compressed sparse matrix format is poor, and minor adjustments to the data structure can increase the processor utilization dramatically. Such adjustments involve storing the L and U factors separately and storing the U rows 'backwards' so that they are accessed in a simple streaming fashion during the triangular solves. Changes to the PETSc libraries to use this modified storage format resulted in over twice the floating-point rate for some matrices. This improvement can be accounted for by a decrease in the cache misses and TLB (transaction lookaside buffer) misses in the modified code.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available