4.5 Article

Numerically Stable Coded Matrix Computations via Circulant and Rotation Matrix Embeddings

期刊

IEEE TRANSACTIONS ON INFORMATION THEORY
卷 68, 期 4, 页码 2684-2703

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIT.2021.3137266

关键词

Coded computation; Vandermonde matrix; condition number; numerical stability

资金

  1. National Science Foundation (NSF) [CCF-1718470, CCF-1910840, DOI: 10.1109/ISIT45174.2021.9517750]

向作者/读者索取更多资源

Polynomial based methods can mitigate the effect of stragglers in distributed matrix computations. However, they suffer from serious numerical issues. This research proposes a novel approach using circulant permutation matrices and rotation matrices for coded matrix computation, and demonstrates an upper bound on the condition number of the recovery matrices.
Polynomial based methods have recently been used in several works for mitigating the effect of stragglers (slow or failed nodes) in distributed matrix computations. For a system with n worker nodes where s can be stragglers, these approaches allow for an optimal recovery threshold, whereby the intended result can be decoded as long as any (n - s) worker nodes complete their tasks. However, they suffer from serious numerical issues owing to the condition number of the corresponding real Vandermonde-structured recovery matrices; this condition number grows exponentially in n. We present a novel approach that leverages the properties of circulant permutation matrices and rotation matrices for coded matrix computation. In addition to having an optimal recovery threshold, we demonstrate an upper bound on the worst-case condition number of our recovery matrices which grows as approximate to O(n(s+5.5)); in the practical scenario where s is a constant, this grows polynomially in n. Our schemes leverage the well-behaved conditioning of complex Vandermonde matrices with parameters on the complex unit circle, while still working with computation over the reals. Exhaustive experimental results demonstrate that our proposed method has condition numbers that are orders of magnitude lower than prior work.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据