4.4 Article

Toward Bayesian Data Compression

期刊

ANNALEN DER PHYSIK
卷 533, 期 3, 页码 -

出版社

WILEY-V C H VERLAG GMBH
DOI: 10.1002/andp.202000508

关键词

Bayesian statistics; data compression; Gaussian likelihood; information theory; lossy compression; signal reconstruction

资金

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy [EXC-2094 - 390783311]
  2. German Federal Ministry of Education and Research (BMBF) [05A17PB1]

向作者/读者索取更多资源

Efficient compression algorithms, such as the Bayesian data compression (BDC) algorithm, are necessary for handling large datasets in modern science. BDC adapts to specific measurement situations during signal reconstruction, minimizing information loss while conserving posterior structure. The algorithm's applicability has been demonstrated through synthetic and radio astronomical data, but further improvements are needed to address computational time challenges.
In order to handle large datasets omnipresent in modern science, efficient compression algorithms are necessary. Here, a Bayesian data compression (BDC) algorithm that adapts to the specific measurement situation is derived in the context of signal reconstruction. BDC compresses a dataset under conservation of its posterior structure with minimal information loss given the prior knowledge on the signal, the quantity of interest. Its basic form is valid for Gaussian priors and likelihoods. For constant noise standard deviation, basic BDC becomes equivalent to a Bayesian analog of principal component analysis. Using metric Gaussian variational inference, BDC generalizes to non-linear settings. In its current form, BDC requires the storage of effective instrument response functions for the compressed data and corresponding noise encoding the posterior covariance structure. Their memory demand counteract the compression gain. In order to improve this, sparsity of the compressed responses can be obtained by separating the data into patches and compressing them separately. The applicability of BDC is demonstrated by applying it to synthetic data and radio astronomical data. Still the algorithm needs further improvement as the computation time of the compression and subsequent inference exceeds the time of the inference with the original data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据