☆ 4.6 Article

Stochastic Control for Bayesian Neural Network Training

ENTROPY (2022)

期刊

ENTROPY

卷 24, 期 8, 页码 -

出版社

MDPI

DOI: 10.3390/e24081097

关键词

Bayesian inference; Bayesian neural networks; learning

类别

Physics, Multidisciplinary

资金

Deutsche Forschungsgemeinschaft (DFG) [318763901-SFB1294]
BIFOLD-Berlin Institute for the Foundations of Learning and Data [01IS18025A, 01IS18037A]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a method to leverage the Bayesian uncertainty information encoded in parameter distributions to inform the learning procedure for Bayesian models. By deriving a Bayesian stochastic differential equation and applying stochastic optimal control, individually controlled learning rates are obtained for variational parameters. The resulting optimizer shows robustness to large learning rates and can adaptively and individually control the learning rates.

In this paper, we propose to leverage the Bayesian uncertainty information encoded in parameter distributions to inform the learning procedure for Bayesian models. We derive a first principle stochastic differential equation for the training dynamics of the mean and uncertainty parameter in the variational distributions. On the basis of the derived Bayesian stochastic differential equation, we apply the methodology of stochastic optimal control on the variational parameters to obtain individually controlled learning rates. We show that the resulting optimizer, StochControlSGD, is significantly more robust to large learning rates and can adaptively and individually control the learning rates of the variational parameters. The evolution of the control suggests separate and distinct dynamical behaviours in the training regimes for the mean and uncertainty parameters in Bayesian neural networks.

Stochastic Control for Bayesian Neural Network Training

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Stochastic Control for Bayesian Neural Network Training

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文