4.6 Article

Stochastic Control for Bayesian Neural Network Training

期刊

ENTROPY
卷 24, 期 8, 页码 -

出版社

MDPI
DOI: 10.3390/e24081097

关键词

Bayesian inference; Bayesian neural networks; learning

资金

  1. Deutsche Forschungsgemeinschaft (DFG) [318763901-SFB1294]
  2. BIFOLD-Berlin Institute for the Foundations of Learning and Data [01IS18025A, 01IS18037A]

向作者/读者索取更多资源

This paper proposes a method to leverage the Bayesian uncertainty information encoded in parameter distributions to inform the learning procedure for Bayesian models. By deriving a Bayesian stochastic differential equation and applying stochastic optimal control, individually controlled learning rates are obtained for variational parameters. The resulting optimizer shows robustness to large learning rates and can adaptively and individually control the learning rates.
In this paper, we propose to leverage the Bayesian uncertainty information encoded in parameter distributions to inform the learning procedure for Bayesian models. We derive a first principle stochastic differential equation for the training dynamics of the mean and uncertainty parameter in the variational distributions. On the basis of the derived Bayesian stochastic differential equation, we apply the methodology of stochastic optimal control on the variational parameters to obtain individually controlled learning rates. We show that the resulting optimizer, StochControlSGD, is significantly more robust to large learning rates and can adaptively and individually control the learning rates of the variational parameters. The evolution of the control suggests separate and distinct dynamical behaviours in the training regimes for the mean and uncertainty parameters in Bayesian neural networks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据