4.8 Article

Modeling and Planning Under Uncertainty Using Deep Neural Networks

期刊

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
卷 15, 期 8, 页码 4442-4454

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2019.2917520

关键词

Bayesian neural networks; trajectory optimization; uncertainty; variational inference

向作者/读者索取更多资源

Artificial neural networks (ANNs) have been frequently used in industrial applications to model complex systems. However, using traditional ANNs for longterm planning tasks remains a challenge as they lack the capability to model uncertainty. Process noise and approximation errors cause ANN long-term estimations to deviate from the real behavior of the system. Unlike traditional ANNs, stochastic models provide a natural way to model uncertainty, providing estimations over a range of several possible outcomes. This paper introduces a stochastic modeling and planning approach using deep Bayesian neural networks (DBNNs). We use DBNNs to learn a stochastic model of the system dynamics. Planning is addressed as an open-loop trajectory optimization problem. We present two approaches for learning the dynamics: using single-step predictions and using multistep predictions. The advantages of the proposed methodology are as follows. First, accurate long-term estimations of the system state-trajectory probability distribution without the need for expert knowledge of the dynamics. Second, improved generalization and faster convergence rates in the trajectory optimization task when using multistep predictions to train the model. Third, viable for real-world applications since all expensive optimizations are executed offline while using a reasonable number of data samples. Testing is performed using challenging underactuated benchmark problems: the Cartpole and the Acrobot. The presented methodology successfully learns the swing-up maneuver using a relatively small number of iterations, with less than 125 sampled trajectories, and without any expert knowledge of the dynamics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据