Related references
Note: Only part of the references are listed.Semi-Markov decision processes with limiting ratio average rewards
Sagnik Sinha et al.
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS (2017)
A neural model of hierarchical reinforcement learning
Daniel Rasmussen et al.
PLOS ONE (2017)
Neural inverse reinforcement learning in autonomous navigation
Chen Xia et al.
ROBOTICS AND AUTONOMOUS SYSTEMS (2016)
A Cognitive Model Based on Neuromodulated Plasticity
Jing Huang et al.
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2016)
The Q-learning obstacle avoidance algorithm based on EKF-SLAM for NAO autonomous walking under unknown environments
Shuhuan Wen et al.
ROBOTICS AND AUTONOMOUS SYSTEMS (2015)
The skinner automaton: A psychological model formalizing the theory of operant conditioning
Ruan XiaoGang et al.
SCIENCE CHINA-TECHNOLOGICAL SCIENCES (2013)
A bioinspired neural network for real-time concurrent map building and complete coverage robot navigation in unknown environments
Chaomin Luo et al.
IEEE TRANSACTIONS ON NEURAL NETWORKS (2008)
A neural network approach to complete coverage path planning
SX Yang et al.
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS (2004)
Learning obstacle avoidance with an operant behavior model
DA Gutnisky et al.
ARTIFICIAL LIFE (2004)