4.6 Article

Temporal-Difference Reinforcement Learning with Distributed Representations

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Neurosciences

Low-serotonin levels increase delayed reward discounting in humans

Nicolas Schweighofer et al.

JOURNAL OF NEUROSCIENCE (2008)

Article Neurosciences

The temporal precision of reward prediction in dopamine neurons

Christopher D. Fiorillo et al.

NATURE NEUROSCIENCE (2008)

Article Computer Science, Artificial Intelligence

Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System

Elliot A. Ludvig et al.

NEURAL COMPUTATION (2008)

Review Neurosciences

Is a bird in the hand worth two in the future? The neuroeconomics of intertemporal decision-making

Tobias Kalenscher et al.

PROGRESS IN NEUROBIOLOGY (2008)

Article Multidisciplinary Sciences

Internally generated cell assembly sequences in the rat hippocampus

Eva Pastalkova et al.

SCIENCE (2008)

Article Neurosciences

Dopamine release is heterogeneous within microenvironments of the rat nucleus accumbens

R. Mark Wightman et al.

EUROPEAN JOURNAL OF NEUROSCIENCE (2007)

Article Computer Science, Artificial Intelligence

Multiple model-based reinforcement learning explains dopamine neuronal activity

Mathieu Bertin et al.

NEURAL NETWORKS (2007)

Article Neurosciences

Statistics of midbrain dopamine neuron spike trains in the awake primate

Hannah M. Bayer et al.

JOURNAL OF NEUROPHYSIOLOGY (2007)

Review Neurosciences

Efficient reinforcement learning: computational theories, neuroscience and robotics

Mitsuo Kawato et al.

CURRENT OPINION IN NEUROBIOLOGY (2007)

Review Behavioral Sciences

A review of delay-discounting research with humans: relations to drug use and gambling

Brady Reynolds

BEHAVIOURAL PHARMACOLOGY (2006)

Article Biochemical Research Methods

Humans can adopt optimal discounting strategy under real-time constraints

N. Schweighofer et al.

PLOS COMPUTATIONAL BIOLOGY (2006)

Article Computer Science, Artificial Intelligence

Representation and timing in theories of the dopamine system

Nathaniel D. Daw et al.

NEURAL COMPUTATION (2006)

Review Behavioral Sciences

Neuroeconomics: cross-currents in research on decision-making

AG Sanfey et al.

TRENDS IN COGNITIVE SCIENCES (2006)

Article Multidisciplinary Sciences

Representation of action-specific reward values in the striatum

K Samejima et al.

SCIENCE (2005)

Article Behavioral Sciences

Dopamine, uncertainty and TD learning

Yael Niv et al.

BEHAVIORAL AND BRAIN FUNCTIONS (2005)

Article Neurosciences

Dopamine operates as a subsecond modulator of food seeking

MF Roitman et al.

JOURNAL OF NEUROSCIENCE (2004)

Article Multidisciplinary Sciences

Addiction as a computational process gone awry

AD Redish

SCIENCE (2004)

Article Multidisciplinary Sciences

Separate neural systems value immediate and delayed monetary rewards

SM McClure et al.

SCIENCE (2004)

Article Psychology, Biological

Pathological gambling severity is associated with impulsivity in a delay discounting procedure

SM Alessi et al.

BEHAVIOURAL PROCESSES (2003)

Article Computer Science, Artificial Intelligence

Inter-module credit assignment in modular reinforcement learning

K Samejima et al.

NEURAL NETWORKS (2003)

Article Multidisciplinary Sciences

Discrete coding of reward probability and uncertainty by dopamine neurons

CD Fiorillo et al.

SCIENCE (2003)

Review Neurosciences

Getting formal with dopamine and reward

W Schultz

NEURON (2002)

Article Computer Science, Artificial Intelligence

Dopamine: generalization and bonuses

S Kakade et al.

NEURAL NETWORKS (2002)

Article Computer Science, Artificial Intelligence

Multiple model-based reinforcement learning

K Doya et al.

NEURAL COMPUTATION (2002)

Article Computer Science, Artificial Intelligence

Opponent interactions between serotonin and dopamine

ND Daw et al.

NEURAL NETWORKS (2002)

Article Behavioral Sciences

The role of the hippocampus in trace conditioning: Temporal discontinuity or task difficulty?

AV Beylin et al.

NEUROBIOLOGY OF LEARNING AND MEMORY (2001)

Article Multidisciplinary Sciences

Impulsive choice induced in rats by lesions of the nucleus accumbens core

RN Cardinal et al.

SCIENCE (2001)

Letter Computer Science, Artificial Intelligence

Temporal difference model reproduces anticipatory neural activity

RE Suri et al.

NEURAL COMPUTATION (2001)

Article Psychology

Hyperbolic value addition and general models of animal choice

JE Mazur

PSYCHOLOGICAL REVIEW (2001)

Review Neurosciences

Complementary roles of basal ganglia and cerebellum in learning and motor control

K Doya

CURRENT OPINION IN NEUROBIOLOGY (2000)

Review Psychology

Time, rate, and conditioning

CR Gallistel et al.

PSYCHOLOGICAL REVIEW (2000)