4.8 Article

Interplay of approximate planning strategies

Publisher

NATL ACAD SCIENCES
DOI: 10.1073/pnas.1414219112

Keywords

planning; hierarchical reinforcement learning; memoization; pruning

Funding

  1. British Academy
  2. German Research Foundation
  3. Wellcome Trust-National Institutes of Health (NIH)
  4. Sackler Fellowship in Psychobiology
  5. NIH [T32GM007753, F30MH100729]
  6. Swiss National Science Foundation
  7. University of Zurich
  8. Neuroscience Centre Zurich
  9. Gatsby Charitable Foundation

Ask authors/readers for more resources

Humans routinely formulate plans in domains so complex that even the most powerful computers are taxed. To do so, they seem to avail themselves of many strategies and heuristics that efficiently simplify, approximate, and hierarchically decompose hard tasks into simpler subtasks. Theoretical and cognitive research has revealed several such strategies; however, little is known about their establishment, interaction, and efficiency. Here, we use model-based behavioral analysis to provide a detailed examination of the performance of human subjects in a moderately deep planning task. We find that subjects exploit the structure of the domain to establish subgoals in a way that achieves a nearly maximal reduction in the cost of computing values of choices, but then combine partial searches with greedy local steps to solve subtasks, and maladaptively prune the decision trees of subtasks in a reflexive manner upon encountering salient losses. Subjects come idiosyncratically to favor particular sequences of actions to achieve subgoals, creating novel complex actions or options.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available