TU Darmstadt / ULB / TUprints

Catching heuristics are optimal control policies

Belousov, Boris ; Neumann, Gerhard ; Rothkopf, Constantin A. ; Peters, Jan (2022):
Catching heuristics are optimal control policies. (Publisher's Version)
In: Advances in Neural Information Processing Systems 29 : 30th Annual Conference on Neural Information Processing Systems 2016,
Darmstadt, Neural Information Processing Systems, Advances in Neural Information Processing Systems 29 (NIPS 2016), Barcelona, Spain, 05.-10.12.2016, DOI: 10.26083/tuprints-00020556,
[Conference or Workshop Item]

[img] Text
NIPS-2016-catching-heuristics-are-optimal-control-policies-Paper.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (1MB)
Item Type: Conference or Workshop Item
Origin: Secondary publication service
Status: Publisher's Version
Title: Catching heuristics are optimal control policies
Language: English
Abstract:

Two seemingly contradictory theories attempt to explain how humans move to intercept an airborne ball. One theory posits that humans predict the ball trajectory to optimally plan future actions; the other claims that, instead of performing such complicated computations, humans employ heuristics to reactively choose appro- priate actions based on immediate visual feedback. In this paper, we show that interception strategies appearing to be heuristics can be understood as computa- tional solutions to the optimal control problem faced by a ball-catching agent acting under uncertainty. Modeling catching as a continuous partially observable Markov decision process and employing stochastic optimal control theory, we discover that the four main heuristics described in the literature are optimal solutions if the catcher has sufficient time to continuously visually track the ball. Specifically, by varying model parameters such as noise, time to ground contact, and perceptual latency, we show that different strategies arise under different circumstances. The catcher’s policy switches between generating reactive and predictive behavior based on the ratio of system to observation noise and the ratio between reaction time and task duration. Thus, we provide a rational account of human ball-catching behavior and a unifying explanation for seemingly contradictory theories of target interception on the basis of stochastic optimal control.

Book Title: Advances in Neural Information Processing Systems 29 : 30th Annual Conference on Neural Information Processing Systems 2016
Place of Publication: Darmstadt
Publisher: Neural Information Processing Systems
Collation: 9 Seiten
Classification DDC: 000 Allgemeines, Informatik, Informationswissenschaft > 004 Informatik
100 Philosophie und Psychologie > 150 Psychologie
Divisions: 20 Department of Computer Science > Intelligent Autonomous Systems
03 Department of Human Sciences > Institute for Psychology
TU-Projects: EC/H2020|640554|SKILLS4ROBOTS
Event Title: Advances in Neural Information Processing Systems 29 (NIPS 2016)
Event Location: Barcelona, Spain
Event Dates: 05.-10.12.2016
Date Deposited: 18 Nov 2022 14:23
Last Modified: 18 Nov 2022 14:23
DOI: 10.26083/tuprints-00020556
Corresponding Links:
URN: urn:nbn:de:tuda-tuprints-205568
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/20556
PPN:
Export:
Actions (login required)
View Item View Item