Kadner, Florian (2024)
Active vision as sequential decision-making under uncertainty.
Technische Universität Darmstadt
doi: 10.26083/tuprints-00026598
Ph.D. Thesis, Primary publication, Publisher's Version
Text
thesis-kadner.pdf Copyright Information: In Copyright. Download (15MB) |
Item Type: | Ph.D. Thesis | ||||
---|---|---|---|---|---|
Type of entry: | Primary publication | ||||
Title: | Active vision as sequential decision-making under uncertainty | ||||
Language: | English | ||||
Referees: | Rothkopf, Prof. Constantin A. ; Hayhoe, Prof. Mary M. | ||||
Date: | 27 February 2024 | ||||
Place of Publication: | Darmstadt | ||||
Collation: | viii, 159 Seiten | ||||
Date of oral examination: | 23 January 2024 | ||||
DOI: | 10.26083/tuprints-00026598 | ||||
Abstract: | Interacting with our visual environment can be challenging due to its highly dynamic nature and richness in complex interrelationships. With the human visual system's constraint of having a narrow field of high resolution, we must actively shift our attention between different visual areas to acquire relevant visual information to accomplish our tasks. Extracting this task-relevant information from our environment can be challenging and further amplified by our world’s inherently probabilistic nature. Sensory perception often presents ambiguities with varying results from identical measurements and vice versa. Similarly, the consequences of our actions are usually governed by uncertainty, which originates from several internal and external factors. Finally, the relevance of completing a particular task or even the definition of the task and its associated costs are highly variable across individuals. Thus, uncertainty is a fundamental factor at multiple stages while interacting with our visual environment. Sensory perception, decision-making, and actions are inseparably intertwined, and it is, therefore, all the more critical that we deal with the arising uncertainties and develop strategies to reduce them as far as possible. Computationally, this aligns with the concept of planning. In this thesis, we are investigating the active nature of visual planning as a probabilistic decision-making process under uncertainty. We designed various experimental paradigms to quantify sensory uncertainty, action variability, and the behavioral costs of human behavior in sequential visual tasks. For this purpose, we use the framework of Partially Observable Markov Decision Processes (POMDPs), which allow us to normatively model decision-making processes by incorporating different sources of uncertainty. Using three case studies, we demonstrate its use, advantages, and possibilities, starting with the most straightforward visual action - blinking. Even this simple action has to be planned since every blink briefly interrupts the visual information stream. We then move on to more complex visual actions such as saccades and gaze selection. First, we consider one-step ahead predictions in the context of free viewing and saliency models before moving on to a complex example of a gaze-contingent paradigm task where, in addition to observations, rewards are dynamic and uncertain. Last, we consider two other studies more detached from the experimental environment and devoted to more natural stimuli. We investigate how humans navigate mazes and their associated planning strategies of eye movements to find the solution. Also, we designed a reading experiment including an adaptive font system that maximizes the subjects' individual reading speed and thus reduces the underlying internal behavioral costs. Our results conclude that human visual behavior should be seen as an active sequential decision process under uncertainty where POMDPs can provide a powerful tool for modeling. |
||||
Alternative Abstract: |
|
||||
Status: | Publisher's Version | ||||
URN: | urn:nbn:de:tuda-tuprints-265989 | ||||
Additional Information: | In reference to IEEE copyrighted material which is used with permission in this thesis, the IEEE does not endorse any of Technical University Darmstadt’s products or services. Internal or personal use of this material is permitted. If interested in reprinting/republishing IEEE copyrighted material for advertising or promotional purposes or for creating new collective works for resale or redistribution, please go to http://www.ieee.org/publications_standards/publications/rights/rights_link.html to learn how to obtain a License from RightsLink. If applicable, University Microfilms and/or ProQuest Library, or the Archives of Canada may supply single copies of the dissertation. |
||||
Classification DDC: | 100 Philosophy and psychology > 150 Psychology | ||||
Divisions: | 03 Department of Human Sciences > Institute for Psychology > Psychology of Information Processing | ||||
TU-Projects: | DFG|RO4337/3-1|Aktives Sehen: Kontr | ||||
Date Deposited: | 27 Feb 2024 13:20 | ||||
Last Modified: | 29 Feb 2024 07:25 | ||||
URI: | https://tuprints.ulb.tu-darmstadt.de/id/eprint/26598 | ||||
PPN: | 515852538 | ||||
Export: |
View Item |