Digital Repository

Visual Exploration as Epistemic Inference in Artificial Agents: A Biologically Inspired Augmented State-Space Framework

Show simple item record

dc.contributor.advisor Shimazaki, Hideaki
dc.contributor.author SAJEEV, ALFI
dc.date.accessioned 2026-05-20T10:02:08Z
dc.date.available 2026-05-20T10:02:08Z
dc.date.issued 2026-05
dc.identifier.citation 77 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/11083
dc.description.abstract Despite extensive research into human visual attention, a comprehensive computational model capturing end-to-end free-viewing behavior remains elusive. In this thesis, we address this gap by designing a biologically inspired autonomous agent that reproduces human visual exploration of natural scenes. The agent is built upon two distinct latent-space frameworks: a linear generative Recognition Model for visual processing and a nonlinear Motor Execution Model for proprioceptive saccadic control. Through a developmental analysis of the Recognition Model, we demonstrate that unbiased, random visual exploration is fundamentally required for the emergence of V1 simple-cell-like basis functions, computationally mirroring the biological "critical period" of heightened plasticity. By evaluating the mature agent against human behavioral data, we reveal a dichotomy in visual processing. Saliency analysis indicates that the V1-level generative model is highly robust and sufficient to predict spatial gaze allocation (where humans look) using marginal log-likelihood. However, temporal analysis shows this low-level representation cannot account for fixation durations (how long humans look) . Instead, fixation duration correlates significantly with the model’s residual reconstruction error. Viewed through the lens of predictive coding, this suggests that high-error stimuli necessitate the recruitment of time-intensive, higher-order cognitive processes. Finally, our main sequence analysis demonstrates that the nonlinear Motor Execution Model, utilizing a novel inverse-observation prior framework, successfully reproduces the kinematic trajectories of human saccades. Ultimately, this work provides a rigorous computational framework that successfully bridges early visual development, spatial-temporal attention, and goal-directed motor execution. en_US
dc.description.sponsorship Honda Research Institute Japan Co., Ltd. en_US
dc.language.iso en en_US
dc.subject Theoretical Neuroscience en_US
dc.subject Agent based Modeling en_US
dc.subject Computer Vision en_US
dc.subject State Space Modeling en_US
dc.subject Human Behavioral Modeling en_US
dc.subject Intelligent Systems Modeling en_US
dc.title Visual Exploration as Epistemic Inference in Artificial Agents: A Biologically Inspired Augmented State-Space Framework en_US
dc.type Thesis en_US
dc.description.embargo Two Years en_US
dc.type.degree BS-MS en_US
dc.contributor.department Dept. of Data Science en_US
dc.contributor.registration 20211266 en_US


Files in this item

This item appears in the following Collection(s)

  • MS THESES [2120]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository


Advanced Search

Browse

My Account