Visual Exploration as Epistemic Inference in Artificial Agents: A Biologically Inspired Augmented State-Space Framework

SAJEEV, ALFI

DR Home
→
THESES & PROJECT REPORTS
→
MS THESES
→
View Item

dc.contributor.advisor	Shimazaki, Hideaki
dc.contributor.author	SAJEEV, ALFI
dc.date.accessioned	2026-05-20T10:02:08Z
dc.date.available	2026-05-20T10:02:08Z
dc.date.issued	2026-05
dc.identifier.citation	77	en_US
dc.identifier.uri	http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/11083
dc.description.abstract	Despite extensive research into human visual attention, a comprehensive computational model capturing end-to-end free-viewing behavior remains elusive. In this thesis, we address this gap by designing a biologically inspired autonomous agent that reproduces human visual exploration of natural scenes. The agent is built upon two distinct latent-space frameworks: a linear generative Recognition Model for visual processing and a nonlinear Motor Execution Model for proprioceptive saccadic control. Through a developmental analysis of the Recognition Model, we demonstrate that unbiased, random visual exploration is fundamentally required for the emergence of V1 simple-cell-like basis functions, computationally mirroring the biological "critical period" of heightened plasticity. By evaluating the mature agent against human behavioral data, we reveal a dichotomy in visual processing. Saliency analysis indicates that the V1-level generative model is highly robust and suﬃcient to predict spatial gaze allocation (where humans look) using marginal log-likelihood. However, temporal analysis shows this low-level representation cannot account for fixation durations (how long humans look) . Instead, fixation duration correlates significantly with the model’s residual reconstruction error. Viewed through the lens of predictive coding, this suggests that high-error stimuli necessitate the recruitment of time-intensive, higher-order cognitive processes. Finally, our main sequence analysis demonstrates that the nonlinear Motor Execution Model, utilizing a novel inverse-observation prior framework, successfully reproduces the kinematic trajectories of human saccades. Ultimately, this work provides a rigorous computational framework that successfully bridges early visual development, spatial-temporal attention, and goal-directed motor execution.	en_US
dc.description.sponsorship	Honda Research Institute Japan Co., Ltd.	en_US
dc.language.iso	en	en_US
dc.subject	Theoretical Neuroscience	en_US
dc.subject	Agent based Modeling	en_US
dc.subject	Computer Vision	en_US
dc.subject	State Space Modeling	en_US
dc.subject	Human Behavioral Modeling	en_US
dc.subject	Intelligent Systems Modeling	en_US
dc.title	Visual Exploration as Epistemic Inference in Artificial Agents: A Biologically Inspired Augmented State-Space Framework	en_US
dc.type	Thesis	en_US
dc.description.embargo	Two Years	en_US
dc.type.degree	BS-MS	en_US
dc.contributor.department	Dept. of Data Science	en_US
dc.contributor.registration	20211266	en_US

Files in this item

Name: 20211266_ALFI_SAJ ...

Size: 9.610Mb

Format: PDF

Description: MS Thesis

View/Open

This item appears in the following Collection(s)

MS THESES [2219]
Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository

Advanced Search

Browse

All of Repository
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Department

Visual Exploration as Epistemic Inference in Artificial Agents: A Biologically Inspired Augmented State-Space Framework

Files in this item

This item appears in the following Collection(s)

Search Repository

Browse

All of Repository

This Collection

My Account