Alexandria Digital Research Library

Modeling Eye Tracking Data with Application to Object Detection

Author:
Shanmuga Vadivel, Karthikeyan
Degree Grantor:
University of California, Santa Barbara. Electrical and Computer Engineering
Degree Supervisor:
B.S. Manjunath
Place of Publication:
[Santa Barbara, Calif.]
Publisher:
University of California, Santa Barbara
Creation Date:
2014
Issued Date:
2014
Topics:
Engineering, Computer, Computer Science, and Engineering, Electronics and Electrical
Keywords:
Visual Saliency
Eye Tracking
Text Detection
Object Detection
Multiple Object Tracking
Genres:
Online resources and Dissertations, Academic
Dissertation:
Ph.D.--University of California, Santa Barbara, 2014
Description:

This research focuses on enhancing computer vision algorithms using eye tracking and visual saliency. Recent advances in eye tracking device technology have enabled large scale collection of eye tracking data, without affecting viewer experience. As eye tracking data is biased towards high level image and video semantics, it provides a valuable prior for object detection in images and object extraction in videos. We specifically explore the following problems in the thesis: 1) eye tracking and saliency enhanced object detection, 2) eye tracking assisted object extraction in videos, and 3) role of object co-occurrence and camera focus in visual attention modeling.

Since human attention is biased towards faces and text, in the first work we propose an approach to isolate face and text regions in images by analyzing eye tracking data from multiple subjects. Eye tracking data is clustered and region labels are predicted using a Markov random field model. In the second work, we study object extraction in videos using eye tracking prior. We propose an algorithm to extract dominant visual tracks in eye tracking data from multiple subjects by solving a linear assignment problem. Visual tracks localize object search and we propose a novel mixed graph association framework, inferred by binary integer linear programming. In the final work, we address the problem of predicting where people look in images. We specifically explore the importance of scene context in the form of object co-occurrence and camera focus. The proposed model extracts low-, mid- and high-level and scene context features and uses a regression framework to predict visual attention map. In all the above cases, extensive experimental results show that the proposed methods outperform current state-of-the-art.

Physical Description:
1 online resource (177 pages)
Format:
Text
Collection(s):
UCSB electronic theses and dissertations
ARK:
ark:/48907/f3sf2tc1
ISBN:
9781321568608
Catalog System Number:
990045118950203776
Rights:
Inc.icon only.dark In Copyright
Copyright Holder:
Karthikeyan Shanmuga Vadivel
File Description
Access: Public access
ShanmugaVadivel_ucsb_0035D_12363.pdf pdf (Portable Document Format)