Exploitation of Gaze Data for Photo Region Labeling in an Immersive Environment (Test talk for the Multimedia Modelling Conference 2014)[go to overview]
Metadata, describing the content of photos, are of high importance for applications like image search or as part of training sets for object detection algorithms. In this work, we apply tags to image regions for a more detailed description of the photo semantics. This region labeling is performed without additional effort for the user, just from analyzing eye tracking data, recorded while users are playing a gaze-controlled game. In the game EyeGrab, users classify and rate photos falling down the screen. The photos are classified according to a given category under time pressure. The game has been evaluated in a study with 54 subjects and the results show that it is possible to assign the given categories to image regions with a precision of up to 61%. This shows that we can perform an almost equally good region labeling in an immersive environment like EyeGrab, compared to a previous classification experiment that was much more controlled.
12.12.13 - 09:15