Back to articles
Volume: 32 | Article ID: art00013
End-to-End Multitask Learning for Driver Gaze and Head Pose Estimation
  DOI :  10.2352/ISSN.2470-1173.2020.16.AVM-110  Published OnlineJanuary 2020

Modern automobiles accidents occur mostly due to inattentive behavior of drivers, which is why driver’s gaze estimation is becoming a critical component in automotive industry. Gaze estimation has introduced many challenges due to the nature of the surrounding environment like changes in illumination, or driver’s head motion, partial face occlusion, or wearing eye decorations. Previous work conducted in this field includes explicit extraction of hand-crafted features such as eye corners and pupil center to be used to estimate gaze, or appearance-based methods like Convolutional Neural Networks which implicitly extracts features from an image and directly map it to the corresponding gaze angle. In this work, a multitask Convolutional Neural Network architecture is proposed to predict subject’s gaze yaw and pitch angles, along with the head pose as an auxiliary task, making the model robust to head pose variations, without needing any complex preprocessing or hand-crafted feature extraction.Then the network’s output is clustered into nine gaze classes relevant in the driving scenario. The model achieves 95.8% accuracy on the test set and 78.2% accuracy in cross-subject testing, proving the model’s generalization capability and robustness to head pose variation.

Subject Areas :
Views 18
Downloads 6
 articleview.views 18
 articleview.downloads 6
  Cite this article 

Mahmoud Ewaisha, Marwa El Shawarby, Hazem Abbas, Ibrahim Sobh, "End-to-End Multitask Learning for Driver Gaze and Head Pose Estimationin Proc. IS&T Int’l. Symp. on Electronic Imaging: Autonomous Vehicles and Machines,  2020,  pp 110-1 - 110-6,

 Copy citation
  Copyright statement 
Copyright © Society for Imaging Science and Technology 2020
Electronic Imaging
Society for Imaging Science and Technology
7003 Kilworth Lane, Springfield, VA 22151 USA