Back to articles
Articles
Volume: 32 | Article ID: art00013
Image
End-to-End Multitask Learning for Driver Gaze and Head Pose Estimation
  DOI :  10.2352/ISSN.2470-1173.2020.16.AVM-110  Published OnlineJanuary 2020
Abstract

Modern automobiles accidents occur mostly due to inattentive behavior of drivers, which is why driver’s gaze estimation is becoming a critical component in automotive industry. Gaze estimation has introduced many challenges due to the nature of the surrounding environment like changes in illumination, or driver’s head motion, partial face occlusion, or wearing eye decorations. Previous work conducted in this field includes explicit extraction of hand-crafted features such as eye corners and pupil center to be used to estimate gaze, or appearance-based methods like Convolutional Neural Networks which implicitly extracts features from an image and directly map it to the corresponding gaze angle. In this work, a multitask Convolutional Neural Network architecture is proposed to predict subject’s gaze yaw and pitch angles, along with the head pose as an auxiliary task, making the model robust to head pose variations, without needing any complex preprocessing or hand-crafted feature extraction.Then the network’s output is clustered into nine gaze classes relevant in the driving scenario. The model achieves 95.8% accuracy on the test set and 78.2% accuracy in cross-subject testing, proving the model’s generalization capability and robustness to head pose variation.

Subject Areas :
Views 32
Downloads 8
 articleview.views 32
 articleview.downloads 8
  Cite this article 

Mahmoud Ewaisha, Marwa El Shawarby, Hazem Abbas, Ibrahim Sobh, "End-to-End Multitask Learning for Driver Gaze and Head Pose Estimationin Proc. IS&T Int’l. Symp. on Electronic Imaging: Autonomous Vehicles and Machines,  2020,  pp 110-1 - 110-6,  https://doi.org/10.2352/ISSN.2470-1173.2020.16.AVM-110

 Copy citation
  Copyright statement 
Copyright © Society for Imaging Science and Technology 2020
72010604
Electronic Imaging
2470-1173
Society for Imaging Science and Technology
7003 Kilworth Lane, Springfield, VA 22151 USA