Back to articles
Proceedings Paper
Volume: 38 | Article ID: AVM-100
Image
A Comparative Analysis of Video- and Pose-based Action Recognition for In-cabin Driver Monitoring
  DOI :  10.2352/EI.2026.38.16.AVM-100  Published OnlineMarch 2026
Abstract
Abstract

We present a comparative study of pose-based vs. video-based Human Action Recognition (HAR) methods for driver monitoring in car cockpits. In this context, comparisons of neural network architectures from the field of deep learning-based video understanding are scarce. However, pose- and video-based HAR has significant potential for advanced driver-assistance systems in semi-autonomous driving on public roads. We compare prediction performance, per-class false-negative rate, model size, computational requirements, and inference latency on the established Drive&Act and the proprietary Driver Action Insight datasets. While the diversity and scale of available datasets make comparisons challenging, results suggest that both approaches benefit from pretraining, but pose- and video-based techniques perform differently for specific action classes, such as those that depend on body motion or the appearance of objects.

Subject Areas :
Views 66
Downloads 28
 articleview.views 66
 articleview.downloads 28
  Cite this article 

Lukas Brunner, Dominik Schörkhuber, "A Comparative Analysis of Video- and Pose-based Action Recognition for In-cabin Driver Monitoringin Electronic Imaging,  2026,  pp 100-1 - 100-7,  https://doi.org/10.2352/EI.2026.38.16.AVM-100

 Copy citation
  Copyright statement 
Copyright ©2026 Society for Imaging Science and Technology 2026
ei
Electronic Imaging
2470-1173
2470-1173
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA