Ashwin Ramachandran,1 Kartik Gokhale,1 Maike Kripps,2,3 Thomas Deserno2,3
1Indian Institute of Technology (India) 2Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig (Germany) 3Hannover Medical School (Germany)
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
Driver action recognition is essential in vehicle safety and smart car systems targeting vehicles as a diagnostic space. In the context of video-based action recognition, attention-based architectures have surpassed conventional methods in deep learning. Former frameworks have produced excellent results on the public Drive&Act dataset. However, present frameworks do not consider the temporal ordering of frames in the video and the spatial layout of the relevant interacting objects yielding poor performance in certain actions that include semantic reversals. This includes so-called conjugate actions that originate when performed backward in time. An example would be moving the hand rightward vs. leftward. We propose a feature engineering approach to model the motion of human pose. We use key points relevant to the action to incorporate the sequential order. We implement video swin architecture on the Drive&Act dataset. Then, we utilize the histogram of oriented displacements on human joint locations and their displacements and train a support vector machine to classify actions in conjugate pairs. Performance increases in two conjugate actions namely fastening/ unfastening seat belt and taking off/ putting on sunglasses. Integrating our module with existing deep learning models increases the overall accuracy by 3% to 72%. Furthermore, our approach can be extended to other action classes.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
The alert did not successfully save. Please try again later.
Ashwin Ramachandran, Kartik Gokhale, Maike Kripps, Thomas Deserno, "Video-based in-vehicle action recognition for continuous health monitoring," Proc. SPIE 12469, Medical Imaging 2023: Imaging Informatics for Healthcare, Research, and Applications, 124690V (10 April 2023); https://doi.org/10.1117/12.2655116