Paper
16 July 2021 Human detection from low-resolution video images using 3D convolutional neural network
Hiroki Kanazawa, Yuta Nakamoto, Jiaxin Zhou, Takashi Komuro
Author Affiliations +
Proceedings Volume 11794, Fifteenth International Conference on Quality Control by Artificial Vision; 117941G (2021) https://doi.org/10.1117/12.2589829
Event: Fifteenth International Conference on Quality Control by Artificial Vision, 2021, Tokushima, Japan
Abstract
In this paper, we propose a method for human detection from low-resolution camera images. The proposed method uses video images as input and uses 3D-CNN for classification, which is an extension of 2D-CNN and that can take into account temporal features such as gait motion. In our experiments, we used Caltech Pedestrian Detection Benchmark to make datasets of low-resolution still and video images and compared the performance between 2D-CNN and 3D-CNN. As a result, 3D-CNN with low-resolution video images achieved 91.8 % accuracy rate, 99.0 % precision rate, and 82.8 % recall rate, and showed higher performance than 2D-CNN with low-resolution images, and comparable performance than 2D-CNN with high-resolution images.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hiroki Kanazawa, Yuta Nakamoto, Jiaxin Zhou, and Takashi Komuro "Human detection from low-resolution video images using 3D convolutional neural network", Proc. SPIE 11794, Fifteenth International Conference on Quality Control by Artificial Vision, 117941G (16 July 2021); https://doi.org/10.1117/12.2589829
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

3D image processing

Convolutional neural networks

Cameras

Gait analysis

Image classification

Back to Top