Crowd counting using deep learning based head detection

Maryam  Hassan; Farhan  Hussain; Sultan Daud Khan; Mohib  Ullah; Mudassar  Yamin; Habib  Ullah

doi:10.2352/EI.2023.35.9.IPAS-293

Abstract

Scale invariance and high miss detection rates for small objects are some of the challenging issues for object detection and often lead to inaccurate results. This research aims to provide an accurate detection model for crowd counting by focusing on human head detection from natural scenes acquired from publicly available datasets of Casablanca, Hollywood-Heads and Scut-head. In this study, we tuned a yolov5, a deep convolutional neural network (CNN) based object detection architecture, and then evaluated the model using mean average precision (mAP) score, precision, and recall. The transfer learning approach is used for fine-tuning the architecture. Training on one dataset and testing the model on another leads to inaccurate results due to different types of heads in different datasets. Another main contribution of our research is combining the three datasets into a single dataset, including every kind of head that is medium, large and small. From the experimental results, it can be seen that this yolov5 architecture showed significant improvements in small head detections in crowded scenes as compared to the other baseline approaches, such as the Faster R-CNN and VGG-16-based SSD MultiBox Detector.

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA

10.2352/EI.2023.35.9.IPAS-293

IPAS-293

Article

Crowd counting using deep learning based head detection

HassanMaryam

NUST College of Electrical & Mechanical Engineering, Norway

HussainFarhan

NUST College of Electrical & Mechanical Engineering, Norway

KhanSultan Daud

National University of Technology, Pakistan

UllahMohib

Norwegian University of Science and Technology, Norway

YaminMudassar

Norwegian University of Science and Technology, Norway

UllahHabib

Norwegian University of Life Sciences (NMBU), Norway

Abstract

1612023

IPAS

Image Processing: Algorithms and Systems XXI

293--1

293-6

This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

2023

Object detectionConvolutional Neural NetworksDeep LearningYOLOYolov5PrecisionMean average Precision

articleview.keywords