Deep learning approaches for crowd counting in images

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Deep learning approaches for crowd counting in images Jiang, Minyang

Abstract

Crowd analysis has been widely used in everyday life. Among different crowd analysis tasks, crowd counting is the most basic but essential one that measures the number of people in a particular area. Such counting and crowd density information is crucial to determine the maximum occupancy of a room or public area to address safety concerns. Counting by hand can yield an accurate number, but it is a tedious job and may not fulfill the time requirement for analysis. Therefore, automated crowd counting for accurately estimating the number of people in crowded scenes is needed. Deep learning has shown beyond human-level accuracy in many computer vision tasks. Recently, researchers have shown that deep learning methods can achieve the state-of-the-art performance in the task of crowd counting in images. The majority of these methods are based on density regression, where a density map is predicted and its integral is further calculated to obtain the final count. However, this learned density map is uninterpretable and could deviate largely from the actual person distribution even if the final count is accurate. Besides, existing crowd counting methods are mainly based on cumbersome feature extraction networks and can not be deployed in edge devices with limited computational power. In this thesis work, we proposed two models that tackle the above interpretability and computational efficiency concerns respectively: the joint crowd counting and localization model extends traditional counting only methods and provides precise localization results without additional model complexity; the ShuffleCount is a computationally efficient and accurate model that is trained through the specially designed knowledge distillation process and can satisfy the real-time crowd analysis requirement on edge devices. We evaluated both models by training and testing on public crowd counting benchmark datasets. Both quantitative and qualitative results are obtained and compared with existing state-of-the-art methods. Our superior results show the potential of deploying the proposed methods to real-life applications for efficient crowd analysis. The methods proposed in both JCCL and ShuffleCount can be generalized to other ideas to improve the interpretability and computational efficiency of the models.

Item Metadata

Title	Deep learning approaches for crowd counting in images
Creator	Jiang, Minyang
Publisher	University of British Columbia
Date Issued	2021
Description	Crowd analysis has been widely used in everyday life. Among different crowd analysis tasks, crowd counting is the most basic but essential one that measures the number of people in a particular area. Such counting and crowd density information is crucial to determine the maximum occupancy of a room or public area to address safety concerns. Counting by hand can yield an accurate number, but it is a tedious job and may not fulfill the time requirement for analysis. Therefore, automated crowd counting for accurately estimating the number of people in crowded scenes is needed. Deep learning has shown beyond human-level accuracy in many computer vision tasks. Recently, researchers have shown that deep learning methods can achieve the state-of-the-art performance in the task of crowd counting in images. The majority of these methods are based on density regression, where a density map is predicted and its integral is further calculated to obtain the final count. However, this learned density map is uninterpretable and could deviate largely from the actual person distribution even if the final count is accurate. Besides, existing crowd counting methods are mainly based on cumbersome feature extraction networks and can not be deployed in edge devices with limited computational power. In this thesis work, we proposed two models that tackle the above interpretability and computational efficiency concerns respectively: the joint crowd counting and localization model extends traditional counting only methods and provides precise localization results without additional model complexity; the ShuffleCount is a computationally efficient and accurate model that is trained through the specially designed knowledge distillation process and can satisfy the real-time crowd analysis requirement on edge devices. We evaluated both models by training and testing on public crowd counting benchmark datasets. Both quantitative and qualitative results are obtained and compared with existing state-of-the-art methods. Our superior results show the potential of deploying the proposed methods to real-life applications for efficient crowd analysis. The methods proposed in both JCCL and ShuffleCount can be generalized to other ideas to improve the interpretability and computational efficiency of the models.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2021-04-22
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0396905
URI	http://hdl.handle.net/2429/77900
Degree (Theses)	Master of Applied Science - MASc
Program (Theses)	Electrical and Computer Engineering
Affiliation	Applied Science, Faculty of; Electrical and Computer Engineering, Department of
Degree Grantor	University of British Columbia
Graduation Date	2021-05
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Deep learning approaches for crowd counting in images Jiang, Minyang

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights