MRCNet: Crowd Counting and Density Map Estimation in Aerial and Ground Imagery

27 Sep 2019  ยท  Reza Bahmanyar, Elenora Vig, Peter Reinartz ยท

In spite of the many advantages of aerial imagery for crowd monitoring and management at mass events, datasets of aerial images of crowds are still lacking in the field. As a remedy, in this work we introduce a novel crowd dataset, the DLR Aerial Crowd Dataset (DLR-ACD), which is composed of 33 large aerial images acquired from 16 flight campaigns over mass events with 226,291 persons annotated. To the best of our knowledge, DLR-ACD is the first aerial crowd dataset and will be released publicly. To tackle the problem of accurate crowd counting and density map estimation in aerial images of crowds, this work also proposes a new encoder-decoder convolutional neural network, the so-called Multi-Resolution Crowd Network MRCNet. The encoder is based on the VGG-16 network and the decoder is composed of a set of bilinear upsampling and convolutional layers. Using two losses, one at an earlier level and another at the last level of the decoder, MRCNet estimates crowd counts and high-resolution crowd density maps as two different but interrelated tasks. In addition, MRCNet utilizes contextual and detailed local information by combining high- and low-level features through a number of lateral connections inspired by the Feature Pyramid Network (FPN) technique. We evaluated MRCNet on the proposed DLR-ACD dataset as well as on the ShanghaiTech dataset, a CCTV-based crowd counting benchmark. The results demonstrate that MRCNet outperforms the state-of-the-art crowd counting methods in estimating the crowd counts and density maps for both aerial and CCTV-based images.

PDF Abstract

Datasets


Introduced in the Paper:

DLR-ACD

Used in the Paper:

ShanghaiTech UCF-QNRF UCF-CC-50

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Crowd Counting DLR-ACD MRCNet (ours) MAE 906 # 4
MNAE 0.21 # 5
RMSE 1307.4 # 2
Precision 0.51 # 2
Recall 0.48 # 2
F1-score 0.49 # 1
Crowd Counting DLR-ACD Liu et al MAE 833.3 # 5
MNAE 0.25 # 4
RMSE 1085.9 # 1
Precision 45 # 1
Recall 0.44 # 3
F1-score 0.44 # 3
Crowd Counting DLR-ACD CSRNet MAE 3388.8 # 1
MNAE 0.71 # 3
RMSE 4456.5 # 5
Precision 0.2 # 5
Recall 0.33 # 5
F1-score 0.24 # 5
Crowd Counting DLR-ACD ic-CNN MAE 1481.3 # 3
MNAE 0.72 # 2
RMSE 2087 # 3
Precision 0.44 # 3
Recall 0.52 # 1
F1-score 0.46 # 2
Crowd Counting DLR-ACD MCNN MAE 1989.7 # 2
MNAE 0.87 # 1
RMSE 3016.3 # 4
Precision 0.43 # 4
Recall 0.41 # 4
F1-score 0.39 # 4

Methods