Documente Academic
Documente Profesional
Documente Cultură
Graduation report
D. Duives
March 30th, 2012
pARTIES iNVOLVED
D.C. (Dorine) Duives
stud.nr: 1312677
m.:
d.c.duives@gmail.com
t.:
+31 (0) 6 46 24 1883
Supervisory team
Chair:
Prof. dr. ir. S.P. (Serge) Hoogendoorn
m.:
S.P.Hoogendoorn@tudelft.nl
t.:
+31 (0)15 27 85475
Daily advisor:
Dr. ir. W. (Winnie) Daamen
m.:
W.Daamen@tudelft.nl
t.:
+31 (0) 15 27 85927
External member:
Prof. Dr. Ir. B. (Ben) Gorte
m.:
B.G.H.Gorte@tudelft.nl
t.:
+31 (0)15 27 81737
Graduation Coordinator:
Ir. P.B.L. (Paul) Wiggenraad
m.:
P.B.L.Wiggenraad@tudelft.nl
t.:
+31 (0) 15 27 84916
(Geomatics - CEG)
D.Duives
Preface
Reading guide
Motion has always moved me. This fascination grew from ant hives
when I was a little girl, to pedestrian movements in contemporary days.
The seemingly random movement of individuals form logical patterns at
the scale of crowds. Until four years ago I did not know it was possible
to study these transportation patterns. During an extracurricular research
program, prof. dr. ir. Hoogendoorn brought me into contact with the field
of transportation, which captivated me from the start. In the field of
transportation, individual movements can be anticipated but human crowd
behaviour remains a mystery. Religious, music and sports events can still go
horribly wrong. During my MSc. thesis at Northwestern University I found
out that also the Dutch regulation on major events involving pedestrian
crowds is limited and in some parts inadequate. Not a lot is known about
how the individual behaviour causes crowds to organise. Therefore the
movements of pedestrian crowds remain almost impossible to simulate
due to the sheer lack of knowledge.
The report lying in front of you can either be read in detail or in a skimming
fashion. Each chapter starts with an indication of the work to be presented
within that chapter and ends with a summary of the results. Both the
introductions and summaries are highlighted by light blue text boxes.
For a general gesture of the proceedings in this master thesis reading
these boxes should be sufficient. An in-depth explanation of research
motives, research methodologies and data analysis can be found within
the chapters themselves.
This thesis has been divided in four parts. The first part (A) reviews the
reasoning behind the set-up of this project. It reviews the background,
problem statement, research questions, objectives and methodologies
used in Chapters 1 and 2. Part B (chapters 3, 4 and 5) elaborates on the
state-of-the-art of pedestrian movements and tracking techniques. In
the third part (C) (chapters 6 - 8) the necessary software development is
commented upon. Based on the literature review and a global analysis of
the video data, the list of data analysis software requirements is posed in
chapter 6. In chapter 7 the actual development of the analysis software
is reviewed. The usage of the software is validated in chapter 8. Part D
answers the research question by means of the analysis of pedestrian
movement within the footage in chapter 10. Subsequently a theory is
proposed on the relation between the macroscopic flow characteristics
and the lane-formation characteristics in chapter 11. A general discussion
and recommendations for further research close this master thesis in
chapter 12.
D.Duives
III
Executive summary
Introduction
Several pedestrian crowd events that became disasters occurred in
the last decade. Most disasters occurred due to a combination of high
densities, small passage ways, panic and the determination of people to
get somewhere. Most of these dangerous events were either religious
festivals, sport or music events. Even though pedestrian crowd events
are organised frequently, not a lot is known about pedestrian crowd
movement. One of the big pop festivals in the Netherlands (Lowlands)
offered the Delft University of Technology (DUT) an opportunity to gather
unique data of undisturbed pedestrian crowd movements by means
of UAV. However, the capabilities of the UAV recording technique for
pedestrian movement analysis are unknown. Furthermore no software is
available to analyze the UAV footage.
Research set-up
The main research objective of this MSc. thesis is to develop a theory
that qualitatively and quantitatively describes why transition(s) between
self-organising crowd movement regimes happen. To do so, this research
investigates which macroscopic flow parameters (walking velocity,
occuring density or flow) influence the transition(s) between selforganising pedestrian crowd movement regimes and tries to describe
these transition(s) qualitatively and quantitatively. To answer the main
research questions, equipment, software, theory and data analysis related
issues needed to be solved.
To limit the scope of the research, this MSc. thesis only considers the
pedestrian movement seen within the footage recorded at the circulation
area of Lowlands and the influence of technology used to record the
D.Duives
Theory development
Visible movement in the stabilized footage, the slight changes in pixel
colour between consecutive frames and problems with high density
detection made a few adaptations of the macroscopic flow parameters
calculation methods necessary. Another velocity calculation method was
proposed to decrease the error introduced by the previous mentioned
issues. Also a Kalman filter is implemented to increase tracking robustness.
Furthermore, the macroscopic variables are filtered in order to exclude
stationary pedestrians.
Using all methods described above, the pedestrian crowd movement
behavior found at Lowlands 2011 has been analyzed. Walking velocities
between 0 and 1.8 m/s were found. The densities range from 0 - 4 P/m2.
Mainly two forms of self-organisation were encountered (no organisation
and lane-formation). Because both sequences 7 and 8 better represent
the previous empirically found fundamental diagrams for goal-oriented
pedestrian behaviour, only these two sequences are used in the remaining
analyses. Because especially lane formation is present in the sequences,
the focus in the remainder of this thesis is on lane formation. To describe
the exact lane formation present in the footage two characteristics are
used:
the number of lanes at a given cross-section
the width of the lane a pedestrian is currently experiencing
Both characteristics are deduced from a macroscopic velocity calculation
first proposed by Helbing et al. (2007).
It was hypothesised that there is a correlation between between the
macroscopic flow characteristics and lane-formation characteristics.
The correlations between both groups of characteristics are analyzed.
The results are displayed in table S.1. In the table
and
represents
D.Duives
respectively the gradient and the intercept of the linear regression line,
the standard deviation and Tb the used test-statistic.
Table S.1. Results of the regression analysis
significant?
number of lanes vs. velocity
0.0088
0.85
0.10
2.01
yes
-00.15
0.89
0.10
-1.04
no
-0.019
1.57
0.36
-2.31
yes
0.024
1.28
0.35
9.16
yes
-0.0019
1.28
0.44
-0.2
no
0.018
1.12
0.44
6.30
yes
D.Duives
XI
Table of contents
List of Figures
List of Tables
List of Graphs
List of Symbols
XIV
XVI
XVII
XVIII
19
19
19
20
21
23
24
25
29
29
29
31
33
35
35
35
37
45
45
47
51
52
54
4.6 Calibration effort os pedestrian simulation models
Chapter 5. Tracking techniques
5.1 Object detection techniques
5.2 Object tracking techniques
5.3 Direct estimation of macroscopic flow characteristics
59
63
63
67
70
73
75
75
77
79
79
85
87
87
87
90
90
92
95
95
95
97
98
99
99
99
99
100
D.Duives
7.6.5 Trajectory calculation
7.6.6 Scatter plot calculations
7.7 Final software package
Chapter 8. Software evaluation
8.1 Review list of requirements
8.2 Usability research
8.3 Calculation speed
8.4 Robustness w.r.t. erroneous input
8.5 Conclusion
101
101
103
105
105
109
109
110
110
113
115
115
115
116
117
117
117
119
121
121
123
125
127
129
129
131
XI
lIST OF FIGURES
Figure 1:
Figure S.1:
Figure S.2:
Figure S.3:
II
X
XI
XI
Figure A.1:
Chapter 1:
Figure 1.1:
Figure 1.2:
Figure 1.3:
Figure 1.4:
17
18
20
22
26
Chapter 2:
Figure 2.1:
Figure 2.2:
Figure 2.3:
Figure B.1:
Chapter 3:
Figure 3.1:
Figure 3.2:
Figure 3.3:
Figure 3.4:
Figure 3.5:
Figure 3.6:
Figure 3.7:
Figure 3.8:
Figure 3.9
Figure 3.10:
Figure 3.11:
Figure 3.12:
40
42
42
Chapter 4:
Figure 4.1:
Figure 4.2:
Figure 4.3:
Figure 4.4:
Figure 4.5:
45
46
46
48
48
Square - (source:Hachiko)
Example point detector
Example background modelling
Example segmentation
Taxonomy of tracking methods
Kalman filter
62
63
63
63
66
68
Hadj
73
74
75
66
66
66
77
77
28
28
30
33
Chapter 5:
Figure 5.1:
Figure 5.2:
Figure 5.3:
Figure 5.4:
Figure 5.5:
Figure 5.6:
34
36
36
36
36
38
38
40
40
Figure C.1:
Chapter 6
Figure 6.1:
Figure 6.2:
Figure 6.3:
Figure 6.4:
Figure 6.5:
Figure 6.6:
Figure 6.7:
Trajectory example
Explanation variables used by Hoogendoorn (2007)
Analysis trajectory data - names variables
Non-homogeneous space-time region
Trajectory example
Pedestrian trajectories
Example addition of vectors
Visual specification of equation 3.14 - 3.16
Problem equation 3.19
Figure 6.8:
Figure 6.9:
Figure 6.10:
Figure 6.11:
Figure 6.12:
Figure 6.13:
Figure 6.14:
Figure 6.15:
Chapter 7:
Figure 7.1:
Figure 7.2:
Figure 7.3:
Figure 7.4:
Figure 7.5:
Figure 7.6:
Figure 7.7:
Figure 7.8:
Figure 7.9:
Figure 7.10:
Figure 7.11:
Figure7.12:
Figure 7.13:
Figure 7.14:
Figure 7.15:
Figure 7.16:
Figure 7.17:
Figure 7.18:
D.Duives
78
78
78
78
80
80
82
83
86
88
88
88
92
92
92
94
94
94
96
98
98
98
100
100
102
102
Figure 7.19:
Figure 7.20:
Figure 7.21:
Figure 7.22:
Figure 7.23:
Figure 7.24:
102
102
103
103
103
104
Chapter 8:
Figure 8.1:
110
113
114
116
116
118
120
120
122
122
123
123
124
125
125
126
Figure D.1:
Chapter 9:
Figure 9.1:
Figure 9.2:
Figure 9.3:
Figure 9.4:
Chapter 10:
Figure 10.1:
Figure 10.2:
Figure 10.3:
Figure 10.4:
Figure 10.5:
Figure 10.6:
Figure 10.7:
Figure 10.8:
Figure 10.9:
Figure 10.10:
XV
lIST OF tABLES
Figure 10.11:
Figure 10.12:
128
128
Chapter 5:
Table 5.1:
Table 5.2:
66
66
Chapter 11:
Figure 11.1:
Figure 11.2:
Figure 11.3:
Figure 11.4:
Figure 11.5:
140
140
140
142
142
Chapter 12:
Figure 12.1:
Figure 12.2:
Figure 12.3:
Figure 12.4:
Figure 12.5:
Figure 12.6:
Figure 12.7:
144
146
146
148
148
156
152
Chapter 11:
Table 11.1:
Table 11.2:
Table 11.3:
Table 11.4:
Table 11.5:
Table 11.6:
Table 11.7:
Table 11.8:
Researched correlations
Hypothesis test lane-formation vs. walking velocity
Regression analysis number of lanes vs. density
Regression analysis width of lanes vs. density
Regression analysis the density vs. width of lanes
Regression analysis the number of lanes vs. flow
Regression analysis the width of lanes vs. flow
Summary regression analysis
133
135
137
137
139
139
139
143
lIST OF gRAPHS
D.Duives
Chapter 4:
Graph 4.1:
Graph 4.2:
Graph 4.3:
Graph 4.4:
Graph 4.5:
Chapter 9:
Graph 9.1:
Graph 9.2:
Graph 9.3:
Graph 9.4:
Graph 9.5:
Chapter 10:
Graph 10.1:
Graph 10.2:
Graph 10.3:
Graph 10.4:
Graph 10.5:
Graph 10.6:
Graph 10.7:
Graph 10.8:
Graph 10.9:
Graph 10.10:
Graph 10.11:
Graph 10.12:
50
50
50
50
52
Graph 10.13:
Graph 10.14:
130
130
Chapter 11:
Graph 11.1:
Graph 11.2:
Graph 11.3:
Graph 11.4:
Graph 11.5:
Graph 11.6:
Graph 11.7:
132
132
134
134
137
138
138
120
121
122
122
124
124
124
126
126
126
127
130
XVII
List of Symbols
A
A0
d i
Di
ei
f(x)
Fi(t)
h i
k
l
Li
mb
me
n
n a
nv
q
ri
R
R0
R2
si
t
T
UL
Uij
Area of gridcell
[m2]
Pedestrian type specific constant
[-]
Distance covered by pedestrian i
[m]
Destination of pedestrian i
[-]
Direction of pedestrian i
[-]
Function of x
[-]
Summation of forces working on i at time t
[-]
Time headway of pedestrian i
[s]
Density
[P/m2]
number of lanes
[-]
Proximity costs incurred by pedestrian i
[-]
Number of positions at moment tbegin
[-]
Number of positions at movement tend
[-]
Number of objects
[-]
Number of arrivals at cross-section at x1
[-]
Number of departures at cross-section x0
[-]
Flow
[P/m/s]
Period pedestrian i is present
[s]
Influence
[-]
Pedestrian type specific constant
[-]
Linear regression - coefficient of determination [-]
Distance headway of pedestrian i
[m]
Linear regression - Estimator
[-]
Time interval
[s]
Time instance
[s]
Time interval
[s]
Linear regression - test statistic
[-]
Local mean velocity
[m/s]
Difference in location between ped i and object j[m]
Velocity of vehicle/pedestrian i
[m/s]
w
x
xi
X
yi
Acceleration constant
Linear regression - correlation intercept
Linear regression - correlation gradient
Linear regression - correlation coefficient
Relative constant longitudinal vs. lateral acceleration
Standard deviation
Reaction time of pedestrian i
Potential of attraction of pedestrian i
Directional effect
[m]
[m]
[-]
[m]
[-]
[-]
[-]
[-]
[-]
[-]
[-]
[-]
[-]
[s]
[-]
[-]
17
Figure A.1: Shibuya crossing pedestrian flow (source: Panaramio - Chrisjongkind)
D.Duives
D.Duives
19
1.2 Lowlands
Also in the Netherlands big pedestrian events with massive pedestrian
attendance are organized (Amstel Gold Race, Bavaria CityRacin, Pinkpop,
etc.). The regulation for pedestrian crowd events is almost non-existent
in the Netherlands. The few rules that do exist are unclear and only
account for safe egress and overall attendance. The organization of
pedestrian crowds events is helped by a better understanding of the
underlying pedestrian crowd movements that determine whether a
festival safe or actually deadly. One of the music festivals (Lowlands) has
given the DUT access to do pedestrian crowd movement research.
Lowlands is a music festival that over the course of three days presents
a diverse range of entertainment, i.e. music, literature, film, cabaret,
theatre, etc. The festival was founded in 1967 and has since 1993 been
held at the event terrain Walibi Holland at Biddinghuizen (Wikipedia,
2011). In 2011 Lowlands is held on 19 21 of August. 55.000 visitors
attended (Zantigh, 2011). The grounds consist of camping and festival
terrains. Lowlands has several stages and tents, both in- and outdoor
performances were scheduled. The 19th edition of the three-day
festival was organized by the event organizer loc7000 in combination
with concert promoter MoJo Concerts. The Lowlands terrains covers
140 hectares, accommodates 3000 employees, 20.000 m2 of podia
and 50.000 m2 of festival tents (Jorritsma (2009)). The in-house safety
is guaranteed by 500 security employees and 20 security cameras.
The main acts are held at the GROLSCH, ALPHA and BRAVO tents (see
fig. 1.2). At Lowlands high density crowds are frequently observed.
Therefore Lowlands renders a unique possibility to analyze high density
crowd movements and improve our knowledge on crowd behaviour.
1.3.2 Sub-problems
The main problem can be split in several distinct technology and theory
related questions.
Technology related
Very high density behaviour is rarely observed due to lack of stable video
coverage that can be afterwards analyzed. A new specialist methodology
is needed that is able to film major pedestrian events without interfering
with the actual crowd movement. In this experiment a high velocity
high resolution airborne camera has been used. But as this technique
has not been used for the evaluation of pedestrian crowd movements
before, the implementation of this new technique introduces a lot of
uncertainties (airborne instabilities, unknown GPS location and unknown
altimetry).
1.3.1 Problem
But when and why do regimes present themselves? And what parameters
influence there coming into existence? Simulation by means of pedestrian
models can be used to predict the possible hazardous locations at crowd
events and as such can help prevents crowd crushes from happening.
However, most existing models are not valid anymore under densities
where these self-organising movements are presented. To calibrate
the models for these situations a better qualitative and especially
quantitative description of the macroscopic pedestrian crowd movement
regimes is needed. Furthermore an, at least empirical, quantification of
the transitions point between pedestrian crowd movement regimes is
necessary in order to give crowd managers a tool/guideline to act upon.
Ultimately a model/theory would be the sought answer.
Unlike CCTV (Closed Circuit TeleVision) footage, a high flying air borne
camera gives far less information about each person within a frame.
As such matching individuals across images is a challenge. Because the
images consist predominantly of pedestrian movements, the current
vehicular recognition software cannot be adopted. Adaptation is needed
to account for the continuous 3D-movement (x,y-location and time),
instead of the largely 2D-movement of vehicles. Also in contrast to
vehicles, pedestrians have no rectangular shape, but organic and noncircular as the images were recorded under a slight angle. Due to the
small amount of information per pedestrian and the huge number of
pedestrians per frame also existing pedestrian tracking methods do not
function optimal. Therefore also a new tracking method needs to be
developed.
21
D.Duives
Theory related
The theories describing crowd movements currently range from
microscopic movement descriptions to macroscopic fluid and wave
descriptions. But whether any of the simulation models can actually
predict high density crowd movement is still an unsolved question. There
is no unified theory on how individuals interact within a crowd, and on
how the crowd as a whole moves. To the writers knowledge, most of
the models that are currently on the market and/or used in practice
for prediction, give inaccurate results pedestrian crowd movement
simulations (Helbing 2005). Furthermore, the qualitative description of
the higher density regimes is very limited. Also the description of the
regimes needs to be improved.
There are no quantitative guidelines describing up to what crowd
densities the current models are valid/calibrated. Furthermore, the
usability of the existing pedestrian crowd movements theory is limited
due to coarse descriptions and questionable assumptions about
pedestrian local operative behaviour. Therefore, most contemporary
models have limited applicability in very dense situations.
1.4 Objective(s)
The general objective of this MSc. project can be described as the following:
To build a theory that qualitatively and quantitatively describes the transition(s) between self-organising pedestrian crowd movements regimes.
Sub-objectives
The project-based sub-objectives are formulated as follows:. This project will
1.
explore the new airborne camera technique (UAV)
The objective is to gather unique crowd movement footage at a music festival.
2.
explore new data exploration techniques
The objective is to acquire the most precise data possible from the gathered footage.
3.
describe pedestrian crowd behaviour based on macroscopic flow characteristics
4.
describe the quantitative differences between self-organising crowd movement regimes found at Lowlands
5.
define the transition point(s) between self-organising crowd movements regimes at Lowlands
23
D.Duives
Theory related
How can the macroscopic flow parameters be defined for pedestrian multi-directional movement?
What are self-organising crowd movements?
How are pedestrian movements currently described in simulation models?
Which techniques can be used to translate pedestrian movement footage into pedestrian movement data?
f.
g.
Software related
Can the unstable distorted footage be stabilized, orthorectified and radiometrically corrected?
Can software be developed that translates pedestrian crowd movement footage into pedestrian movement data?
h.
i.
j.
k.
l.
m.
n.
Data related
How are the crowd movements at Lowlands described by means of the macroscopic flow parameters?
What kind of pedestrian self-organising crowd movements are visible at the festival terrain?
Is there a relation between the macroscopic flow parameters and the forms of self-organising crowd movements found?
Can this relation be described qualitatively?
Can the relation be described quantitatively?
Can a theory be proposed describing the relation between the macroscopic flow parameters and the forms of selforganizing crowd movements?
Can transition regimes be indicated?
1.6.4 Boundaries
1.6.3 Assumptions
The weather conditions are assumed to be of no influence on the
pedestrian crowd movements that were present in the footage shot
at the Lowlands festival.
The alcohol usage of the visitor of Lowlands has no influence on their
movement behaviour at the moment that the footage was shot.
The behaviour seen in the footage is supposed to be representative
for the general crowd movements at the Lowlands festival.
1.6.2 Constraints
Only the at Lowlands recorded footage is analyzed in the remainder
of this research.
At Lowlands only two specific areas could be covered by the UAV.
D.Duives
25
1.6.5 Context
Combining the above, it can be said that the context of this project
consists of only the pedestrian behaviour visible within the footage at the
circulation areas of Lowlands and the influence of the technology used to
record the footage on the quality of the footage.
27
27
D.Duives
29
When the software was deemed ready for usage, the second process was
(the definition of the crowd movement theory) started. The second part
started with the qualitative and quantitative analysis of the pedestrian
movement in the footage. Because this second analysis could only be
done after the analysis tool is finished, the second process was shifted in
time and was initialized at the end of the first part. The analysis consists
of the in-depth review of the behaviour found in the data and the possible
reasons for the existence of these trends. Accordingly a provisional
theory has been proposed. However, due to the complicated nature of
the analysis software design and the short period available for this MSc.
thesis, the proposed theory has not been validated.
In figure 2.3 the total followed methodology is depicted in a structure
scheme. The chapters in which each of the steps is treated is displayed
next to the name of the step. The structure scheme does not display the
time necessary to complete each of the blue steps.
In the methodology a regular design process was followed for the design of
the analysis software. This started with a short literature study on detection
and tracking methods. Afterwards the synthesis and simulation contain
the stabilization of the images as well as all other operations necessary
to transform the footage into a workable data set. This tool should, when
finished, at least be able to transform the footage from Lowlands into
usable quantitative data on pedestrian crowd movement, possibly also
other footage recorded by UAV. The exact form of this data will be specified
throughout the software synthesis phase in chapter 7. The feedback loop
between evaluation and synthesis is afterwards used to bring more or less
detail in the synthesis of the analysis software.
D.Duives
31
33
Figure B.1: A packed Alpha tent at Lowlands (source: Heideroosjes Blog, 2012)
D.Duives
3.1
3.2
D.Duives
3.2.1 Flow
Flow, which is defined as the number of vehicles passing a cross-section of
a road in a unit of time, is determined by
(3.1)
Where n represents the number of vehicles passing during the unit of
time T and h the mean time headway. In the case of vehicular traffic the
flow is represented as one flow per direction.
3.2.2 Density
The number of vehicles present on a unit of road length at a given moment
is called density. The density is computed using:
(3.2)
Where n represents the number of vehicles present at road section with
length X and si the mean distance headway.
3.2.3 Velocity
Velocity can be determined in two ways. First of all by averaging the
velocities vi of vehicles passing a cross-section during a certain period of
time, the so-called local mean velocity UL.
(3.3)
However, when using the velocities of vehicles vj present on a road section
at a given moment, the arithmetic mean is denoted as the space mean
velocity.
35
D.Duives
Edies formulation is applicable to all arbitrary closed surfaces in the timespace plane. By means of the vehicle coordinates at the border of the
time-space region the production and travel time can be calculated using
the following two formula.
(3.8)
(3.9)
See figure 3.3 for a visual representation of equations 3.8 and 3.9. The
formulation above is limited to homogeneous space-time regions in
which all vehicles progress with on average unchanged velocity. There
one only records the positions where vehicles enter and exit the spacetime region, the measured average velocities will be lower than the
actual instantaneous velocity throughout a part the space-time region
(fig. 3.4). Besides that, the density and flow change causes a distortion of
the fundamental diagram. Using homogeneous space-time zones such as
indicated in fig. 3.5 solves the difficulties.
D.Duives
(3.15)
See figure 3.3 & 3.8 for a visual representation of the difference in
interpretation. The length of the trajectory is seen as the travelled distance
within the time-space zone minus the travelled distance outside the timespace zone during the time interval.
For their work at the Jamarat bridge Helbing et al. (2007) proposed a
39
more intuitive way of presenting local velocities. Their method weights
D.Duives
would result in comparing characteristics that are not defined for the same
area, which would provide inconsistent results.
Another solution is provided by Helbing et al. (2007). Instead of defining
an area over which the density is calculated, density is calculated using
a Gaussian distance-dependent weight function. See fig. 3.12 for a visual
explanation of the method. The formula is the following:
(3.23)
Fig. 3.11: Example Voronoi diagram calculation
Thereby both density and velocity can be calculated at any point within
a region without losing consistency between the two characteristics.
Helbing et al.s method allows for a continuous estimation of both density
and velocity. This solves the problem of dependence of the classic density
formulation on gridcell locations. However, the usage of the influence
function is questionable, since the found densities depend severely on
the radius of influence and the influence function chosen. Furthermore,
compared to the classic definition of density (eq. 3.22), the by Helbing et al.
(2007) proposed function underestimates the density. Since the influence
of surrounding pedestrians decreases exponentially with the distance, only
a pedestrian exactly at the location of density determination accounts for a
whole pedestrian. The remainder of detected pedestrians within the radius
of influence is accounted for as less than a whole person. Therefore the
method has no advantages over the classic density calculation method.
The density formulation proposed by Edie for traffic flow is to the writers
knowledge the only formulation that provides the means to also record
flow in the same formulation as the other two characteristics. But, idem
to the formulation of velocity, also in this case adaptations are necessary
to provide a density value that makes sense. For the density especially
the denominator has to be adapted. Looking at the manner in which
the formula is derived, the denominator describes the total area of the
D.Duives
43
D.Duives
D.Duives
4.2.3 Bottlenecks
At bottlenecks the effects seen dependent on the length of the bottleneck.
The compression in the crowd is related to shock waves that produce the
impression in the crowd that it is moving forward while this may in fact
not be the case. The clogging of a bottleneck leads to irregular or even
stopped outflows with an oscillation in the movement (Helbing et al.
(2006)).
Besides clogging also the zipper-effect (Hoogendoorn 2005) is observed.
In this situation pedestrians will allow others within the territorial space 47
just in front of them, as long as the direct space in front of their feet is still
clear. It allows for smaller lanes than the expected width of a pedestrians
territorial zone. In bottlenecks with a moderate width of less than 3
meters, the capacity of the bottleneck increases in a stepwise fashion
with the width of the bottleneck.
Side Note
We have searched for a quantitative description of self-organising
movements. But the quest was fruitless. To the writers knowledge
no researcher has linked the existing flow patterns to quantitative
macroscopic flow parameters or other even given a quantitative
description one of the patters.
49
D.Duives
Graph 4.3: Summary of the fundamental diagrams found by Fruin, Ando et al,
Predtechenshkii & Millinskii by Thompson and Marchant
Graph 4.2: Average pedestrian fundamental diagram calculated from 25 district investigations for pedestrians walking normally at-grade, deduced by Weidmann
D.Duives
51
D.Duives
part I
Direction of movement
The fluid crowd modelling method of Henderson (1984) has been the
starting point of development of the Social Force model (Helbing et al.
(1995) and Helbing et al. (2001)). This is a microscopic continuous model
with deterministic force-based interactions. The concept is based on the
assumption that changes in the movement of pedestrians are guided by
social force fields. Sensory stimuli causes behavioural reactions which
depend on personal characteristics. The model assumes pedestrians mostly
face standard conditions. Therefore they will apply optimized behavioural
strategies that they have learned over time such as cooperation and
object avoidance. The main attributions of his model are:
(5.1)
(5.2)
(5.3)
(5.4)
(5.5)
Total Model
D.Duives
(5.8)
(5.9)
(5.10)
(5.11)
55
The total effect of the social force model is the summation of the
three effects described above combined with a certain form of random
fluctuation (also called white noise) (eq. 5.11). The last can be seen as the
incorporation of randomness in the behaviour of the modelled pedestrians.
The model describes behaviour of pedestrians through superposition of
generalized forces reflecting motivations and environmental influences.
The actual movement and velocity of pedestrians is thus based on
the macroscopic behaviour of the crowd and less on the personal
characteristics of the pedestrian (e.g. congestion patience and velocities)
(eq. 5.12-5.14). The system searches for the lowest amount of entropy/
energy/interaction it can achieve.
Because this model assumes that a pedestrian feels the interactions
with every other pedestrian around him, every iteration consists of an
optimization of the total pedestrian behaviour in that step. This causes
very long calculation times.
The input of this model are destinations, free walking velocities, max
direction of sight, vision field shape parameters and behavioural
characteristics. The output are the changes in walking velocity and
locations, which accordingly can be used to calculate the walking velocity
and locations of every pedestrian during every time step. In the socialforce model the time step size only controls the accuracy of the movement
of pedestrians and the numerical stability of the model, but is not related
to their adopted walking velocity.
Putting all this together, the notion of this model can be summarized
as the superposition of attracting and repulsive effects determining
the behaviour of individuals. The exact effects might differ between
Walker model
Optimal velocity
Velocity pedestrian
Acceleration ped.
where:
(5.15 -5.16)
(5.17)
(5.18)
(5.19 - 5.21)
D.Duives
59
that should display the underlying behavior is dubious at the most in all
pedestrian models.
All models depend to a large extend on the assumed behaviour of
pedestrians, that in turn depends greatly on the fundamental diagram
used. However, the literature review elaborating on the pedestrian
fundamental diagram showed that it is very hard to establish which of
the proposed fundamental diagrams represents walking behaviour best.
At the moment, every time a simulation model is used, calibration has to
be done all over again, since it is unknown how the individual pedestrian
characteristics influence the fundamental diagram. By understanding
the behaviour of pedestrians better, the models better simulate reality.
Therefore more empirical research into pedestrian crowd movements is
necessary. Not only small experiments (<20 persons) but also at the level
of crowds (>300 persons).
D.Duives
63
The first main step of image tracking is the detection of moving objects.
Mainly four different methods are used:
5.1.5 Conclusion
In practice only one of the methods is used for pedestrian detection, being
the background modelling (Khan&Shah (2009), Velastin et al. (1994), etc.).
Via background modelling blobs/contours are constructed that indicate
pixels of the moving objects with respect to the surrounding picture. There
the foreground is a 2 dimensional binary matrix the required data storage
is very limited. Because the results are good, also this MSc. research will
use foreground modelling to detect pedestrians automatically.
65
D.Duives
Table 5.1: Comparison of Point Trackers (#: number of objects, M: multiple objects, S: single object)
Table5.2: Qualitative comparison of geometric Model-Based Trackers (#: number of objects, M: multiple objects, S: single object)
Among the first line are several scientists that use background subtraction
in combination with Kalman filters to track pedestrians in consecutive
images. A Kalman filter uses a predefined dynamics model, known
control inputs to the system and the measurements/data gathered from
the system to form an prediction of the true value within the system
(Wikipedia A. (2011)). Most Kalman filters in pedestrian tracking assume
that a person doesnt alter its current movement direction and velocity
drastically. Whereby an initial velocity estimation of 0 m/s is used when
a pedestrian is first detected. See figure 5.6 for a visual explanation. The
more information the Kalman filter uses, the more reliable the results
become. In general the combination of the background method with a
Kalman filter works reasonably well for individual moving objects in a
still background. But for instance Stauffer & Grimson (2002) found that
this method cannot track two individuals walking in contact in the same
direction. By hanging the camera vertically above a crowd, this problem
D.Duives
69
of
macroscopic
flow
5.3.1 Density
Pedestrian density estimation has evolved in two distinct directions.
The scientist along the first direction wanted to estimate the number of
pedestrians solely based on the image information. For instance (Khan
2009), who have shown how crowd densities can be estimated from
digitised aerial photographs by measuring the number of edge pixels in the
images. Also (Velastin 1994) determined two pixel-based methods, namely
thinned edges and Background removal. The first being a computationally
efficient three-pixel-neighbourhood edge detector is applied to a
single original image. The latter method is based on subtraction from a
background-only image which, ideally, identifies the pixels occupied by
moving objects/pedestrians. In both methods a relationship between
the number of pixels and the manually counted pedestrians is computed
and used to in the online density estimation. Also (Yin 1995) adopted
this second method. Both Kalman filtering and geometric correction are
necessary for these methods to work properly. The proposed methods
all suffer reliability when the scaling differs between locations in the
same image. Furthermore an extensive manual calibration is necessary in
order to deduce the general formula translating pixels into the number of
moving objects. Additionally this method assumes that all moving objects
5.3.3 Flow(Q)
Intensity calculations can also be calculated using tracking files, this
method will not be further discussed underneath. But when no trajectory
calculations are available a virtual gate can be used. Ishii et al. (1987)
devised a technique to measure bi-directional pedestrian flow from plain
view images. Other researchers have proposed a pixel count based method
for pedestrian flow intensity estimation (Lee, Kim & Kim 2007). The method
deduces low-level features around the virtual gate. This data is then used
to estimate the blob size passing through the gate. By counting the pixels,
the number of people passing through the gate is then estimated. The
method uses foreground segmentation, motion vector computation and
pixel normalization to prepare for the counting. Since the method does
not require top-view sequences, it does not need specific camera settings
and can be used in conjunction with existing CCTV systems.
But even though the method showed desirable results in test data, it
was also revealed that the method cannot deal with heavy occlusion.
Therefore also flow will be estimated using the trajectory data.
71
71
D.Duives
73
Figure C.1: Hadj
D.Duives
75
D.Duives
Figure 6.8: Image from first data recording sequence, approximately 4:42 p.m.
Figure 6.11: Image from last data recording sequence approximately 8:19 p.m.
Clouds are a second manner in which the spectral power of the sun
decreases. During the data collection a slight haze was continually present.
As such, lightning shifts due to clouding are not visible within the footage.
Because the haze decreased the spectral power of the sun slightly, also no
cast shadows of pedestrians were visible. The differences in light intensity
cause that all image within a sequence cannot be compared based on the
same background image. As such at least one background image needs to
be established for each sequence. And because sundown has a fairly big
effects on the light intensity, the generation of a dynamic background is
necessary.
Figure 6.13 depicts the layout of the visible terrain. The green and yellow
areas are respectively grassy sitting/listening areas and sanitary units or
food courts. The red areas are the pedestrian movement area. In this
area the pedestrians move from the green and yellow areas through the
red areas to the main tent or exits. The pedestrian movement area, the
pedestrian area itself is max. 35 meters wide, and about 150 meters long.
A single pixel in the images is +/- 10 cm by 10 cm, depending on the image
sequence. Due to the relatively small dimensions of a singular pedestrian
there is only limited information available on each pedestrian. Figure 6.13
is a zoomed in part of figures 6.8, which display two images of the same
sequence recorded 15 frames apart. The two red boxes in figure 6.11,
show the cut-out areas shown in figure 6.12. At this level of detail, it is
clearly visible that the footage has been shot under an angle, because the
sides of the pedestrians are visible. Each pedestrian occupies on average
79
10 x 20 pixels. The singular pedestrians are mainly distinguishable through
D.Duives
When looking at the two figures two problems become visible. First of
all, at low densities (such as in fig. 6.14) there is almost no occlusion,
except for when people travel in close-knit groups. For example, parents
with children. As such the whole individual is visible. In figure 6.15 quite
the opposite is happening. Especially in cut 1 people are standing almost
shoulder to shoulder. Every individual body is almost completely obscured
by other bodies in its surrounding. Only the head, and possibly shoulders
are visible. Therefore the area one individual occupies in number of pixels
in figure 6.14 is far smaller than in figure 6.15. This means that depending
on the density, the expected number of pixels a person occupies is actually
decreasing. This fact makes detection of the number of people based on
solely the number of moving pixels impossible.
Secondly, due to the small amount of pixels available per individual at high
densities, the exact number of individuals becomes very hard to determine.
And because the height of every pedestrian is different, the areas that
they theoretically occupy are also different. This might give complications
during the synthesis. Furthermore due to the high occlusion within the
high density regions automated detection might proof very difficult. The
software needs to be able track a pedestrian whose contour is not only
moving and changing but which might also be partly obscured.
The analysis software needs to be able to record the number of people in
one moving throng based on the number of moving pixels and based on
the exact location in the image/ the texture of the moving pixels. However,
it would even be better if the software is able to track singular individuals,
there a trajectory file would solve all density calculation problems at once.
The program does not need to be able do detect the whole individual,
only register that a moving object is a pedestrian.
81
D.Duives
83
List of Requirements
Essential requirements
Desired functionalities
The last division in the list of requirements is based on who posed each
requirement:
based on experience ( )
conclusion of the literature review ( )
conclusion from footage analysis ( )
the client/TU Delft (
)
Those will only be displayed for the required list of requirements, there
they are deemed less important for the desired functionalities. The
requirements can be found on the left page.
The list is split between the necessary functionalities and the desired
functionality of the analysis software. The list can also be split between
the functionalities wanted because of the following data analysis (DA)
and the functionalities wanted because of the usability of the software by
me (UM) and by other users (UO). Both symbols will be displayed behind
each of the requirements. The distinction is made to indicate the relative
strength of each requirement.
85
D.Duives
D.Duives
only cuts of the .mts movie and have the same defects w.r.t. stabilization,
lens calibration, etc.
87
D.Duives
narrowing the step size the search reaches an optimum. The calculated
rectification is accordingly applied to the raw image.
The radiometric correction of the image sequence consists of the
adaptation of brightness and contrast w.r.t. the reference image. However,
due to the stable ambient conditions, radiometric correction was not
deemed necessary for the Lowlands footage.
The last step of the process is the generation of the final stabilized image
and saving of these images into the sequence. Via the three processes
described above the raw image sequence is transformed into a stabilized
sequence with the same number of images as the raw sequence. The
output sequence also has the same extension as the input sequence.
Further explanation of the ImageTracker software can be found in a paper
by P. Knoppers (2012) .
7.2.3 Assumptions
ImageTracker assumes that the observed scene is flat, which is valid for
the pedestrian movement areas but not for the tents and objects in the
image. Therefore a stabilization error is included in the data. Especially the
curved sides of the ALPHA tent introduces this error, due to z-translations
of the UAV. These errors need to be accounted for in the analysis of the
data.
Furthermore it is assumed that the brute-force search algorithm finds the
global optimum, while brute force algorithms are quite well known for
missing or shooting over the global optimum; only finding a local optimum
instead. However, in practice this does not happen often according to P.
Knoppers.
89
If all weights are equal the solution is the median of the surrounding
pixels. In regions without boundaries this faster weighting method
could be adopted without performance loss. The motion boundaries are
detected using a Sobel edge detector. Accordingly using a dilation of the
edges with a 5x5 mask, the flow boundary regions are found. In these
regions a weighted version of w is used in a 15x15 pixels region. In the
non-boundary regions, an equal weight is used per 5x5 neighbourhood,
which computes the median of the pixels motion. As such the method
weights the neighbours adaptively in an extended image region.
region remains the same despite its position change. This assumes that
changing the distance to the light source has no effect, the object does
not rotate and there are no secondary sources of illumination. While this
is seen as unrealistic, the assumption works well in practice (Fleet 2006).
Additionally, the spatial smoothness constraint states that neighbouring
pixels generally belong to the same surface and so have a nearly identical
image motion. But for small pedestrian movements this last assumption
might not hold completely. Also under strongly fluctuating light conditions
this assumption is violated. However, since the Lowlands footage does
not experience fluctuating light conditions, and the pedestrian movement
does not significantly influence the assumption, the resulting deviation is
deemed not significant.
7.3.2.4 Data output
The data output consists of a flow-vector which describes for each pixels of
the current image, what its flow direction and velocity are with respect to
the previous image. This is saved as a -mat file of 1000x1200X2 doubles
(see fig 7.5). The final presented results in the software see figure 7.6.
7.3.2.3 Assumptions
The goal of optical flow estimation is to compute an approximation to
the motion field. Each optical flow method exploits the brightness
constancy and spatial smoothness constraints of an image (Sun 2008).
The brightness constancy constraint is derived from the observation that
surfaces usually persist over time and therefore the intensity of a small
D.Duives
91
Figure7.7: from l.t.r.t.t.b. original image, generated background, generated foreground, filled-in foreground
background images are created for each time step in the sequence.
The foreground is accordingly determined by subtracting the calculated
background from the original image. Pixels locations that differ more than
an amount of 10 in all three primary additive colours are supposed to
be part of the foreground image. The pixel locations that belong to the
foreground of a single frame are accordingly saved in a separate MAT-file,
where the title displays the specifics of the foreground image.
7.3.3.3 Assumptions
The light conditions at Lowlands were changing throughout the afternoon
because light intensity is diminishing. Therefore the width of the catchment
area has been taken reasonably small (1000 frames/ ~ 20 seconds). This is
big enough to filter besides the moving pedestrians also most still standing
pedestrians from the image sequence. But also small enough to cope with
the velocity of the light changes. What the largest width of the catchment
area is, is not known. There the calculation velocity is only dependent
on the decision whether or not to use a dynamic/changing background,
the optimal catchment area is defined as the smallest area where the
most pedestrians are filtered from the background images. Reasoning is
that the smaller the area, the better the background adapts to the light
intensity change.
D.Duives
93
Legenda Flowcharts
D.Duives
Three tabs are used to load these pieces of information. In each of the
tabs, one has the possibility to visualize the information in order to check
their correctness.
7.4.2 Detection
The detection of moving objects (for this master thesis it is assumed that
all moving objects are pedestrians) in a sequence is possible in two ways.
Either via manual detection or by using an automated procedure that
detects a major part of the moving objects within a pre-specified image.
Both procedures are visualized in figure 7.9.
7.4.2.1 Input
The input of the manual detection consists of only the stabilized image.
For the computer detection also the foreground images are loaded into
the function.
7.4.2.2 Method
During the manual detection the user will be asked to indicate the
approximated position of a not-yet-selected pedestrian within the
stabilized images of the current time step. Accordingly an enlarged version
of the indicated location will present itself, also in this image the user
is prompted to select the location of the same pedestrian. This location
is logged as the current location of that pedestrian in the picture. Also
the colour of the selected pixel is recorded. This is necessary to improve
tracking.
95
The automated detection procedure uses the assumption that all moving
4.3 Tracking
The tracking procedure is visualized in figure 7.10. For each tracking
iteration the whole procedure is run. The software program knows three
different methods of calculating trajectories.
1 - NEXT the procedure runs once in a forward direction
2 - PREVIOUS - the procedure runs once in backward direction
3 - TOTAL the procedure iteratively runs forward from the first time
step through all time steps. Accordingly the method runs backward
through all time steps from the last image to the beginning only
updating the tracking file where necessary. Thereby after 1 total run
all detected pedestrians are tracked through all uploaded frames, even
though they might not have been detected in the first image where
they enter the detection area.
is not found due to occlusion or rapid changing pixel colour. Rapid changing
pixel colours cannot be followed since this would allow the software to
shift focus too easily. But occlusion generally occurs only for short periods
of time (2 - 3 seconds) in the footage. By allowing the software program
to lose a pedestrian and tracking its predicted location based on the last
known velocity, the software has the possibility to find the pedestrian
again when it reappears after the occlusion. When the pedestrian has not
been found for longer than two seconds, the pedestrian is accounted for
as lost. A disadvantage of the tracking of predicted locations is the risk of
mismatching reappearing pedestrians. However, unless pedestrians wear
shirts with almost the same colour, the tracking method can distinguish
between pedestrians.
7.4.3.1 Input
The input of this method is the tracking file developed during the detection
phase.
7.4.3.3 Output
The only output of the tracking procedure is an updated tracking file.
Depending on the method used (NEXT, PREVIOUS or TOTAL) the tracking
file contains only the extra tracking information of the next time step, or
the tracking information of all currently detected pedestrians throughout
the whole sequence.
7.4.3.2 Method
During the tracking procedure pedestrians are tracked based on their
current position, the colour mixture of the pixel, the velocity vector at
their current position and the foreground image. In the procedure the new
location is first predicted based on an addition of their current position and
the specified velocity vector field. In the second step, the actual location
is found by means of a local search (within a 20x20 pixel region) for the
approximate correlating pixel colour nearest to the predicted location.
Thereby assuming that the indicated colour of the pedestrian is unique
within the local search area. When a pedestrian is found, its new location
is saved in the tracking file. However, it is also possible that the pedestrian
D.Duives
97
Besides detection and tracking, two extra functions have been added to
help the interaction between software and use. Where DELETE deletes
the total trajectory of a specified pedestrian, SHIFT allows the user to shift
the focus of the software on the pedestrian. Thereby the user is able to
select another point on the pedestrians body that might allow for more
stable tracking results.
A completed tracking file can tell the user for every image in the sequence
where the tracked pedestrians are. In order to better understand the
implications of the tracking file, also visual data analysis is necessary.
The Analysis tab provides this option. In the Analysis tab several plots
can be made, namely: macroscopic velocity plots, microscopic velocity
plots, density plots, intensity graphs, flow characteristic scatter plots and
trajectory visualizations. Next to plots this tab also provides the user with
an option to make a movie of the velocity and density plots. Last of all,
the analysis tab offers the possibility to save the density, intensity and
trajectory information in an excel file for further analysis. The functions
behind the plots will be touched upon based on their data input, method
used, assumptions made and data output.
Therefore the user is able to specify the data collection locations. In total
9 per scatter plot.
The output of the calculation consists of bar plots for each of the specified
lines. Also a Excel-file can be exported which for every line gives the number
of pedestrians who crossed the line in each of the two directions.
The software will accordingly find the data necessary from the velocity
vector matrices and the trajectory file. The same assumptions about the
density and velocity calculations apply as mentioned before. The velocity
calculation is based on the weighted average of the individual velocities
of pedestrians within a 20 pixel radius.
In the plotting process the user is able to declare which detection points
it wants to plot within one graph. Three combinations can be plotted,
namely: velocity against density, velocity against time and density against
time. The software will produce graphs in which every detection point has
its own identifiable colour.
D.Duives
101
103
D.Duives
D.Duives
the detection and tracking function than were initially not expected to be
possible.
4.
It reports a combination of velocities & densities
At the location of the detection points velocities and densities can be
combined and plotted using a scatter plot. This requirement is deemed
fulfilled.
5.
It reports a combination of velocities, densities and forms of
self-organisation
At the location of the detection points, velocities and densities can be
combined and plotted using a scatter plot. This same output is also
recorded and exported to excel. In that case also the timeframe of
each data row (velocity & density) is recorded. Thereby 5.a. is fulfilled.
However it does not provide a tool to automatically attach the forms of
self-organisation to this same graph. But combining the file created at the
detection points with the by the user recorded forms of self-organisation,
5.b. can still be very easily be plotted using Matlab. For now requirement
5 is deemed half-fulfilled. But as the requirement was mainly listed to
simplify the data analysis, the failure to meet this requirement, does not
represent the failure of the software overall. But is deemed an alley for
further improvement in the future.
Software related
6.
New by me written code has clear and open structure of
underlying parts
The newly developed source code is written in a function structure.
Each function has a self-explaining text at the start of the document.
Also the connection between functions is highlighted. Furthermore the
computation steps are also highlighted. This requirement is fulfilled.
7.
New by me written code has a explicit and understandable
code
As far as I can deduce this is true. Therefore this requirement is checked.
8.
The input of the analysis software is an MTS-file
The input of the ImageTracker is a sequence of. png images. Which would
mean this requirement has not been achieved. But an extra piece of open
source software called VirtualDub can be used to cut the .MTS video into
the separate .png images.
9.
The output file is constructed by the user itself from several
different plots and tables stored in multiple directories
In the previous chapter several plot and export possibilities have been
shown. The user specifies where the files are saved, so the possibility
of storing the information is multiple directories is also possible. This
requirement is met.
Precision related
11.
The error on the velocity is no more than 10% of the calculated
velocities
This requirement is very hard to test, there no measurements were done
at ground level within the crowd. Therefore the requirement is said to be
unknown.
12.
The error on the densities is no more than 10% on the
calculated densities
In the hand checks done, the calculated densities are right. However,
not everyone can be detected due to occlusion. At those instances there
are no other means to test want did happen at the ground. So no final
conclusion can be drawn about this requirement.
Usability related
13.
The software can be operated by me
There I developed major parts of the total software package and worked a
lot with the others, it can be concluded that this requirement was met.
14.
The software is robust
This requirement will be further discussed later on in this chapter.
15.
The software is stable
This requirement will be further discussed later on in this chapter.
D.Duives
19.
Reporting of the Angular deviation with respect to local
destination
The software does not yet include this functionality. But similar to 18 an
easily implementable MATLAB script can be written to find this data.
8.2 Usability
A user-test has been done with a test subject who was unaware of the
function and layout of the MODT-tool. Based on the user test it was
determined that the layout of the program still has a lot of room to
improve. Especially the buttons need more explanatory information in
order for the user to understand the underlying functions. This has partly
been fixed based on the remarks made during the test. Further necessary
actions were deemed to laborious for the extra usability. However, they
might need to be included in a further evolved version of the MODTtool. Next to the button explanation also the text-boxes needed more
information on the usage of them. These problems have hopefully been
solved.
D.Duives
the trace. For an image sequence filming people walking across a station
floor, this method works fine and does take considerable less computation
time. However, for the Lowlands footage the sequence gives not enough
information per pedestrians to start the trace.
8.5 Conclusion
Combining all evaluations before it can be concluded that the software
package is functioning better than expected. The required functionality
within requirements has been fully met. Besides that also the list of
desired requirements has been almost completely met. The user-test
left some room for improvement. But considering that the actual user
has far more knowledge about the functionality than the test-subject,
the user-experience was deemed sufficient. For now the calculation time
cannot be decreased, but more research is needed into the preprocessing
and preparatory calculation. The robustness w.r.t. erroneous input was
deemed good. Therefore it is concluded that the software package is
ready for usage.
D.Duives
Usability:
After a few alterations, especially in the layout of the software program,
the MODT-tool is deemed fit for its intended use. More alterations can
be made, these however were not seen as cost-effective adaptations for
the time being.
Calculation speed:
The calculation speed for an image sequence of 10.000 frames with a step
size of 10 using a personal desktop computer is on average 23 days. The
calculation time can be reduced by either increasing the computational
power available or by increasing the effectiveness of the optical flow
calculation.
111
111
113
Figure D.1: Festival Isle of Wright (source: www.flickr.com)
D.Duives
D.Duives
D.Duives
The adaptations to the velocity calculation, the tracking method and the
fundamental diagram are used during all hereafter displayed results.
119
D.Duives
The predominant OD-pairs originate or end at either the ALPHA side exit,
the CHARLIE exit, the toilet entrances or the Hemelbrug exit. Therefore
the walking direction in the footage is mainly east-west or visa versa.
A set of transcribed trajectories of sequence 5 shows the same trend (fig.
10.2). Besides the generally parallel movement a few pedestrians deviate
through the crowd in a downward movement. The red trajectories in
figure 10.2 also indicate that these turning movement proceeds via a
stair-like motion.
D.Duives
121
sequence 5
sequence 7
sequence 8
D.Duives
123
t=400.4 sec
t=560.4 sec
Graph 10.6: Histogram experienced densities sequences 7
Figure 10.7 : Moving high density regions sequence 5 t=320.4 s, 400.4 s, 560.4 s
10.10 the locations where the high density regions are generally found
during the recordings are shown (i.e. the area in front of the cash registers
and the area in front of the toilet entrances). The increased densities
at those locations are caused by the hampering of free flow by groups
of waiting pedestrians. At the same time the movement of the passing
pedestrians shows more search-like behavior. The direction of movement
changes more often and the average walking velocity goes down. Also the
variation in walking velocity increases.
sequence 5
sequence 7
sequence 8
D.Duives
125
D.Duives
127
No organisation /disorder
First of all, the total lack of organisation, which is especially visible in
sequence 5, see fig 10.11. During these periods, every pedestrian moves
where and when he/she wants. Trajectories are almost straight lines and
the variance of the walking velocities of each individual are limited.
Lane-formation
The second form of self-organisation encountered in series 7 and 8 is
the formation of lanes, see fig. 10.12 for a representation of walking
directions. In both sequences the number and the width of the lanes
change dynamically over time and location.
Stop&go
The last form of self-organisation encountered only in a few small
episodes of sequence 7, is the existence of stop&go movements. During
this moments the flow of people temporarily slows down, only to speed
up again afterward for no apparent reason. The stop&go movement is
only present in the top-left corner of the screen. A visual representation
is hard to deduce because pedestrians within this very dense region are
hard to track.
Number of lanes
In the past there has been tried to describe lane formation by means
of counting the number of walking lanes that do appear. In this study a
lane has been defined as a flow of pedestrians all walking in the same
direction, walking directly next to each other. The magnitude walking
velocity of the pedestrians can be equal, but might also be variable within
a lane. However, the number of lanes also depends on the width available
for the total pedestrian bidirectional flow. Therefore the number of lanes
is defined as the number of lanes per total effective width (the passage
width available at the experiment locations at Lowlands 2011)
129
D.Duives
However, in low density regions and at the borders of the passage, this
method also fails to produce a border. Therefore it is assumed that
pedestrians in the middle of the passage consider their local route based
on the forces that are present in their environment and choose the route
of the least resistance. At the border of a lane in the middle of the passage
would mean that the force working on both sides of the pedestrian is
equal. The assumption that this thesis makes, is that the person at the
rim of the passage has the choice to move away from the crowd, but
chooses not to deviate because the force that person experiences from
the side where nobody is walking is non-existent. The assumption is that
this person at the rim, at the border of the last lane, does not experience
any pressure to move away from the crowd, while at the same time he
has the ability to do so at whim. Thereby his own trajectory represents
the exact location of the outer border of the outer lane. It is expected that
this method introduces some random error. However any other way of
estimating the outer borders would probably introduce a larger error.
Coupling the characteristics
The lane-formation characteristics need to be related to the macroscopic
flow parameters as experienced by the pedestrians. Therefore the laneformation characteristics are deduced for every pedestrian detected
in the footage. With a small program both width and the number of
lanes, at the cross-section where the pedestrian is standing, are determined. Thereby the instantaneous macroscopic flow parameters can be
coupled to the exact lane-formation characteristics present at the same
cross-section.
131
131
D.Duives
Velocity
Density
Flow
Characteristic
seq. 5
0.5-1.1 m/s
0-2 P/m2
0-2.2 P/m/s
free flow
seq. 7&8
0.5-1.3 m/s
0.5 - 4 P/m2
0.25-5 /m/s
lane formation
stop&go
(11.4)
(11.5)
(11.6)
(11.7)
(11.8)
(11.9)
(11.10)
(11.11)
(11.12)
D.Duives
Also for this relation the confidence interval of (-0.0045, 0.0015) is very
large and allows for the possibility that the sign of the gradient is actually
positive. Because the number of lanes and the width of the lanes are
negatively correlated, both trends are in accordance.
Both relations are tested against the null-hypothesis that there is no
relation (gradient =0). The hypothesis test (see Tb in table 11.2) confirms
that the gradient of the found relationship between velocity and the width
of the lanes is not significantly different from 0. However, the correlation
between the walking velocity and the number of lanes is found to be
significantly different from 0. From the hypothesis test it can be deduced
that the walking velocity has an influence on lane-formation according to
the footage data.
For both relationships presented above the correlation between the
two parameters is small due to the amount of scatter in the data. The
coefficient of determination is also rather small for both correlations.
Thus a lot of variance is left unexplained by the found simple regression.
It can be concluded that the goodness of fit is very weak.
Parallel to these results, the walking velocity within each lane decreases
slightly with a growing lane width (eq. 11.14).
v = -0.0015 * w + 0.89
(11.14)
Table 11.2: Regression analysis lane-formation vs. velocity
135
D.Duives
lanes=2
lanes=3
lanes=4
lanes=5
lanes=6
lanes=7
Table 11.3: Regression analysis number of lanes vs. the experienced density
D.Duives
that when the experienced density within a lane increases, the width
of the lane also increases. A linear regression of the reversed relation is
represented in the graph 11.6.
The statistical test of the linear regression is positive and larger than 1.96,
so the gradient is significantly different from 0. For the density lanes (>3
pers/m2) found within the Lowlands footage, high density lanes would be
on average 4 meters wider than low density lanes (<1 P/m2).
D.Duives
D.Duives
Parallel to the above, the results show that there is a second relation
between density and the number of lanes. Using the relation presented
above, an increase in density causes a widening of lanes. For a set passage
and very low densities every pedestrian can choose its own path. The
repulsion would force cooperation between pedestrians. In this first phase
the lack of self-organisation can also be seen as the presence of an infinite
number of lanes. With growing densities wider lanes (+/- 5 m) with a lot
of empty space in between would form, where each lane can freely grow
upto the point where the borders of the lanes meet. At the end of this
phase the number of lanes is finite and depending on the width of the
passage. From this moment in time onwards the width of each lane is
dependent on the relative excess repulsion force present in each of the
lanes with respect to the repulsion forces present in the other lanes at
the same cross-section of the passage. Lanes with excess repulsion forces
try to widen, and lanes with relatively less repulsion forces, are forcibly
narrowed.
The found results imply that the width of a lane increases with an increase
of the density within the lane. This result implies that the number of
lanes in a passage with a fixed passage width decreases when density
within all of the lanes increases. This result is quite the opposite of what
Hoogendoorn et al. (2008) found in their research. It is hypothesized that
both results do not contradict each other but only describe the same
phenomenon at different scales using different descriptive measures.
Where Hoogendoorn et al. (2008) use the number of lanes calculated by
means of an optical flow method in a constricted very small passageway
(<10 m), this MSc. thesis describes pedestrian movements by means of
141
macroscopic velocities at far larger passageway. It is hypothesized that
143
0.0088
0.85
0.10
2.01
yes
-00.15
0.89
0.10
-1.04
no
-0.019
1.57
0.36
-2.31
yes
0.024
1.28
0.35
9.16
yes
-0.0019
1.28
0.44
-0.2
no
0.018
1.12
0.44
6.30
yes
It was found that only the relation between the width of lanes and
respectively density and flow are positive significantly different from 0.
Also the relations between respectively velocity, density with respect to
the number of lanes were proven to be significantly other than 0.
However, even though the other correlations cannot be proven without
doubt, due to enormous scatter, it is hypothesized that these will indeed
be found to exist when better research methods are used.
In the last paragraph of this chapter a theory is proposed in which
all visible trends are taken into account. It states that when density
increases, the walking velocity decreases, the width of the lanes increase
sand the number of lanes decreases.
Figure 12.1: Koninginnedag Amsterdam Museumpark (source: Flickr)
D.Duives
Equipment
This research looked at the development of a new recording and analysis
tool. The question was whether a UAV recording technique could actually
be used to gather stable and processable footage. Looking back, it can be
said that the results are astonishing. Not only can footage be recorded,
but the resulting footage is also stable enough to analyze the pedestrian
crowd movements without interference. Yet, the recorded footage
is unstable and displays longitudinal and lateral movements as well as
rotations along the XY-axis.
Software
A combination of VirtualDub and ImageTracker provides a tool to correct
the unstable distorted footage. ImageTracker is capable to solve the
distortion of the image almost completely. However, slight movement
remains visible in the stabilized data in the top-left and bottom-right
corner due to lack of stable objects in those corners. This MSc. thesis
has proven that it is possible to translate pedestrian crowd movement
footage into pedestrian movement data. However, due to the by the
camera movement introduced instabilities, uncertainties are included in
the results.
Data related
By means of the UAV recording technique and the developed software
the crowd movements at Lowlands have been analyzed. It was seen that
mainly velocities between 0 and 1.8 m/s were present. Densities range
from 0 - 4 P/m2. Besides that, only two forms of self-organisation were
analysed (no organisation and lane-formation). An in-depth analysis of
145
the correlations between the self-organisation characteristics and the
D.Duives
During the stabilisation not only longitudinal and lateral movement along
x- and y-axis are visible. Also a slight rotation of the image plane around
the XY-axis is seen, see figure 12.3 for a visual representation. The rotation
results in a reduced representation of world coordinates on one side of
the plane and an enlarged representation of the other side, causing a
misrepresentation of movement velocities.
The expected error on the Kalman tracking is assumed less than that of
the optical flow. However, the optical flow method does present us with
the means to estimate the movement direction of most pedestrians in
the first image of a sequence. Thereby improving the start-up period of
the implemented Kalman filter. However, as soon as more reliable data is
available the Kalman ignores the optical flow data, since the optical flow
method introduces large estimation errors. The total introduced error
due to not entirely stable footage is limited, but still accounts for a part of
the scatter in the velocity data.
does not provide a lot of support for manual detection. People are often
occluded and have no clear outline. See fig. 12.4 for a general detection
image. Based on the t-shirt colour and the presence of black heads most
people are detected. But no 100% detection rate can be guaranteed.
The found densities in this thesis are therefore expected to be slightly
lower than the actual densities present at Lowlands, especially in dense
crowded situations (>1.5 P/m2). In high density crowds one is manually
able to detect most pedestrians, however it is estimated that one in every
15 people is not detected. Depending on the spatial distribution of those
pedestrians, the error is very limited [five pedestrians of which one is not
detected: 100-(5-1)/5*100%= 20%) or fairly big [1 P/m2 of which one is
not detected: 100-(1-1)/1*100=100%]. This might cause the fundamental
diagram to be slightly more elongated than presented at the moment.
The largest introduced errors are found on lower end of the density range.
While the smallest introduced errors are found at the higher end of the
density scales the larger errors are found at the lower end of the scale.
12.2.3 Software
The predominant approximation uncertainties made within the developed
software can be found within the tracking method and the calibration
method. Each of which is touched upon briefly.
detected people of the same height are detected with the same detection
difference. The difference in length within the Dutch adult population with
a 95% reliability interval between the smallest women (158.6 cm) and a
tall man (194.8 cm) is 36 cm (Grandjean 1980). With a footage recording
angle of 49 degrees the maximum difference in the assumed distance
between two people and the actual distance between two people is:
(12.1)
The maximum error is quite big, but since men are generally smaller and
women longer than assumed in the calculation above, it is expected that
the average error will be 20 cm or less. The error in estimated location only
influences the density calculations. Yet it is expected that the detection
error due to a tilted capturing angle is less significant than other errors
presenting themselves in the tracking method.
In the tracking method colour is used as the final characteristic to pinpoint
the location of the pedestrian. However, the pixel colours of pixels
describing the same person in two consecutive frames might vary slightly
due to the instability of the footage, minor differences in the light intensity
or shadows on the persons clothes. Thus, even though the same person
is tracked, the exact movement of the person that the software records
might be off because a wrong pixel on the body has a better colour value.
The detection error and tracking error are related. When one shifts the
detection point on the pedestrians body, also the location (X,Y) where
the person is detected shifts due to the detection error. The tracking error
introduces variability in the detection error for each person. Thereby the
total uncertainty increases. The body of an average visitor of Lowlands
is accounted for by 10x20 pixels. Because the main movement of the
visitors is lateral through the screen, the maximum error arising due to the
shifting of detection point on the body is on average 4 pixels, which would
account for a tracking error of 20 cm in world coordinates. At the same
D.Duives
time the detection error would increase with 23 cm. However, as with the
stabilization error, it is expected that the tracking error decreases when
averaging the movement of pedestrians over a longer period of time. Due
to the variability of the actual detection point, detection error will remain
present and hard to estimate. But this second uncertainty accounts for a
part of the scatter still present in the velocity data.
Besides an error in the detection part of the tracking method, the tracking
method also has a chance to lose a persons track. Especially in situations
where (moving) objects with the same surface colour are present the
tracking method gets confused. Two situations of error can be pointed
out. In first situation two persons with the same shirt colour cross at
short distances, temporary occlusion occurs. In the latter situation two
people with a similar shirt colour walk in the same direction, one in front
of the other. The tracking method skips from the first person in line to the
second when it loses track of the first. The consequence of both situations
is the same, i.e. the persons track is lost. Since the trajectories are used in
the calculations of both velocity and density two errors occur. Especially
the misrepresentation of density is seen as a major problem because
it directly influences the fundamental diagram. The density of an area
where a pedestrian is not present is overestimated due to the tracking of
a person that is not there. At the same time, the density in an area where
the pedestrian is actually present is underestimated.
Most of this kind of tracking errors have been manually detected and
removed. But a few might have escaped notice. Therefore a slight
misrepresentation of the pedestrian macroscopic flow parameters is
expected.
149
12.2.3.2 Calibration
The calibration of the macroscopic parameters is based on a correction
factor based on the x and y-coordinates in the image and the same x
and y-coordinates in reality. Because no GPS measurements were taken
during the experiment calibration was done using the building plan of
the Lowlands grounds. This method introduces two errors. The first one
is related to the dissimilarities between the building plans and reality.
Because the toilet units have standard widths and lengths no major
errors are expected to have arisen from this first uncertainty. The second
expected error is related to the indication of the locations within the
image. It was estimated that the average error on the indication of a
location is maximum 2 pixels off. On the length of a toilet building this
error would result in a error of approximately 0.002 in the y-coordinates
calibration factor. For the width that same indication error would result in
an error of 0.008 on the x-coordinates calibration factor. On themselves
these calibration errors are not expected to influence the data severely.
method does not show the real instantaneous behaviour, but dampens
the abrupt changes in velocity. Using this method diminishes the average
velocity error diminishes to 10% of the original error. However, by not
using the instantaneous walking velocity one introduces an error in the
fundamental diagram, since the densities are instantaneous. However,
since pedestrians are not expected to drastically change their velocity,
the expected error will be rather insignificant relative to the other
uncertainties.
During the data analysis phase four calculations are made, each of which
introduces an error. In the remainder of this sub-paragraph the errors
introduced by the velocity calculation, flow calculation and the lane width
calculation are discussed separately.
12.2.4.1. Velocity calculation
While analyzing the trajectories, it was found that the instantaneous
velocity calculated using the tangent of the trajectory showed very
unstable behaviour (see Ch. 9 for an in-depth explanation of the problem).
Therefore there was chosen to use an approximation of velocity in which
the distance (between the location a person in one frame and the location
of the same person 100 frames later) is divided by the time period (4s). This
The second introduced error is the actual manual recording. The recording
error is of the same magnitude as found during the sensitivity analysis of
the calibration process (2 pixels). This second error is insignificant with
respect to the misinterpretation error.
D.Duives
12.3 Conclusions
This MSc. thesis has looked at three separate but connected issues; the
implementation of a new UAV recording tool, the development of footage
analysis software and the analysis of pedestrian crowd movements at
Lowlands. Conclusions have been drawn with respect to all three issues.
In the according subparagraphs the issues are treated separately.
UAV recording
It was found that even though the footage could not be stabilized
completely, the pedestrian movement could be analyzed both
qualitatively and quantitative in an offline fashion. From the result it can
be concluded that the UAV recording method can indeed be used to study
pedestrian crowd movements. Since the method captures areas with large
dimensions, this method is uniquely equipped to record pedestrian crowd
motions. However, the developed software is very sensitive to scaling and
rotations of the recorded footage. Therefore it is also concluded that the
UAV, because it vibrates, rotates and translates, is not the most stable
equipment available to record footage. But because the UAV can reach
great heights, can respond quickly to commands from the ground and has
a recording installation that is easily adaptable, it is a piece of equipment
that can certainly help improve the understanding of pedestrian crowd
movements.
Chain of software programs
The footage recorded using the UAV equipped with a HDHS camera can
currently not be stabilized entirely. Therefore, slow processes (v<1 m/s)
recorded from great heights (>50 m) proof difficult (but not impossible)
to analyze with the developed chain of software programs. It is concluded
151
that the ImageTracker is currently not precise enough to completely
12.4 Implications
The UAV recording technique, the analysis software and the proposed
framework all have implications for both research applications and the
practice crowd management.
supply. However, at the moment, the limited recording time due to quick
battery depletion renders the method practically less suitable for online
analysis over longer periods of time (> 20 min). However, when one
has the luxury to use a generator and multiple accus (and do not need
continuous eyes in the air), the UAV recording technique is a footage
recording technique to take into consideration. When one has the
possibility to wait for good weather, and one does not need continuous
footage over a long time, the UAV is the tool to use to record pedestrian
crowd movements. This new technique is therefore more suitable for a
research environment.
Besides pedestrian movements, the UAV recording method can also be
used to record other moving object phenomena where a large recording
area is important.
153
the same cues would reduce the error and alleviate the amount of work
involved in this procedure greatly. The MODT tool is functional but not
an optimized software program working on any platform. Also the MODT
interface can use extra development time.
as infrared and night vision cameras might proof useful for situations
were normal techniques cannot be used due to severe fluctuations in
light intensity or the total lack thereof.
But not only video recordings can be used to detect, track and analyze
pedestrian movement. Upcoming in both the analysis of pedestrian and
vehicular traffic is the use of smartphone data. Bluetooth and WIFI are
both being used to gather data about pedestrian movements. However,
at the moment the adaptation rate of smart phones is less than 100%.
So for now a mixture of smartphone data and other data sources is
needed to establish reliable results. Yet, the accuracy of smartphone data
is admirable. Further research in the smartphone related techniques is
needed to increase the capabilities of this already fruitful technique.
Another techniques that is predicted to help pedestrian movement
analysis along is the use of RFID tags. For now these technique is only
used sporadic in very specific situations, but by small adaptations could
prove useful in situations where pedestrians wear their ticket with them
(especially festivals). Further development is necessary for this technique
to reach its full potential.
157
D.Duives
XVI
Postface
Acknowledgment
This MSc. thesis project has only been possible because it was able to
record footage of pedestrian crowd movements at a unique location. This
has been made possible by the help the VPRO, who saw an opportunity
and helped achieving it. I would also like to thank Loc7000 and the
organisation of the festival Lowlands, who both granted us access to the
grounds while the festival was ongoing.
Next to the people who made the research possible, I also want to thank
my graduation committee. Prof. dr. S.P. Hoogendoorn for his wild ideas
and the possibility to play with one of the coolest research toys available.
Looking back, it has been an interesting roller coaster ride. Impressions
have been following each other quickly. Also dr.ir. W. Daamen, my daily
advisor, for her advise and the thorough way she scrutinized everything
I produced. Between the two of them they showed what research is all
about. I further want to thank both prof. dr. ir. B. Gorte and ir. Wiggenraad
as members of my graduation team. Even though the interactions have
not been often, both helped me focus on what was really important.
Last of all, I want to thank the people who have lived with me outside
my work throughout this research process. Their patience with me
when my head was temporarily not available for anything else has
been appreciated. I want to specially thanks Bianca for the tea and the
conversations. And above all Wouter for the drawings, the advise, the
scolding and everything else.
Delft, March 2012
Dorine Duives XVII
D.Duives
Bibliography
Papers:
Abdelghany. A & Abdelghani. K., H.S. Mahmassani, S. A. Al-Gadhi (2005).
Microsimulation Assignment Model for Multidirectional Pedestrian
Movement in Congested Facility. Transportation Research Record: Vol.
?, pp. 123-321.
Al-gadhi. S.A., H.S. Mahmassani, R. Herman (2002). A speedconcentration relation for bidirectional crowd movements,. Pedestrian
and Evacuation Dynamics, Springer: 3-20.
Ali, S., M. Shah (2008). Floor fields for tracking high density crowd
scenes. European Conference on Computer Vision.
Blue, V., J.L. Adler (2001). Cellular Autonomatat for Modeling Bidirectional Pedestrian Walkways. Transportation Research Part B 35: pp.
293 - 312.
Boltes, M., A. Seyfried, B. Steffen and A. Schadschneider (2010).
Automatic extraction of pedestrian trajectories from video recordings.
Pedestrian and evacuation dynamics. Berlin, Springerlink Heidelberg: pp.
43 - 54.
Ando, K., H. Ota, & T. Oki (1988). Forecasting the flow of people (in
Japanese). Railway Research Review: vol. 45, nr. 48, pp. 48-14.
D.Duives
Hoogendoorn, S.P., W. Daamen, M. Campanella (2008). Selforganization and chaos in pedestrian flow: Experiments and modelling,
Proceedings of the Chaotic Modeling and Simulation International
Conference, CHAOS 2008
Horn, B. K. P., B.G. Schunck (1981). Determining Optical Flow. Artificial
Intelligence: Vol. 17, pp. 185-203.
Hughes, R. L. (2003). The flow of human crowds. Annual review of
Fluid Mechanics: vol. 35, pp. 169-182.
Ihaddadene, N., C. Djeraba (2008). Real-time crowd motion analysis.
19th Int. Conference on Pattern Recognition: pp. 1-4.
Ishii, H. T. O., J. Takusagawa & N. Muroi (1987). Bidirectional pedestrian
traffic measuring system using radiance detection. Journal illuminating
engineering instruments: vol. 71, pp. 626-631.
D.Duives
XXI
Polus, A., Schofer, J.L. & Ushpiz, A., (1983). Pedestrian flow and level of
service. Journal of Transportation Engineering: vol. 109, pp. 146-157.
Electronic sources:
FileInfo. (2011, 29/07/2011). .MTS file extension. Retrieved
16/10/2011. http://www.fileinfo.com/extension/mts
Jorritsma, A. (2009) Loc7000 zet puzzelstukjes Lowlands in elkaar.
http://www.logistiek.nl/supply-chain/supply-chain-management/
did12614-loc7000-zet-puzzelstukjes-lowlands-in-elkaar.html
Love Parade (2009). Dokumentation Love Parade. http://www.
dokumentation-loveparade.com/
Yin, J. H., S.A. Velastin & A.C. Davies (1995). Image processing
techniques for crowd density estimation using a reference image.
Second Asian Converence on Computer Vision.
D.Duives