Documente Academic
Documente Profesional
Documente Cultură
A Dissertation
of Cornell University
Doctor of Philosophy
by
May 2002
TABLE OF CONTENTS
Why Simulation? 2
Role of Randomness 2
Agent-Based Models 2
Spatial Issues 4
Properties of Location—Q-Vector 25
AGGLOMERATION MODEL 33
vii
Specifications 34
Simulation Results 36
Discussion 44
POLLUTION ACCUMULATION 50
Specifications 50
Specifications 65
Simulation Results 67
Specifications 72
Simulation Results 76
BIBLIOGRAPHY 84
viii
LIST OF TABLES
ix
LIST OF FIGURES
Figure 3.10: Emergence of cities with 98 family units, π = 0.9 , c = 0.01 , t = 0.2 . 24
Figure 3.11: Further development of cities with 98 family units, π = 0.9 , c = 0.01 ,
t = 0 .5 .
24
Figure 4.1: Distribution of attractive forces around cell i where p = 1 and G j = 1 for
all j .
30
Figure 4.2: Distribution of attractive forces around cell i where p = 1 and Gi = 1 for
all j except for Γ(Q j ) = 70 at d ij = 15 31
x
Figure 6.1: Spatial patterns in equilibrium under various sets of parameters. 37
Figure 6.6: Spatial patterns with travel cost uniformly distributed from 0 to 1. 46
Figure 6.8: Set of p and c that offer realistic population distributions added to figure
6.5. 49
Figure 7.2: Formation of clusters when resources exist with low transport cost. 55
Figure 7.3: Formation of clusters when resources exist with high transport cost. 56
Figure 7.4: Formation of clusters and behavior of mega-cluster with both resources
and pollution (low transport cost). 57
Figure 7.7: Behavior of agents with depletable resources concentrated in the center of
the space. 60
xi
Figure 7.8: Behavior of agents with renewable resources concentrated in the center of
the space and with accumulation of pollutants. 62
Figure 7.9: Behavior of agents with renewable resources concentrated in the center of
the space and with pollutant purification. 64
Figure 9.1: Initial condition for qi , 4 for all i (North American Map). 74
Figure 9.2: Formation of urban clusters of North America simulated under various
transport cost ranges. 79
xii
CHAPTER ONE:
INTRODUCTION
The role of science is to expand our knowledge about the universe and supply
the findings to the fields that make use of them. For example, mathematics arms
physicists with vigorous logical reasoning; physics allows astronomers to understand
how objects interact with each other; astronomy helps engineers construct a guidance
system for a space shuttle; and, we go to outer space. Conversely, the progress of
science also has been helped by engineering—especially, in the late 20th century, the
advance of computer capability changed the way most scientific work is done.
Likewise, regional science has also exploited the computer capabilities of
recent years in order to expand our knowledge about our economy in terms of space
and offer the findings to policy makers.
1
Guy Orcutt (1990), who is known as the pioneer of microsimulation, designed the first multiple
1
2
Why Simulation?
Role of Randomness
Agent-Based Models
local. Also, it is known that even very simple rules can cause extremely complex
behaviors. The use of agent-based models is an ideal approach to make use of all the
advantages mentioned above; namely, the model can be dynamic and simultaneously
capable of simulating complex and random phenomena.
One of the implications of using simulation would be that we could possibly
“experiment” with new policies before applying them to the real world. With agent-
based models, in particular, we could also experiment on an individual’s preferences
and responses to the environment. Indeed, with agent-based approaches, it is possible
to even model adaptive behaviors of economic agents without allowing traditional
economic assumptions such as rational expectation. The difficulty in, and significance
of, modeling the complex behaviors of the real-world economy were pointed out by
Isard (1956) in the preface of “Location and Space-Economy”:
When this was written in 1956, analytical tools were the only ways for general
researchers; in fact, the methods introduced in the book attack complex problems
within the framework of analytical approaches. With agent-based models, however, it
is not a demanding task to create a model to satisfy the kind of complex behaviors
4
described in the citation above. The third objective of this research is to demonstrate
this.
In general, agent-based models are uniquely capable of handling the
followings: (1) Interaction between agents. Unlike traditional microeconomics
theories (not to mention macroeconomics), where only a representative agent is
considered with the interactions with no or a few agents at most, agent-based
approaches can explicitly model the interaction between each agent and an arbitrary
number of agents; (2) Asymmetrical space. Agent-based models allow us to assess
complex-shaped spaces. For example, the shape of the space could be as complex as a
map of the United States; and (3) Heterogeneous agents. Since the concept of agent-
based models is object-oriented, it is relatively easy to model heterogeneity of agents.
Note that we integrate all of these into one set of simple mathematical expressions.
Spatial Issues
As implied in the last part of the citation, one of the dominant factors that
complicate the real-life economy is the existence of space, which is, of course, the
motivation of regional science. As a matter of fact, traditional methods in regional
science such as those in “Methods of Interregional and Regional Analysis” (1998) do
answer some economic questions with regard to location, but they by themselves have
the limitation that the geography must be highly simplified so that the questions can
be solved analytically. In other words, if we are to apply an existing method to the real
geography, there is no way for the method to recognize the complex
shapes/dimensions of political and economic units.
It is an advantage of agent-based models that the physical complexity of the
space does not matter because an agent’s local rules are concerned with its
5
neighborhood with a simple shape2. For this reason, the existing methods with simple
scalar dimension such as the gravity model could be soundly incorporated into an
agent-based model and used for vectorial analyses.
The final objective of this study is to integrate a few existing regional science
methods into agent-based models and construct a general spatial agglomeration model
that can be used in other methods in regional science.
2
Moreover, as will be shown in later chapters, the intricate spatial distribution of population emerge
rather than be exogenously given. Obviously; this concept of emergence—which is often seen in
6
7
Furthermore, in order to utilize the model for analysis, one may also:
6. Construct hypotheses.
7. Determine simulations that would test these hypotheses.
8. Run simulations.
9. Analyze the results.
In the following sections, these steps (except for 2.c and 3) are followed to
build the model.
8
9
Specifications of Space
The space in which economic agents live is set to be a lattice of cells, each of
which can contain multiple agents. Furthermore, we forge a space of infinite horizon
by connecting one horizontal edge to the other and one vertical edge to the other3,
making a torus, in order to avoid potential border effects, which are undesirable for
the purpose of this chapter4.
The distance between any two cells is defined by its Moore neighborhood.
Figure 3.1 shows an example of Moore neighborhood5 where the distance from the
central cell is marked in each cell. We choose this type of distance ( L ∞ distance)
rather than Euclidean ( L 2 ) distance because of the space being a lattice and for the
3
This is a setting commonly seen in spatial interaction models.
4
For example, if we are to construct a model that includes trades between locations, all agents might
simply gather at the central cell of the lattice to minimize the transport cost. In this case, there could be
4
4 3 3
4 3 2 2 2
4 3 2 1 2 2 3
4 3 2 1 0 1 2 3 4
4 3 2 1 2 3 3
4 3 2 3 4
4 3 4
4 ,
which is characterized by L1 distance or Manhattan distance, which might be, as its name suggests,
appropriate for certain problems in regional science.
10
ease of coding. It is also know that the choice of distance does not affect the
qualitative results (Durrett and Levin, 1994).
4 4 4 4 4 4 4 4 4
4 3 3 3 3 3 3 3 4
4 3 2 2 2 2 2 3 4
4 3 2 1 1 1 2 3 4
4 3 2 1 0 1 2 3 4
4 3 2 1 1 1 2 3 4
4 3 2 2 2 2 2 3 4
4 3 3 3 3 3 3 3 4
4 4 4 4 4 4 4 4 4
Productivity of Agents
Consider each agent as a family unit and assume that there are N
6
Of course, we may introduce other factors—in fact, as many as factors we want including non-
economic ones. However, the main concern of this chapter is a smooth transition from conventional
economics; thus, it is appropriate to stick to the traditional microeconomic way. The generalized model
which indeed reveals increasing returns to scale. Therefore, since all ni agents are
ni2
f i (ni ) = = ni (2)
ni
This suggests that in order for family α to gain more real wage, it needs to increase
the number of the other family units—by migrating to another location that has a
larger population7.
Law of Motions
obtained by subtracting transport cost of migration from the real wage at the
destination. Therefore, if α relocates from i to j , the net gain of family α is
Gα (n j ) = n j − πd ij (3)
where π is the transport rate per unit distance and d ij is the distance between i and
j . Thus, α chooses j that maximizes Gα .
7
The family could also propagate and increase the population in the same location. The idea of birth
Note that family α is assumed to perceive the gross regional product of every
is in the neighborhood of i due to the existence of transport cost and behaves as if the
family ignored distant locations. The models presented in the following chapters have
a more realistic approach toward agents’ scope.
Algorithm
Once we set up the characteristics of the space and the behaviors of agents, we
need to build a flow of simulation—or an algorithm.
In this model, first, parameters are initialized as follows:
The initial distribution of agents is shown in Figure 3.3. Second, the first family unit
a is randomly chosen to consider relocation. (Let the location of a is denoted by ia )
Note that this is the main part where the role of randomness enters into this model.
Third, family unit a evaluates all other locations by equation (3) and finds the
location, jmax , that gives the highest gain. If there are more than one location that give
the highest gain, then the family unit chooses one of them randomly; this is another
8
The assumption of uniform initial distribution of family units is based on extensive historical facts.
part where randomness might take effect. Fourth, if the gain by relocating to location
jmax is larger than that by staying in location i , the family moves to jmax ; otherwise,
it stays in location ia . It also stays if those two gains are the same. Last, the algorithm
goes back to the random selection of a family unit and repeats this process. Note that
when the next family is randomly chosen, every family has an equal chance for
relocation, so the same family could also be chosen again by a probability of 1/2477.
Time
The unit of time in this model is assumed to be the average length between the
times when a family unit is chosen and when the same family unit is chosen again, so
for every unit of time, every agent has a chance to be selected once. Therefore, in this
model’s settings, one unit of time is equivalent to 2477 selections of family units since
there are 2477 family units in total. This definition of time allows us to change the
number of family units yet keep the normalized time consistent.
This way of updating the system, or letting only one agent move at the same
time, is called asynchronous update, and allows us to have continuous time. The
alternative is synchronous update, where all agents make their moves at once (usually
with each agent not knowing how the other agents move), and is equivalent to discrete
time. Synchronous update causes additional (and sometimes undesirable) effects
(Durrett and Levin, 1994), but in certain economic cases, it could be more
appropriate.9
9
For instance, economic agents (consumers, investors, governments, etc.) often make their decisions
based on the information that is available after a certain period time or only periodically, such as after
( )
Gα1 n jd =1 = n jd =1 − πd ij = 2 − 0.9 × 1 = 1.1 (4)
( )
Gα1 ni1 = ni1 − πd ij = 1 − 0.9 × 0 = 1 . (5)
16
And this is smaller than the payoff from relocation. Therefore, the family relocates to
one of its adjacent locations (recall that when a family unit is indifferent between
destinations, it chooses one of them randomly), making the first “village” that consists
of two family units. The state at this point in time is shown in Figure 3.4, where there
is one cell marked “2” next to a black cell without a white dot.
fact, such a critical value exists and it can be algebraically derived; that is, the first
family moves only if
( )
Gα1 n jd =1 = n jd =1 − πd ij = 2 − π × 1 > 1 (6)
17
or
π <1 (7)
In an anthropological sense, this could partially explain that when and/or where the
cost for migration was too high, ancient family movements did not form groups that
eventually led to a civilization, further explaining why civilizations emerged on
plains.
After some random selections of families making villages of “2”, one family
eventually joins a village of “2” and makes the first village with three family units
(See Figure 3.5). Notice that the third family came from a cell two units distance
away. Indeed, this is possible since
( )
Gα n jd =2 = 3 − 0.9 × 2 = 1.2 > 1 (8)
( )
Gα n jd =3 = 3 − 0.9 × 3 = 0.3 < 1 . (9)
18
observed as shown in figure 3.6. In this figure, a color bar is added in order to estimate
the numbers of families in the cells with more than nine family units.
19
figures, the vertical axis is log-scaled in order to depict larger concentration. Note how
the spatial pattern changes and starts to form a clear hierarchical structure in spite of
the simplicity of the model.
20
2
log10POP
1
50
0
40
30
0 y
20
10
20
10
30
x 40
0
50
2
log10POP
1
50
0
40
30
0 y
20
10
20
10
30
x 40
0
50
This hierarchical structure, however, does not last long in the highly simplified
model because all families eventually go to a single location (most likely the one with
the highest peak in figure 3.8). This is always the case because the model assumes
infinite capacity, which allows an urbanized area to simply keep attracting more
families. Obviously, it is unrealistic. In the next section, we tweak the model and
attempt to solve this problem.
Gα (n j ) = n j − πd ij − c j n 2 , (10)
negligible as n increases. Figure 3.9 illustrates this tendency under π = 0.9 and d = 1 .
We can observe that Gα (n ) indeed starts to decline after obtaining a certain number of
25
c = 0.01
c = 0.02
20 c = 0.05
15
10
G(n)
5
-5
-10
0 20 40 60 80 100
n
Based on the modification above, let us run a simulation with c = 0.01 with all
other parameters kept the same as in the previous section. For a better observation of
the effect of the new term, − c j n 2 , on the result, we start the simulation at t = 0.1 with
the population distribution identical to that in figure 3.8.
Figure 3.10 shows the result after 495 random selections or t = 0.2 . The
primary node clearly seen in figure 3.8 has not developed; instead, there have emerged
other cities of equal size. In actuality, any city stops to grow when its population
reaches 98 family units for the following reason: if an agent is alone in location i and
stays there, its net gain is 0.99. Similarly, if the family moves to the next cell10 to
10
We do not need to consider the cases where the family moves to a location more than one unit
distance away because it is obviously more costly to make a village of 98 families by migrating from a
make a village (or city) of 98 families, it gains 1.06, which is larger than 0.99. But if
the family made a village of 99 families, it would gain only 0.09, which is, of course,
smaller than 0.99. Thus, any village that already has 98 families cannot accommodate
another family.
Let us move the simulation forward. Figure 3.11 is a snapshot of the same
simulation at t = 0.5 . We observe the further development of cities of 98 families.
Notice also that some smaller villages have disappeared; the families there have
chosen to migrate to make new cities of 98 families each. Actually, in equilibrium,
with c = 0.01, the families most often make 25 cities of 98 and the residual families
make one or more smaller village(s). In general, given a value for c , all families
attempt to make a certain number of cities of equal size. Unfortunately, this still is an
unrealistic result.
In the next chapter, we construct an even more realistic and general model for
spatial agglomeration, which is capable of capturing arbitrarily many factors, in order
to simulate better the real world.
24
2
log10POP
1
50
0
40
30
0 y
20
10
20
10
30
x 40
0
50
Figure 3.10: Emergence of cities with 98 family units, π = 0.9 , c = 0.01 , t = 0.2 .
2
log10POP
1
50
0
40
30
0 y
20
10
20
10
30
x 40
0
50
Figure 3.11: Further development of cities with 98 family units, π = 0.9 , c = 0.01 ,
t = 0 .5 .
CHAPTER FOUR:
GENERALIZED SPATIAL AGGLOMERATION MODEL
This chapter involves two major tasks for the development of a generalized
agent-based model for spatial agglomeration. First, we mathematically formalize the
relationship between properties of a location (e.g. population, resources,
environmental pollution, etc.) and the perception of each agent in the location. This
will enable us to systematically add any quantifiable factors involved in the urban
agglomeration/diffusion phenomenon. Second, we examine the way individuals in the
real world make their decisions when they choose a place to live, and incorporate the
procedure into the algorithm. This will result in the formation of highly realistic
spatial patterns.
Properties of Location—Q-Vector
qi ,1
qi , 2
q
Qi = i ,3 (11)
M
q
i ,M −1
q
M
properties we choose. For convenience, we reserve the first element, q1 , for the
25
26
domestic population of agents throughout this study. For example, if one wishes to
include resources and pollution into consideration, the Q-vector is written as:
qi ,1 population of agents
Qi = qi , 2 = amount of resources at cell i (12)
q pollution level at cell i
i ,3
Properties such as amount of resources and pollution level are dynamic: they
change over time depending on human activities, other properties, and sometimes
themselves. The dynamics of properties is explicitly defined by an ordinary
differential equation (ODE) of Q-vector:
Q& = f (Q )
(13)11
Qt =0 = Qo
11
One may point out that there is inflow or outflow of agents; therefore, the ODE for q1 should be
adjusted accordingly—We do not have to include inflow or outflow of agents here because equation
(13) defines the changes in Qi during the time between one migration and another. Recall that we
q&i ,1 r1
&
Qi = q&i , 2 = − r2 qi ,1 (14)
q& r q
i ,3 3 i ,1
Note that this single ODE indeed embraces the fact that both the amount of resources
and pollution level are influenced by the presence of agents. Once the amount of
resources reaches zero, equation (14) needs to be replaced by:
q&i ,1 r1
Q& i = q&i , 2 = 0 (15)
q& r q
i ,3 3 i ,1
where
0 0 0 r1
A = − r2 0 0 and B = 0 (17)
r 0 0 0
3
Therefore, the stability of equation (14) can also be obtained analytically by using the
eigenvalue of A .
Our strategy for modeling migration is to simply make each agent “mimic” a
real person’s behaviors without making the model too complicated. Thus, we divide
the whole decision making process into two parts:
28
Global Search
12
From this chapter on, when we use the word, “city”, imagine a cluster that consists of more than a
Γ(Q j )
Gj = p
(19) 14
d ij
consider jmax as the economic center mentioned above. Although the agent evaluates
all cells in the space for the sake of the algorithm’s consistency, it gives the same jmax
as the case where she recognizes clusters and evaluates the economic centers of the
clusters.
Note that in terms of gravity models, G j can be viewed as the attractive force
13
Another possible form for D(dij) would be D(d ij ) = p / 2π exp(− pd ij2 / 2) , which is equivalent to the
p.d.f. of Brownian motion. This form is appropriate if we assume that the agent has to search for a city
many distances from the center of the cell and within the cell.
15
If there are more than one cell with the highest Gj, then the agent randomly chooses one.
30
the search is still global, the agent is strongly biased for her local cells16 and tends to
dismiss distant cells17.
4.5
3.5
2.5
j
G
1.5
0.5
0
0 5 10 15 20 25
d
ij
Figure 4.1: Distribution of attractive forces around cell i where p = 1 and G j = 1 for
all j .
The agent, however, will not dismiss a cell in a distant location if Γ(Q j ) is
very large. One such case is presented in figure 4.2. In this example, Γ(Q j ) has been
increased to 70 only at d ij = 15, keeping everything else unchanged from the settings
in figure 4.2. In this case, the agent will indeed choose the cell at d ij = 15 because her
net welfare at d ij = 15 is higher than that in the original location although she is far
from the cell. As shown in this example, the gravity type approach enables the agent
16
However, note that if p=0, the scope of the search is purely global.
17
This is analogous to the case in astronomy where a rocket experiences zero “gravity” when it is far
4.5
3.5
2.5
j
G
1.5
0.5
0
0 5 10 15 20 25
d
ij
Figure 4.2: Distribution of attractive forces around cell i where p = 1 and Gi = 1 for
all j except for Γ(Q j ) = 70 at d ij = 15 .
Local Search
After choosing a city (more precisely, the economic center of the city), the
agent begins to look for a place to settle in by performing a local search starting from
jmax . First, the agent evaluates eight adjacent cells h(= 1, 2 K8) in her neighborhood
and chooses h1max such that Lh is the largest18. The function Λ(Qh ) (Λ : R M → R) is a
utility function that gives a larger information set (or simply, more detailed
information) than that of Γ(Q j ) 19. The second term of the equation is the contribution
of the agent’s neighborhood—it captures the assumption that she gains or suffers from
adjacent cells. The existence of this term encourages the agent to stay with the cluster,
ensuring its growth outwards. V is a vector with M positive elements, each of which
corresponds to an element of Qk . It controls, for each property, how much she gains
18
As in the global search, if more than one possibility exist, she randomly chooses one.
19
This specification is important because otherwise the agent would find her final destination just by
To determine the Q-vector, we simply have to list the quantifiable properties that
belong to a cell, including the population of agents. To construct ordinary differential
equations for the Q-vector’s elements, we examine the relationships between each
property and the rest of the properties and mathematically formulate those
relationships. To define utility functions, we need to define Γ(Q j ) in equation (18),
Unless mentioned, other settings are the same as in the highly simplified model
in chapter 3, except that we no longer assume the 23 non-inhabitable cells. That is, (1)
the size of the space is 50×50; the shape of the space is a torus and (3) the number of
agents is 2500 for the models without birth or death.
33
34
CHAPTER SIX:
APPLICATION #1: A SIMPLE MODEL
Specifications
Q-Vector
qi ,1 n
Qi = = i 2 (21)
qi , 2 − cni
ODE
q&i ,1 0
Q& i = = (22)
q&i , 2 0
The initial distribution of agents is random, so the initial condition of this ODE is:
35
qi ,1 (0 )
Qi (0 ) = (23)
− cqi ,1 (0 )
2
2500
∑ q (0) = 2500
i
i ,1 (24)
Utility Functions
For the global search, we assume that each agent is only concerned with
positive externalities. So,
Γ(Qi ) = qi ,1 = ni (25)
Therefore,
Γ(Qi ) ni
G (Qi ) = = p (26)
d ijp d ij
Furthermore, we assume:
1
V = (28)
0
This means each agent benefits from the economies in her surrounding cells without
loss and will not be distressed by the diseconomies emanating from those cells. Thus,
the utility function for the local search is:
36
Lh = Λ(Qh ) + ∑ Λ(V ⊗ Q ) k
k = h 's neighborhood
(29)
= nh − cnh2 + ∑ nk
k = h 's neighborhood
Simulation Results
c toward the rightmost column. Different colors indicate population densities, which
density. This result is consistent with the predictions of the model itself. Similarly, the
larger p is, the smaller the distances between clusters. This is also a natural
consequence, given that p is the parameter that represents the relocation cost.
Notice that for each c there seems to be a transitional state between the state
where there is one mega-cluster and that where there are arbitrary many small clusters.
In fact, outside that transitional state, the effect of p appears to be very small. There is
a possibility that this is a phase shift, which is commonly seen in the natural sciences.
Furthermore, in the transitional state, the system offers clear hierarchical structures.
Since this is a significant topic in regional science, it is examined in detail later in this
chapter.
37
c
p\ 0.01 0.1 0.3 0.5 0.7
0.1
0.3
0.5
0.7
0.9
1.1
who have chosen to relocate directly move to that cluster to create a mega-cluster.
As p increases, however, this tendency changes. That is, agents first create
small clusters all over the space and some of the small clusters start to grow further.
This can be most prominently observed at p = 0.5 in the figure: At t = 0.5 there are
only small clusters of equal size, but at t = 1, there have emerged a few larger clusters,
which further attract other agents who may or may not belong to other clusters—thus,
creating a hierarchical structure in equilibrium.
The case where p = 0.4 elicits behavior somewhat in between that seen in the
two examples above. Agents do make small clusters first; but, after creating a mega-
cluster, some who are already in the “small” clusters also move to the mega-cluster, as
do those who do not belong to any emerged clusters, resulting in a hierarchical
structure. This implies that agents are repeating long-distance relocations to larger
clusters.
At p = 0.7, agents choose to travel only for short distances to form relatively
One could claim a condition for hierarchical structures of urban clusters from
the fact that this model covers all types of migration patterns introduced by
Ravenstein. Note for example, that we observe realistic spatial patterns around
p = 0.4 − 0.5 under c = 0.3 (or the “transitional” area mentioned earlier in this
section) where all or most classes of migrants exist. This might tell us that hierarchical
structures take place when all types of migrants co-exist in the process of pattern
formation.
40
p
t\ 0.1 0.3 0.4 0.5 0.7
0.5
1.5
2.5
Analysis
patterns that emerge, a series of simulations were performed for analyses. Analysis is
possible often only by repeating the simulation as many times as possible under the
same setting due to the stochastic nature of the model. For the analyses in this section,
simulations were done 20 times for each set of p and c .
Figure 6.3 shows the effect of p and c on the number of clusters in
equilibrium. For any c , there emerges only one mega-cluster when p is close to zero,
and the number of clusters increases as p increases. This is, of course, because the
higher the transport cost, the harder it is for agents to travel long distances; therefore,
they choose to form clusters near their original locations.
For small c, however, the number of clusters stays one for relatively large p
(e.g. for c = 0.1, it takes p = 0.5 for the number of clusters to be more than one). This
is because when c, which is the magnitude for diseconomy, is small, the attractive
force of the mega-cluster is large enough to compensate for higher travel costs.
Conversely, when c is large, agents are more sensitive to transport cost.
smaller, it requires fewer cells to hold the same number of agents—making it possible
for the space to contain more distinct clusters. In fact, this is the reason that in
equilibrium, smaller c makes more clusters.
42
70
60
50
Number of Clusters 40
30
c
20
0.1
0.3
10 0.5
0.7
0.9
0
0 0.2 0.4 0.6 0.8 1
p
Let us consider the role of the two parameters in terms of the size of clusters.
Figure 6.4 shows the average sizes of clusters, which is measured by the number of
agents that belong to the cluster for the same set of p and c in figure 6.3. Since the
total number of cluster is 2500, each point in the figure 6.4 is equal to 2500 divided by
the number of clusters in figure 6.3.
For small p and c, the size of clusters is 2500 for the same reason that
2500
c 0.1
0.3
0.5
0.7
2000
1500
1000
500
0
0 0.2 0.4 0.6 0.8 1
p
Recall that in figure 6.1, we have observed that certain combinations of p and
c apparently result in hierarchical structures and other combinations of p and c
make either one mega-cluster or many small clusters of approximately equal size. It
would be of interest to quantitatively classify p and c by those three types of spatial
patterns.
One way to assess the classifications above is to compute the variance of
cluster sizes. Notice that (1) if there is only one cluster in the space, the variance
should be equal to zero; (2) if there are only small clusters of approximately equal
size, the variance should still be small and (3), if the sizes of clusters describe a
hierarchy, the variance of clusters’ sizes should be larger than that of the two other
spatial patterns.
Figure 6.5 shows the variance of cluster sizes on the p − c space. There clearly
exists an area with large variance, so we now know that this model will generate a
hierarchical structure as long as p and c are within that area. Below that area, where
44
the variance is zero in the figure (the flat deep blue area with low p and low c ), is the
domain where we observe a mega-cluster. Above the area (high p and high c ) is the
domain where there emerge small clusters of approximately equal size. For the
simulations performed for this analysis, the variances in this domain ranged
approximately from 500 to 50,000.
Discussion
spatial patterns that include (1) one mega-cluster, (2) many small clusters of
approximately equal size and (3) hierarchical structure.
Recall that, in the highly simplified model presented in chapter 3, hierarchical
spatial patterns cannot be sustained over time. In fact, the only possible spatial
patterns in equilibrium are the single mega-cluster or multiple clusters of exactly equal
size. Note that the utility functions of the model in this chapter are analogous to that in
the highly simplified model in chapter 3; yet the model in this chapter has clearly been
proven to show sustainable hierarchical structures—or more realism.
Even though the generalized model in this chapter does produce relatively
realistic spatial patterns, the conditions under which those spatial patterns emerge are
restricted to a small range of parameters as seen in figure 6.5. In other words, a
hierarchical structure in this model would be inherently unstable, collapsing with very
little change in either travel cost or the coefficient of diseconomy.
Remember, earlier in this chapter, it has been pointed out that hierarchical
structures emerge when there are all kinds of migrants (as introduced by Ravenstein).
The instability of hierarchical structure arises from the fact that only a small range of
both p and c allows multiple migrant types to exist. Actually, it is possible, for
instance, to assign different travel costs to the agents to solve this problem.
Figure 6.6 shows sample spatial patterns that emerge under such a
circumstance. The travel costs were uniformly assigned to agents with p ranging
do not exist; thus, no hierarchy. Similarly, one could also impose different
distributions of transport costs and/or heterogeneous diseconomy coefficients if
desired.
Figure 6.6: Spatial patterns with travel cost uniformly distributed from 0 to 1.
Although we have charted, in figure 6.5, the area of p and c that generate
hierarchy, let us further examine the model’s consistency with the real world.
The Rank-Size Rule is a commonly used theory, originally claimed by Zipf
(1949). According to this theory, the second largest city should contain 1/2 as many
people as the largest, the third should contain 1/3, the fourth should contain 1/4, and
so on; in other words, a city’s population weighted by its rank is expected to be
roughly constant. Using this empirical rule, one can evaluate the performance of an
urban agglomeration model.
To quantify “how consistent a population distribution is with the rank-size
rule,” one way is to use the variance of the product of the cluster size times its rank. If
a population distribution is perfectly consistent with the rank-size rule, the variance
should be equal to zero. As it becomes less consistent with the rank-size rule, the
variance should increase. In addition, in order for this index to be independent of the
47
total population size, divide the variance with the square of the total population. The
resulting value should be able to measure the closeness to the perfectly rank-sized
hierarchical structure. Let us call it the RSR index.
For example, consider the perfectly rank-sized distribution where the largest
cluster contains 1200 agents; the second largest contains 600 agents; the third contains
400 agents; and the fourth contains 300 agents, as shown in table 6.1. In this case, of
course, the RSR index is zero. In table 6.2, the population distribution is a similar
hierarchical structure, but it is slightly closer to the uniform distribution than the
previous case. In this case, the RSR index is 0.0036.
Figure 6.7 illustrates how the RSR index changes over the values of p under
c = 0.2 where each plot shows the average of the RSR indices obtained from 50
simulations. The RSR index appears to have its minimum level, or 0.006, at p = 0.59 .
According to data from the US Census 1996, the RSR index of the Unites States
ranges from approximately 0.001 to 0.00620, so the model successfully gives realistic
spatial patterns as far as population distribution over cities is concerned.
Similarly, in figure 6.8, other values of p that offer the minimum RSR indices
are overlaid to figure 6.5. It is shown that the combinations of p and c that generate
the most realistic distributions reside on the upper edge of the strip with high
variances in the figure.
0.040
0.035
0.030
0.025
RSR Index
0.020
0.015
0.010
0.005
0
0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9
20
This value depends on how many cities are included in the calculation. I used the largest 10 to 50
cities.
49
Figure 6.8: Set of p and c that offer realistic population distributions added to figure
6.5.
CHAPTER SEVEN:
APPLICATION #2: RESOURCE DEPLETION AND POLLUTION
ACCUMULATION
The settings in the previous application have only homogeneous land and the
qualities of the land do not change over time. In other words, all elements in the Q-
vector are functions of population only. Hence, since there is no population growth, no
substantial dynamics is defined in the ODE of the Q-vector as shown in equations (22)
and (23).
In this chapter, resources and pollution are introduced as properties of the
space to demonstrate how the properties of cells are used in the generalized spatial
agglomeration model, and to observe how they affect the resulting spatial patterns the
agents create.
Specifications
Q-Vector
In addition to the two properties, qi ,1 and qi , 2 , in the previous application,
there are two new properties to cell i . Define qi ,3 as the amount of resources that are
consumed by agents in i and qi , 4 as the level of the pollution from which agents in i
qi ,1 ni
qi , 2 − cni2
Qi = = (30)
q c × amount of resources at i
i ,3 r
q − c × pollution level at i
i,4 p
pollution.
50
51
ODE
written as:
q&i ,1 0
q&i , 2 0
Q& i = = (31)
q& −r q
i ,3 φ 1
q& r q
i,4 τ 1
zero or a random positive integer (≤ 2500) that satisfies equation (24). As for the
initial condition of resources, we will try different distributions. Also, we assume
pollution does not exist at the beginning of time, so qi , 4 (0) = 0 for all i . Thus, write:
qi ,1 (0)
− cqi ,1 (0 )
2
Qi (0) = (32)
q (0)
i ,3
0
52
Utility Functions
For the global search, simply add q3 and q4 to equation (25) and take the
Γ(Qi ) = qi ,1 + qi ,3 + qi , 4 (33)
Γ(Qi ) qi ,1 + qi ,3 + qi , 4
G (Qi ) = = (34)
dijp dijp
For the local search, take the linear sum of all elements in the Q-vector.
Λ (Q h ) = q h ,1 + q h , 2 + q h , 3 + q h , 4 (35)
Moreover, we assume that each agent cannot use the resources in her neighborhood
except for that in her own cell, but does suffer from the pollution produced in her
neighborhood. Therefore,
1
0
V = (36)
0
1
Lh = Λ(Qh ) + ∑ Λ(V ⊗ Q ) k
k = h 's neighborhood
(37)
= qh ,1 + qh , 2 + qh ,3 + qh , 4 + ∑q + qk , 4
k ,1
k = h 's neighborhood
53
and p=0.1.
For our first example, let us focus on the effect of limited resources distributed
over the space. The initial distribution of resources is random; qi ,3 (0) is a random
value in [0,1] (See figure 7.1). The coefficient for resources, cr , is 50, which enables
the cell with enough resources to be more attractive than increased productivity
simply by agglomeration. The depletion rate of resources is 0.2; therefore, one single
agent can deplete all resources for five units of time at most. Transport cost is low
enough ( p = 0.1), that agents are able to travel for long distances. The parameter c, is
0.3. Finally, we assume that agents’ decisions are not influenced by pollution for this
example; thus c p = 0 . All other conditions are the same as before.
Figure 7.2 shows an example of simulations under these conditions. The agents
first move to nearby cells that are initially endowed with more resources (t = 1) . As a
54
result, those cells attract more agents due to their resources and agglomeration
economies (t = 2 − 3) . As the simulation advances, however, the resources in those
clusters are depleted and the agents start to move out from the clusters (t = 4) . Then,
the agents create new clusters where there are still resources (t = 5 − 7) . When most of
the resources in the whole space have been consumed, agents begin to decide more by
economies of scale than by the amount of resources, making larger clusters
(t = 8 − 10) . Eventually, there emerges only one mega-cluster in the entire space
(t = 11) .
Recall when p = 0.1 and c = 0.3 in the simpler version presented in the
previous chapter, agents form a mega-cluster from the beginning of simulations (see
figure 6.2). There, it takes only about two units of time until most agents create the
final mega-cluster whereas with the presence of resources it takes about 10 units of
time.
Notice also that before the simulation reaches its equilibrium, the spatial
pattern exhibits a hierarchical structure (t = 8) . Although this is not a sustainable
hierarchical structure, it should be noted that there are factors other than the specific
range of transport costs that could cause hierarchical structures to emerge.
55
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.2: Formation of clusters when resources exist with low transport cost.
and p=0.5.
This is the case where p = 0.5 with all other parameters unchanged from the
previous example. A sample result is shown in figure 7.3. Unlike the case with
p = 0.1, the agents do not create “temporary” clusters that are fated to disappear
before the equilibrium. Instead, they stay in their original locations or make only short
trips to consume local resources first (t = 1 − 2) . Only after most of the resources are
depleted do the agents start forming clusters (t = 3) and eventually the system reaches
its equilibrium (t = 7) .
56
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.3: Formation of clusters when resources exist with high transport cost.
and p=0.1.
We now add pollution to Case 1. The agents’ sensitivity to pollution, c p , is 0.1
instead of zero and the accumulation rate, rτ , is 0.1. All other parameters are the same
as Case 1.
Figure 7.4 is a sample result under these conditions. We observe that the result
shows a behavior similar to Case 1 up to the point when the agents form a mega-
cluster (t = 0 − 12) . In Case 1, this would be the equilibrium and the cluster would
stay there forever. With pollution, however, the cluster stays only for a while
57
(t = 14 − 19) while agents accumulate pollutants in the cluster. When the pollution
reaches a certain level, the agents start to “escape” from the cells with pollutants to
other cells nearby (t = 20 − 22) . After agents finish this group migration, they stay in
the new cluster for a while (t = 24 − 26) and move again (t = 28 − 30), repeating the
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.4: Formation of clusters and behavior of mega-cluster with both resources
and pollution (low transport cost).
58
and p=0.1.
In this example, c p and rτ have been increased to 50 and 1 respectively, with
other parameters held unchanged from Case 3. In other words, this is the case where
pollution is dominant.
The unique spatial pattern under this condition is shown in figure 7.5. We
clearly observe the formation of “orphaned” cells instead of clusters that consist of the
cells adjacent to each other. This happens because each agent is sensible of the
pollution in his neighborhood, not only in his own cell; thus, he avoids having
neighbors to minimize the negativity of pollution.
Ö Ý
t=0 t=2
t=4 t=6
and p=0.5.
This case is the same as Case 2 except that the initial distribution of resources
is not random. The resources are initially distributed weighted by a Gaussian
distribution with its peak in the center of the space as shown in figure 7.6. This is an
example where a specific part of the space has absolute advantage that does not last
(e.g. mining, etc.).
Figure 7.7 shows a result under this situation. First, due to the high
concentration of resources, agents create the first cluster in the center (t = 0.2) and it
grows (t = 0.4 − 0.8) . However, as the central part of the space loses its resources, it
also starts to lose inhabitants (t = 1) . Eventually, it completely loses its attractive force
and creates a sprawl (t = 1.8 − 2.0) . Agents then develop satellite cities around the
center of the space (t = 3 − 7) .
60
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.7: Behavior of agents with depletable resources concentrated in the center of
the space.
61
In the previous case, agents leave the center of the cluster because of the
depletion of resources. In this example, resources are never depleted, or rφ = 0 .
(Imagine the resources to be wood or other kinds of plants that can be naturally
replenished) However, let us assume again that agents pollute indiscriminately,
eventually expelling themselves (c p = 1, rτ = 0.1) .
immigrants first and has held agents longer than other areas, the pollution level is the
highest. Thus, the center loses its inhabitants first (t = 4), creating a hole in the cluster
(t = 4.5 − 6) . The hole continues to grow (t = 6.5 − 7) . Eventually, the “ring” breaks
into several smaller clusters that will continue to move around through the space
(t = 13) .
Notice that although the environment set for this example is quite different
from the previous case, it forms a similar spatial pattern in the early stages.
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.8: Behavior of agents with renewable resources concentrated in the center of
the space and with accumulation of pollutants.
q&i ,1 0
q&i , 2 0
Q& i = = (38)
q& − rφ q1
i ,3
q& r q − r
i , 4 τ 1 p
where rp is the rate of purification. Note that this rate does not depend on the
the agents form a ring-like cluster (t = 0 − 8) . Then, while agents are gone from the
central part of the lattice, the pollutants are purged (t = 10 − 12) . Since this area is still
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö Ý
Ö Ö Ö
Figure 7.9: Behavior of agents with renewable resources concentrated in the center of
the space and with pollutant purification.
CHAPTER EIGHT:
APPLICATION 3: PROPAGATION
In the real world, population growth plays a significant role in the formation of
spatial patterns. One of the benefits of using agent-based models is that when we need
to deal with population growth, the procedure is straightforward—simply add more
agents.
The addition of new agents should be done by certain rules. For example, it is
reasonable to place the new agents in the cell in which other agents (or parents) live or
in their neighborhood. Also, the birth rate could be a constant or a function of the Q-
vector.
In this application, newborn agents are initially placed in the same cell as their
parents and the birth rate is assumed constant and each agent gives birth to a new
agent stochastically with a constant probability.
Specifications
Q-Vector
The Q-vector of this version is the same as the one in the last chapter—or
equation (30)—except that we do not consider pollution (for the sake of simplicity).
Thus,
qi ,1 ni
Qi = qi , 2 = − cni2
(39)
q c × amount of resources at i
i ,3 r
65
66
ODE
The main difference from previous applications is that q&i ,1 is no longer always
equal to zero. In fact, if we have a constant probability of birth, q&i ,1 can be any value
in [0, 1] and it changes at every moment. However, on the average or when qi ,1 is
large enough, q&i ,1 becomes equal to the constant probability. Thus,
q&i ,1 rn
Q& i = q&i , 2 = crn2 (40)
q& − r q
i ,3 φ 1
where rn is the probability of birth. Note that we assume that a single agent can give
birth to a new agent without a mate although this is not the case for human beings. To
solve this problem, consider the agents as family units rather than individuals. For
future studies, this can be improved by introducing gender and by imposing more
strict conditions on propagation.
At t = 0, only one agent is placed in one cell21, so the initial condition for
qi ,1 (0)
Qi (0) = − cqi2,1 (0) (41)
q (0)
i ,3
21
Although, the first agent is located in the center of the lattice in the figures in this chapter, where the
“center” is does not affect the result since the space is a torus.
67
Simulation Results
Figure 8.1 shows sample results for three different transport costs
( p = 0.1, 0.5 and 0.9) . The probability of birth, rn , was set to 0.1. The graph on the top
row, for each level of transport cost, shows the population growth (dashed lines) and
the change in the number of clusters (solid lines). Note that although the population
explosions take place at different times for those three examples (e.g., p = 0.1 at t = 0
and p = 0.5 around t = 20 ), this is not because of different transport costs. Under the
same probability of birth, they happen stochastically, especially at the early stages
when there are but a small number of agents.
Initially, for each level of transport cost, when the agents have not formed
clusters, the population and the number of clusters are the same; thus, the dashed line
and the solid line overlap. For this period, individual agents appear to walk
randomly—they actually repeat short journeys seeking new resources in their
neighborhoods since resources are a more attractive force than agglomeration.
Eventually, resources in the whole space run short and agglomeration becomes more
attractive to the agents. Consequently, the number of clusters diverges from the
trajectory (see the top graphs in figure 8.1) and the rate of growth of the number of
clusters slows down. Finally, it starts to decline due to further agglomeration.
Compare p = 0.1 and p = 0.5 . For p = 0.1, the number of clusters begins to
decline, or the agents start to agglomerate, when the population is around 60 whereas
for p = 0.5 the population is around 100. This is because when transport cost is high,
the population needs to be dense in order for an agent to find another agent in her
vicinity to form a new cluster.
68
Number of Clusters/Agents
Number of Clusters/Agents
120 120 120
80 80 80
60 60 60
40 40 40
20 20 20
0 0 0
0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100
Time Time Time
10
20
30
69
40
50
60
70
70
80
90
100
Larger Space
Recall that in the previous example, as agents diffuse some of them reach the
lattice’s boundaries, which are connected to each other. Therefore, the resulting spatial
patterns are equivalent to those from a space of infinite horizon with a “seed” agent
71
deployed every 50 cells. By expanding the unit size of the space, we should be able to
observe the spatial patterns where the boundary effect is reduced.
In figure 8.2, a case of p = 0.5 is presented. The only difference from the
previous example (see figure 8.1) is that the size of the space is now 100×100 cells
instead of 50×50. Initially, (up to t = 30 ), agents simply disperse forming a “cloud” of
agents as in the previous case for t = 0 to 30. Then, at t = 40, they form the first
clusters in the center of the cloud due to the denser population in that area. This
continues as the cloud expands while the initial clusters grow further (t = 50) . At
t = 60, we observe the hierarchical spatial pattern, where the larger clusters are
located in the center and small ones are located in the outer tier.
Ö Ö Ý
Ö Ö
Specifications
Q-Vector
The first three elements of the Q-vector ( q1 , q2 and q3 ) are the same as
equation (39) in the previous chapter. In order to include geography to the model, we
add another element to the vector; that is,
qi ,1 ni
qi , 2 − cni 2
Qi = = (42)
qi ,3 cr × amount of resources at i
q g
i,4 i
where g i is an index that represents the “habitability” and the attraction level of
location i . For example, if location i is on the ocean or the lake, g i is 0, making it
72
73
ODE
Unlike the model in the previous chapter, the birth rate here is not constant.
Instead, we assume that the more urbanized the location is, the less children people
decide to have; thus, the birth rate is now a function of population. One way to realize
this would be to divide the birth rate by the population. Also, the ODE’s for qi , 2 and
qi ,3 are inherited from equation (40) and qi , 4 is assumed to be constant22. So, the ODE
is written as:
q&i ,1 rn / q1
2
q&i , 2 cr
Q& i = = n (43)
q& −r q
i ,3 φ 1
q& 0
i,4
The initial condition also inherits elements from the previous chapter with an
additional element for geographic features; that is,
qi ,1 (0 )
− cqi ,1 (0 )
2
Qi (0 ) = (44)
q ( 0)
i ,3
q ( 0)
i,4
given externally for each, shown in figure 9.1. As suggested earlier, cells facing the
ocean or lakes are given high values (especially the ones in California have even
higher values because of the attraction of gold early in the history of the United
States). However, coasts further north than the Great Lakes receive a handicap, due to
their cold climate.
22
This assumption should be relaxed if we consider the time scale of natural history.
74
Figure 9.1: Initial condition for qi , 4 for all i (North American Map).
75
Utility Functions
Unlike the utility functions used in previous chapters, we do not simply take
the linear sum of all elements of the Q-vector. Instead, we treat q j , 4 in a special
manner since it affects how each agent perceives the other elements. For example, if
location j is inhabitable ( q j , 4 = 0 ) because it is on the ocean, then that fact should
invalidate the other elements. Hence, an effective form for Γ(Q j ) might be:
Γ(Q j ) = q j , 4 (q j ,1 + q j , 2 + q j ,3 ) (45)
q j , 4 (q j ,1 + q j , 2 + q j ,3 )
Gj = (46)
d ijpa
The parameter p has the subscript a because we distribute various transport costs
Λ(Qh ) = q j , 4 (q j ,1 + q j , 2 + q j ,3 ) (47)
Let us make the same assumptions for the neighborhood effect as before for
population and resources. In addition, the geographic features of location h' s
neighborhood by themselves do not affect one’s preferences over location h . Hence,
1
0
V = (48)
0
0
Therefore, the total utility level for the local search is:
76
Lh = Λ(Qh ) + ∑ Λ(V ⊗ Q ) k
k = h 's neighborhood
(49)
= qh , 4 (qh ,1 + qh , 2 + qh ,3 ) + ∑ qk ,1 ⋅ qk ,4
k = h 's neighborhood
Simulation Results
offspring of those agents explore other areas all over the continent and find other
attractive areas such as the Great Lakes and Florida (t = 30 − 40) . Those in California
and around the Great Lakes develop urban clusters (t > 50) . Since most agents tend to
belong to a cluster due to their low transport costs, the whole population does not
grow fast. (Recall that the birth rate is inversely proportional to population density.)
The rightmost column shows a case of agents with high transport costs. Since
p can be up to 100, most of the agents are “sluggish”. In fact, in the simulation
shown in the figure, the early agents stay in the east coast for a while (t < 30) and
start to shape the first clusters there. Then, instead of jumping to California, the agents
first make short trips to the nearer side of the Great Lakes (t = 40) . Sometime
between t = 40 and 50, a few agents with low transport costs are born and they move
to the west coast while some other agents stay around the Great Lakes (t = 50 − 60) .
Those agents with high transport costs slowly expand their territories and create
clusters in the eastern half of the continent whereas those in California form their own
77
cluster by reproducing and by attracting newborn agents with low transports costs
(t > 60) .
The middle columns shows a case between those explained above. The early
agents do form clusters around the east coast first (t < 50), but as soon as more mobile
agents are born they move to California and create new urban clusters (t = 50 − 70) .
Eventually, agents in both east and west sides of the continent develop equally large
clusters.
10
20
78
30
40
50
60
70
79
80
90
100
Figure 9.2: Formation of urban clusters of North America simulated under various
transport cost ranges.
As far as these three cases are concerned, the one with medium transport cost
range (0 < p < 10) appears more realistic than the others. Figure 9.3 shows various
results (t = 100) of simulations under this transport cost range. Naturally, the resulting
spatial patterns are different from each other due to the fact that the model is
80
stochastic. In spite of the variety of the results, the east and the west coasts tend to
have roughly equal populations.
A B C
D E F
This study serves four major purposes: (1) to introduce agent-based models to
regional science, (2) to construct a generic spatial agglomeration model, (3) to show
that the model offers hierarchical structures, and (4) to explore the model’s potential.
This chapter summarizes the results from previous chapters with regard to these
purposes.
In chapter 3, a very simple agent-based model for spatial agglomeration is
introduced. Although the model is highly simplified and there are unrealistic
assumptions, it offers general ideas as to how agent-based models work.
In chapter 4, a generalized spatial agglomeration model is presented. The
model is designed so that the user can “customize” it not only for urban agglomeration
but also for a wide variety of agglomeration phenomena in general, which is realized
by combining global and local search for agents’ migration behaviors. When using
this model, the user can specify (1) the attributes of location, (2) their dynamics and
(3) each agent’s preferences in a consistent way.
Chapter 5 explains the common procedures for the model’s applications
demonstrated in the following chapters.
The first application is shown in chapter 6. The factors involved in each
agent’s decision-making process is as simple as the model in chapter 3, but the model
is re-written in terms of the generalized model and it now includes the “global-local
hybrid” search as well. The effects of the two key factors—transport cost and carrying
capacity—are examined. In particular, it is shown that there is a set of transport costs
and carrying capacities that generate realistic spatial distribution, which is validated
using a newly introduced index that measures how well the spatial distribution
matches the Rank-Size Rule. In fact, this model indeed offers realistic spatial patterns
81
82
and the results show that this study provides a reasonable answer as to how the Rank-
Size Rule can be valid in the real world.
Recall that, as pointed out in chapter 2, Krugman’s model produces only
uniform spatial patterns with no concept of area. Schweitzer and Steinbink’s model
does produce realistic spatial patterns, but the behavioral rules of agents lack
economic sense (as they themselves point out that their model is a “physical” model);
therefore, one cannot expect to use this model for policy analyses.
In chapter 7, resources and pollution are added in order to show how the
dynamics of spatial or land attributes affects agents’ behaviors. We observe that
additional parameters such as the depletion rate of resources, accumulation rate of
pollution and initial distribution of resources dramatically alter the resulting spatial
patterns and the existence of equilibria. It is also shown that this model can simulate
urban sprawls.
In chapter 8, this model is proven to be capable of simulating the propagation
of agents.
In chapter 9, a real North American map is incorporated into the model. It is
done by simply treating geographic features of each location in the same way as other
attributes (such as population and resources) without modifying the model itself. The
results show the potential that the model could be applied to real life problems within
the framework of the generalized spatial agglomeration model.
Axtell and Epstein also incorporated additional features such as resources,
pollution, propagation, gender, etc. Their model, however, is not constructed in a
highly generalized way, so it is not easy to “use” the model for specific purposes.
With the Generalized Spatial Agglomeration Model presented in this study, on the
other hand, it has been shown that one can include any quantifiable attributes of the
83
space into the model in a consistent way—by introducing new elements to the Q-
Vector and defining its ODE.
This study is the first to vigorously utilize agent-based models in the field of
Regional Science. It has proven that agent-based models are not only useful in the
field (especially when combined with existing methods), but they could also offer
good answers to the complex questions that traditional models cannot solve. It is
expected that agent-based models will be explored further in the coming decades and
will eventually become a norm for the types of analyses where realistic modeling is
required.
BIBLIOGRAPHY
Flick, Alexander C. (ed.). History of the State of New York, New York: Columbia
University Press, Vol. V., 1934
Page, S. “On the Emergence of Cities.” Journal of Urban Economics 45, 1999. 184-
208.
Ravenstein, E.G. “The laws of migration.” Journal of the Royal Statistical Society 48,
2: 167-235, 1885.
Zipf, G. Human Behavior and the Principle of Least Effort. Cambridge: Addison-
Wesley Press, 1949
84