Documente Academic
Documente Profesional
Documente Cultură
Abstract This tutorial provides an introduction to the Si- the map. Incorrect association can lead to catastrophic
multaneous Localisation and Mapping (SLAM) method and failure of the SLAM algorithm. Data association is par-
the extensive research on SLAM that has been undertaken.
Part I of this tutorial described the essential SLAM prob- ticularly important when a vehicle returns to a previously
lem. Part II of this tutorial (this paper) is concerned with mapped region after a long excursion; the so-called loop-
recent advances in computational methods and in new for- closure problem. Section III surveys current data as-
mulations of the SLAM problem for large scale and complex
environments.
sociation methods used in SLAM. These include batch-
validation methods that exploit constraints inherent in the
SLAM formulation, appearance-based methods, and multi-
hypothesis techniques. The third development discussed in
this tutorial is the trend towards richer appearance-based
I. Introduction models of landmarks and maps. While initially motivated
by problems in data association and loop closure, these
SLAM is the process by which a mobile robot can build methods have resulted in qualitatively dierent methods of
a map of the environment and at the same time use this describing the SLAM problem; focusing on trajectory esti-
map to compute its location. The past decade has seen mation rather than landmark estimation. Section IV sur-
rapid and exciting progress in solving the SLAM problem veys current developments in this area along a number of
together with many compelling implementations of SLAM lines including delayed mapping, the use of non-geometric
methods. The great majority of work has focused on im- landmarks, and trajectory estimation methods.
proving computational eciency while ensuring consistent SLAM methods have now reached a state of consider-
and accurate estimates for the map and vehicle pose. How- able maturity. Future challenges will centre on methods
ever, there has also been much research on issues such as enabling large scale implementations in increasingly un-
non-linearity, data association and landmark characterisa- structured environments and especially in situations where
tion, all of which are vital in achieving a practical and GPS-like solutions are unavailable or unreliable; in urban
robust SLAM implementation. canyons, under foliage, underwater or on remote planets.
This tutorial focuses on the recursive Bayesian formula-
tion of the SLAM problem in which probability distribu-
tions or estimates of landmark locations and vehicle pose
are obtained. Part I of this tutorial surveyed the develop- II. Computational Complexity
ment of the essential SLAM algorithm in state-space form,
described a number of key implementations and cited lo- The state-based formulation of the SLAM problem in-
cations of source code and real-world data for evaluation volves the estimation of a joint state composed of a ro-
of SLAM algorithms. Part II of this tutorial (this paper) bot pose and the locations of observed stationary land-
surveys the current state of the art in SLAM research with marks. This problem formulation has a peculiar structure;
a focus on three key areas; computational complexity, data the process model only aects vehicle pose states and the
association, and environment representation. Much of the observation model only makes reference to a single vehicle-
mathematical notation and essential concepts used in this landmark pair. A wide range of techniques have been de-
paper are dened in Part I and so are not repeated here. veloped to exploit this special structure in limiting the com-
SLAM, in its naive form, scales quadratically with the putational complexity of the SLAM algorithm.
number of landmarks in a map. For real-time implemen- Techniques aimed at improving computational eciency
tation, this scaling is potentially a substantial limitation may be characterised as being optimal or conservative.
in the use of SLAM methods. Section II surveys the Optimal algorithms aim to reduce required computation
many approaches that have been developed to reduce this while still resulting in estimates that are equal to the full-
complexity. These include linear-time state augmentation, form SLAM algorithm (as presented in Part I of this tu-
sparsication in information form, partitioned updates and torial). Conservative algorithms result in estimates which
submapping methods. A second major hurdle to overcome have larger uncertainty or covariance than the optimal re-
in implementation of SLAM methods is to correctly as- sult. Usually conservative algorithms, while less accurate,
sociate observations of landmarks with landmarks held in are computationally more ecient and therefore of value
in real implementations. Algorithms with uncertainties
Tim Bailey and Hugh Durrant-Whyte are with the Aus- or covariances less than those of the optimal solution are
tralian Centre for Field Robotics (ACFR) J04, The University of
Sydney, Sydney NSW 2006, Australia, tbailey@acfr.usyd.edu.au, termed inconsistent and are considered invalid solutions to
hugh@acfr.usyd.edu.au the SLAM (or any estimation) problem.
2
The direct approach to reducing computational complex- The augmented states are then a function of only a small
ity involves exploiting the structure of the SLAM problem number of existing states
in re-formulating the essential time and observation up-
date equations. The time-update computation can be lim- xv k
x+ m
ited using state-augmentation methods. The observation- k = (5)
update computation can be limited using a partitioned form g (xvk , zk )
of the update equations. Both these steps result in an op- The general idea of state augmentation can be applied
timal SLAM estimate with reduced computation. Refor- whenever new states are a function of a subset of existing
mulation of the standard state-space SLAM representation states
into information form allows sparsification of the resulting
information matrix to be exploited in reducing computa- x1
tion. Both optimal and conservative variations exist of x2 (6)
these sparsication algorithms. Submapping methods ex- f (x2 , q)
ploit the idea that a map can be broken up into regions P11 P12 P12 fxT2
with local coordinate systems and arranged in a hierarchi- PT12 P22 P22 fxT2
cal manner. Updates occur mostly in a local frame with fx2 P12 fx2 P22 fx2 P22 fxT2 + fq QfqT
T
a submap coordinate frame xGF , and where xR is the local Finally, submap registration can use batch validation gat-
submap with a locally referenced vehicle pose xR
v and lo- ing, so improving association robustness.
cally referenced landmarks mR as shown in Figures 1(a)
and 1(b), respectively. C. Sparsification
Fig. 2. Exact information matrix SLAM. These information matrices all represent the same optimal map estimate, but show the tradeo
between number of retained pose states and matrix sparsity. Removing pose states reduces the total matrix size but introduces dependencies
between the remaining pose and map states.
Batch gating alone is often sucient to achieve reliable IV. Environment Representation
data association: If the gate is suciently constrained, as-
sociation errors have insignicant eect [6, page 13], and Early work in SLAM assumed that the world could rea-
if a false association is made with an incorrect landmark sonably be modeled as a set of simple discrete landmarks
which is physically close to right one, then the inconsistency described by geometric primitives such as points, lines or
is minor. This may not always be valid, however, espe- circles. In more complex and unstructured environments
cially in large complex environments, and more comprehen- outdoors, underground, subseathis assumption often
sive data association mechanisms (such as multi-hypohesis does not hold.
tracking [5]) may be necessary.
A. Partial Observability and Delayed Mapping
B. Appearance Signatures
Environment modeling depends both on the complexity
Gating on geometric patterns alone is not the only av- of the environment and on the limitations of the sensing
enue for reliable data association. Many sensing modali- modality. Two common examples are sonar and vision.
ties, such as vision, provide rich information about shape, Sonar sensors typically produce accurate range measure-
colour and texture, all of which may be used to nd a corre- ments but often have large beam-width and side-lobes mak-
spondence between two data sets. For SLAM, appearance ing the bearing estimate unusable [33]. Measurements from
signatures are useful to predict a possible association, such a single camera, on the other hand, provide bearing infor-
as closing a loop, or for assisting conventional gating by mation without an accurate indication of range.
providing additional discrimination information.
Historically, appearance signatures and image similarity
metrics have been developed for indexing image databases
[45] and for recognising places in topological mapping [1],
[51]. More recently, appearance measures have been ap-
plied to detecting loops in SLAM [25], [41]. The work on
visual appearance signatures for loop detection by Newman
et al. [41] introduces two notable innovations. First, a sim-
ilarity metric is computed over a sequence of images, rather
than a single image and, second, an eigenvalue technique (a)Range only (b)Bearing only
is employed to remove common-mode similarity. This ap-
Fig. 5. Partial observation. Some sensing modalities cannot directly
proach considerably reduces the occurrence of false posi-
observe a landmark location and require observations from multiple
tives by considering only matches that are interesting or vantage points.
uncommon.
SLAM with range-only sensors [34], [35] and bearing-
C. Multi-Hypothesis Data Association only sensors [13], [3] show that a single measurement is
Multi-hypothesis data association is essential for robust insucient to constrain a landmark location. Rather it
target tracking in cluttered environments [5]. It resolves must be observed from multiple vantage points as shown
association ambiguities by generating a separate track es- in Figure 5. More precisely, a single measurement gen-
timate for each association hypothesis, creating over time erates a non-Gaussian distribution over landmark loca-
an ever branching tree of tracks. The number of tracks is tion, and multiple measurements are needed to obtain
typically limited by the available computational resources a (nearly) Gaussian estimate. Generalised distributions,
and low likelihood tracks are pruned from the hypothesis such as mixture models, permit immediate, non-delayed
tree. landmark tracking [46]. One way to obtain a Gaussian
Multi-hypothesis tracking (MHT) is also important for landmark estimate is to delay initialisation and instead ac-
robust SLAM implementation, particularly in large com- cumulate raw measurement data. To permit consistent de-
plex environments. For example in loop closure, a robot layed fusion, it is necessary to record the vehicle pose for
should ideally maintain separate hypotheses for suspected each deferred measurement. Thus, the SLAM state is aug-
loops, and also a no-loop hypothesis for cases where the mented with recorded pose estimates
perceived environment is structurally similar. While MHT xk = [xTvk , xTvk1 , . . . , xTvkn , mT ]T (15)
has been applied to mapping problems [10], it has yet to be
applied in the SLAM context. A major hurdle is the com- and the corresponding measurements are stored in an aux-
putational overhead of maintaining separate map estimates iliary list {zk , . . . , zkn }. Once sucient information over
for each hypothesis. Tractable solutions may be possible a period n has been collected, a landmark is initialised by a
using sparsication or submap methods. The FastSLAM batch update. Recorded poses that do not have any other
algorithm is inherently a multi-hypothesis solution, with associated measurements are then simply removed from the
each particle having its own map estimate. A signicant state.
attribute of the FastSLAM algorithm is its ability to per- Delayed fusion addresses far more than just partial ob-
form per-particle data association [38]. servability. It is a general concept for increasing robustness
7
D. Trajectory-Oriented SLAM
xk = [xTvk , mT ]T . (16)
that appear static for a while but are later moved, such
as chairs and parked cars. In dynamic environments, a
SLAM algorithm must somehow manage moving objects.
It can detect and ignore them; it can track them as moving
landmarks; but it must not add a moving object to the
map and assume it is stationary.
The conventional SLAM solution is highly redundant.
Landmarks can be removed from the map without loss of
consistency, and it is often possible to remove large num-
bers of landmarks with little change in convergence rate
[15]. This property has been exploited to maintain a con-
temporaneous map by removing landmarks that have be-
come obsolete due to changes in the environment [2, Sec-
tion 5.1]. To explicitly manage moving objects, Hahnel et
Fig. 7. Trajectory-based SLAM. Scans taken at each pose are aligned al. [27] implement an auxiliary identication routine, and
according to their pose estimate to form a global map. Picture cour-
tesy of [41].
then remove the dynamic information from a data scan
before sending it to their SLAM algorithm. Conversely,
Wang et al. [53] add moving objects to their estimated
sparse information form SLAM has led to a third type of state, and provide models for tracking both stationary and
trajectory-based SLAM [14], [20], [41], with sparse estima- dynamic targets. Simultaneous estimation of moving and
tion of Equation 17. stationary landmarks is very costly due to the added pre-
While trajectory SLAM has many positive characteris- dictive model. For this reason, the solution implemented
tics, these come with caveats. Most importantly, its state- in [53] rst involves a stationary SLAM update followed by
space grows unbounded with time, as does the quantity separate tracking of moving targets.
of stored measurement data. For very long-term SLAM
it will eventually become necessary to coalesce data into V. SLAM: Where To Next?
a format similar to the traditional SLAM map to bound
storage costs. The SLAM method provides a solution to the key com-
petency of mapping and localisation for any autonomous
E. Embedded Auxiliary Information robot. The past decade in particular has seen substantial
progress in our understanding of the SLAM problem and in
Trajectory-based SLAM lends itself to representing spa-
the development of ecient, consistent and robust SLAM
tially located information. Besides scan data for mapping,
algorithms. The standard state-space approach to SLAM
it is possible to associate auxiliary information with each
is now well understood and the main issues in representa-
pose; soil salinity, humidity, temperature, or terrain char-
tion, computation and association appear to be resolved.
acteristics, for example. The associated information may
The information-form of the SLAM problem has signicant
be used to assist mapping, to aid data association, or for
unexplored potential in large-scale mapping, problems in-
purposes unrelated to the mapping task, such as path plan-
volving many vehicles, and potentially in mixed environ-
ning or data gathering.
ments with sensor networks and dynamic landmarks. The
This concept of embedding auxiliary data is more dif-
delayed data fusion concept complements batch association
cult to incorporate within the traditional SLAM frame-
and iterative smoothing to improve estimation quality and
work. The SLAM state is composed of discrete landmark
robustness. Appearance and pose-based SLAM methods
locations and is ill suited to the task of representing dense
oer a radically new paradigm for mapping and location
spatial information. Nieto et al. [43] have devised a method
estimation without the need for strong geometric landmark
called DenseSLAM to permit such an embedding. As the
descriptions. These methods are opening up new directions
robot moves through the environment, auxiliary data is
and making links back to fundamental principles in robot
stored in a suitable data-structure, such as an occupancy
perception.
grid, and the region represented by each grid-cell is de-
The key challenges for SLAM are in larger and more
termined by a set of local landmarks in the SLAM map.
persuasive implementations and demonstrations. While
As the map evolves, and the landmarks move, the locality
progress has been substantial, the scale and structure of
of the grid region is shifted and warped accordingly. The
many environments is limited. The challenge now is to
result is an ability to consistently maintain spatial local-
demonstrate SLAM solutions to large problems where ro-
ity of dense auxiliary information using the sparse SLAM
botics can truly contribute: driving hundreds of kilometers
landmark estimates.
under a forest canopy or mapping a whole city without
F. Dynamic Environments recourse to GPS, and to demonstrate true autonomous lo-
calisation and mapping of structures such as the Barrier
Real world environments are not static. They contain Reef or the surface of Mars. SLAM has brought these pos-
moving objects, such as people, and temporary structures sibilities closer.
9