Sunteți pe pagina 1din 35

Review

Structural Health Monitoring


1–35

Data fusion approaches for structural Ó The Author(s) 2018


Article reuse guidelines:
sagepub.com/journals-permissions
health monitoring and system DOI: 10.1177/1475921718798769
journals.sagepub.com/home/shm
identification: Past, present, and
future

Rih-Teng Wu and Mohammad Reza Jahanshahi

Abstract
During the past decades, significant efforts have been dedicated to develop reliable methods in structural health monitor-
ing. The health assessment for the target structure of interest is achieved through the interpretation of collected data.
At the beginning of the 21st century, the rapid advances in sensor technologies and data acquisition platforms have led
to the new era of Big Data, where a huge amount of heterogeneous data are collected by a variety of sensors. The
increasing accessibility and diversity of the data resources provide new opportunities for structural health monitoring,
while the aggregation of information obtained from multiple sensors to make robust decisions remains a challenging
problem. This article presents a comprehensive review of the recent data fusion applications in structural health moni-
toring. State-of-the-art theoretical concepts and applications of data fusion in structural health monitoring are presented.
Challenges for data fusion in structural health monitoring are discussed, and a roadmap is provided for future research
in this area.

Keywords
Structural health monitoring, data fusion

Introduction The increasing availability of heterogeneous data (e.g.


displacement, acceleration, strain, global positioning
Background system (GPS), LiDAR, depth, ultrasonic, ground pene-
All civil structures are inevitably faced with aging prob- tration radar (GPR), infrared, and microwave) pro-
lems, possible excessive loading conditions, inappropri- vides more information to the scientists to deal with
ate usages, or natural hazards such as earthquake and conventional problems. As a result, how to integrate
hurricane. It is crucial to identify the current health information from multiple sources and make a more
condition of a target structure periodically and after robust decision for applications in SHM is an impor-
the strike of a natural hazard. In the recent decades, tant and yet challenging task. This article reviews state-
several researchers have put their efforts in monitoring of-the-art algorithms in data fusion that can be used in
the health state of structural systems to identify existing SHM problems.
damage in structures. A damage in a structural system
can be the result of changes in material properties, geo-
metric configuration, or boundary conditions, which
typically influences the performance of the structural
system. In 2007, Farrar and Worden1 defined structural
health monitoring (SHM) as ‘‘the process of imple- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN,
menting a damage identification strategy for aerospace, USA
civil and mechanical engineering.’’ Recent advances in
Corresponding author:
sensor and information technologies have led to the era Mohammad Reza Jahanshahi, Lyles School of Civil Engineering, Purdue
of Big Data, which have enhanced data acquisition and University, West Lafayette, IN 47907, USA.
that inevitably lead to more opportunities in SHM.2,3 Email: jahansha@purdue.edu
2 Structural Health Monitoring 00(0)

Motivation Dempster–Shafer (DS) evidential approach, fuzzy rea-


Data fusion is the integration of various types of data soning, state estimation, machine learning algorithms,
and weighted combinations are the most common data
to obtain more information from the combined data
fusion methods applied in SHM. Furthermore, chal-
instead of considering each dataset separately.4,5 It can
lenges and opportunities for using data fusion tech-
be used to generate new raw data or more informative
niques are discussed.
new data based on the original raw data. Once the new
data are generated, it is often expected to be more help-
ful in decision-making process than using the original Scope
datasets. The concept of data fusion has been widely
The organization of the rest of the article is as follow-
applied in various disciplines including but not limited
ing. In section ‘‘Data fusion techniques in SHM,’’ the
to automated target recognition, environmental moni- theoretical basis and applications of common data
toring, robotics, medical applications, quality control fusion techniques in SHM are provided. Section
of manufacturing process, and condition monitoring of ‘‘Challenges in data fusion and concluding remarks’’
complex machinery.6 For instance, in the case of traffic discusses the challenges for data fusion in SHM as well
control, the traffic state for a certain area is monitored as the roadmap for future research. In section
through the combination of acoustic, image, and other ‘‘Summary,’’ the summary is provided.
sensor data measured near the road side. Discussions
regarding the applications such as advanced traveler
information systems, automatic incident detection, Data fusion techniques in SHM
advanced driver assistance, network control, crash This section starts with the introduction of the benefits
analysis and prevention, traffic demand estimation, stemmed from data fusion, the essential data prepro-
traffic forecasting and traffic monitoring, and accurate cessing step before applying data fusion, followed by
position estimation are reviewed in Faouzi et al.7 In the the review of the popular techniques that have been
context of SHM, data fusion is adopted to enhance the used to integrate data for SHM applications. For each
decision making for the health evaluation of the struc- data fusion technique, the basic concept of the tech-
tures. However, due to the complex nature of the real- nique and its recent applications in SHM are presented.
world applications, the uncertainties and imperfections An illustrative example for data fusion in structural
must be considered during the data processing. In this response estimation and damage detection is provided
regard, data fusion helps reduce the uncertainty by at the end of this section.
increasing the information completeness.8,9 The out-
come of a decision-making process is usually essential
for risk analysis. Prior to data fusion
In general, data integration can be achieved in vari- Why data fusion? A perspective from the observability and
ous levels of fusion depending on the task being per- identifiability of system. When dealing with a problem of
formed. There are three levels of data fusion commonly interest, one would inevitably need to identify whether
used: data-level, feature-level, and decision-level.10–12 a solution to the problem exists given the available
In data-level fusion, the raw data from multiple sources information. There could be too many unknowns in
are directly combined before further process. These the system and, therefore, the augmentation of data to
raw data must have the same physical meaning (i.e. help with problem solving might be required. This leads
measuring the same physical quantities). For instance, to the concepts of observability and identifiability. The
sound signal and image data cannot be integrated at observability of a system is defined as whether the
data-level. In feature-level fusion, the input data can be states of a system can be identified by a set of measure-
heterogeneous (i.e. measuring different physical para- ments. On the contrary, the identifiability of a system is
meters). Statistical features or signatures are often defined as whether the measurements lead to unique or
extracted from the original raw data, and these features finite solutions for the parameters of the system.13
are selected or concatenated prior to further analysis. Obviously, the question of observability is indivisible
In decision-level fusion, the final assessment result is from identifiability since the state of the system is deter-
obtained by integrating the decisions, which are mined by the parameters of the system.
obtained from different data sources, through particu- In general, data fusion improves the observability
lar combination rules. and the identifiability of a system by providing addi-
In this article, the authors present a comprehensive tional information for the decision makers. For
review of popular data fusion concepts and their instance, the observability of a 2-degree-of-freedom
applications in SHM. Among all these techniques, (DOF) system is discussed under the availabilities of
data registration, Bayesian probabilistic approaches, acceleration and displacement measurements in Chatzis
Wu and Jahanshahi 3

et al.13 It is shown that the system is observable only


when the displacement measurement of the first mode
and the acceleration measurement of the second mode
are available. Moreover, it is demonstrated that using
multiple sensors with different characteristics is poten-
tially more beneficial than using one specific type of
sensor alone. A particular sensor with its own inherent
noise and Nyquist frequency may prevent the sensor
from measuring the signal outside its bandwidth. For
instance, GPS sensors for displacement measurements
typically perform better in low frequencies, while the
accelerometers perform better in high frequencies. By
exploiting the measurements from both GPS and accel-
erometers, the accuracy of displacement estimation is
enhanced since the GPS measurements implicitly
impose constraints on the displacement computed from
Figure 1. Schematic illustration of prior, likelihood, and the
acceleration measurement; hence, the drifting in displa-
posterior distribution.
cement induced by double integration is eliminated.
This addresses the underlying benefit of heterogeneous
data fusion.14,15 improves the accuracy for crack detection since a crack
that is missed in a frame would be detected in other
Preprocessing—data registration. When dealing with the frames. Instead of considering only one video frame,
aggregation of information from multiple sensors, the the crack detection results from different video frames
registration problem occurs if the sensors are not are registered into a global spatiotemporal coordinate
located at the same position.16 For instance, the same system, and the final detection result is enhanced by
target point may appear at different pixel coordinates exploiting the underlying spatiocoherence of the data.
of the images captured from two cameras at different
locations. Before further exploitation of the data, the Bayesian theory
first step is to register the sensor readings into a com-
mon frame of reference.17–19 In other words, the data The basis of Bayesian theory is originated from condi-
registration is an essential step to transform the data tional probability that expresses the probability of
obtained from multiple sensors onto a common coordi- occurrence of event x given event y, which is denoted
nate system. as P(xjy). For data fusion, Bayes’ rule gives an expres-
Although the registration techniques often work sion for the probability of a hypothesis being true given
with two-dimensional (2D) or three-dimensional (3D) some prior knowledge or observations.12,22 When it
data, they can be applied to one-dimensional (1D) data comes to SHM, a hypothesis could be some specified
as well. In general, data registration is achieved by first damage states of a structure, or whether a certain dam-
searching the best similarity between the recordings, age exists or not. An observation may come from a
and then the recordings are mapped to the common series of experiments or a set of sensor measurements.
frame of reference based on the similarity. The similar- By computing the probability of each damage state
ity can be determined by finding the maximum correla- given the available information, decision makers can
tion between the sensor readings, template matching, make better decisions about the retrofit plan for the
regression, control points, or features invariant to the structures. The fundamental equation of Bayes’ rule is
sensor location. Depending on the data type, mapping given in equation (1)
functions like bilinear, warping, or least square optimi-
zation could be employed if necessary. A survey of P(DjHi )P(Hi ) P(DjHi )P(Hi )
P(Hi jD) = =P ð1Þ
the sensor registration techniques is presented in P(D) P(DjHi )P(Hi )
i
Liggins et al.20
It is noted that data registration is not limited to the where P(Hi jD) is the posterior probability that hypoth-
transformation of the original sensor recording. The esis Hi is true given available data D. P(DjHi ) is the
registration of higher level information onto a common likelihood function, and P(Hi ) is the prior probability.
coordinate system can also be helpful for decision mak- Figure 1 illustrates the schematic relationship between
ing as well. In Chen and Jahanshahi,21 the aggregation the prior, likelihood, and the posterior distributions.
of detection results from different video frames In this example, the prior distribution is normally
4 Structural Health Monitoring 00(0)

distributed with mean 50 and standard deviation 15, judgment, which involves the choice of the likeli-
and the likelihood function is normally distributed with hood function and the prior distribution.
mean 70 and standard deviation 5. Since the variance
of prior distribution is high (i.e. not informative), the
resulting posterior distribution tends to be closer to the Considerations in model selection. The performance of
likelihood function. Bayesian-based applications highly depends on the
Assuming the data are collected independently from selection of appropriate models. This requires a thor-
n different sensors, the posterior probability of Hi given ough uncertainty formulation and assumptions made
data D1 \ D2 \ . . . \ Dn can be expressed as equation with solid engineering knowledge. Without a funda-
(2), known as the Naı̈ve Bayes rule

P(D1 \ D2 \ . . . \ Dn jHi )P(Hi ) P(D1 jHi )P(D2 jHi )(Dn jHi )P(Hi )
P(Hi jD1 \ D2 \ . . . \ Dn ) = P = P ð2Þ
P(D1 \ D2 \ . . . \ Dn jHi )P(Hi ) P(D1 jHi )P(D2 jHi )(Dn jHi )P(Hi )
i i

where Dn corresponds to the nth observation (i.e.


declaration) made by the nth sensor, and Hi is the ith mental understanding of the underlying system or phys-
hypothesis. ical phenomenon, a suitable model class will be hard to
be constructed. For instance, the prior distribution of
Prior probability. There are two types of prior probabil- the yielding strength of steel may follow a log-normal
ities P(Hi ). One is subjective (i.e. informative) prior, distribution with median and variance determined from
while the other one is objective (i.e. non-informative) experimental data.25 In general, the considerations for
prior. The subjective prior is established based on one’s choosing an appropriate model are as follows:
subjective belief or prior knowledge about the hypoth-
esis, while the objective prior is established based on lit- (a) Prefer simpler model. A complicated model may
tle or indirect information about the hypothesis. work well on the training data but fail to predict
General rules for selecting prior probability are as fol- the outcome of the new data in the future.
lows23 (a) include as many reasonable priors as possi- (b) The prior should not be absorbed into the nor-
ble, (b) eliminate unreasonable priors, (c) confirm that malizing constant (i.e. the denominator in equa-
the prior does not require information that is difficult tion (2)) when determining the model class.
to elicit, and (d) verify the capability to compute mea- (c) A more informative prior based on previous expe-
sures of robustness without much difficulty. rience is preferred.
To address the effects of priors/model selection, sen- (d) Model selection favors a model with physical
sitivity analysis is essential in Bayesian inference. The meaning. It is difficult to estimate the uncertain
performances of the inference under different reason- parameters of an empirical model prior to data
able choices of prior distribution or probability models acquisition. Consequently, the number of choices
should be investigated through sensitivity analysis.23 of prior distribution is relatively higher for an
empirical model.

Posterior probability. In general, there are two procedures After the formulation of the likelihood function and
for estimating the posterior probability P(Hi jD):23,24 the prior distribution, the posterior distribution can be
obtained from equation (1). It is shown in Beck and
(a) Parametric identification. Formulate a class of Katafygiotis26 that for globally identifiable cases, the
mathematical models for a specific system or posterior distribution has the asymptotic behavior
physical phenomenon and identify the unknown given a large amount of data27 and therefore can be
parameters in the model. In this step, the para- approximated accurately by a Gaussian distribution.
meters of interest as well as their relation to the For the general cases where the posterior may not be
problem are determined. For instance, whether a approximated by Gaussian distribution, it is often com-
building is damaged or not can be associated with putationally intensive to obtain a close form expression
the amount of reduction in the column stiffness, of the posterior distribution when the dimensionality of
or changes in the mode frequencies of the the parameter space is high. To deal with this issue,
building. sampling algorithm such as Markov Chain Monte-
(b) Model selection. Select an appropriate class of Carlo (MCMC) and Gibbs sampler28–31 are often
mathematical models for parametric identifica- adopted to generate the samples from the posterior
tion. The selection is usually determined by user’s distribution. The MCMC algorithm avoids the
Wu and Jahanshahi 5

computation of the denominator in equation (1) by monitor the health condition of building structures
exploiting the relative probability density between two under various ground excitations. Acceleration mea-
parameter vectors. For more discussion regarding surements collected from the strong ground motion
model selection, the reader is referred to Yuen.24 and the velocity response recorded from the micro-
vibrations are used to detect damage in structures
under the earthquake mode and the micro-vibration
Application. In Vanik et al.32 and Beck et al.,33 a mode, respectively. Although the performance in the
Bayesian framework is proposed for SHM. In their earthquake mode is not as good as the micro-vibration
paper, the probability that the concerned model stiff- mode, experimental results have shown that the pro-
ness parameters would be less than a fraction of the posed approach is able to rapidly determine the condi-
undamaged stiffness parameters given the available tion and location of the damage. In Zárate et al.,41 the
modal data are computed based on Bayes theorem. crack length of a steel specimen is estimated through
The uncertainty of model parameters is introduced as Bayes theorem. Simulated samples generated through
well. The damage state of the structure is evaluated MCMC algorithm and the experimental data are fused
through the probability of reduction in structural para- to derive the probability of a certain crack length at a
meters (e.g. stiffness of the structure). In Papadimitriou specific number of loading cycles. The failure probabil-
et al.,34 the safety of a bridge structure is updated with ity is defined as the probability that the crack length is
measured dynamic responses. In this study, measure- larger than a user-defined critical crack length. In
ments of ground excitation and acceleration at the mid- Gobbato et al.,42 a theoretical probability framework is
span of the bridge deck are aggregated to infer the proposed to estimate the reliability index of the compo-
change in structural reliability. In Yuen et al.,35 a site wing structure at future times. In this framework,
Bayesian approach is proposed for structural system Bayesian inference is employed to evaluate the current
identification and damage detection. By using the simu- damage state of the system and update the joint prob-
lated floor acceleration responses, the proposed ability density function (PDF) of the damage size at
approach is demonstrated to be robust against mea- various locations. After that, probabilistic models for
surement noise, and the uncertainty of the structural future aerodynamic loads and damage propagation are
parameters can be quantified. However, damage with used to predict the joint PDF of damage extents at
little severity may not be detected. Moreover, the future times. Finally, the local and global failure cri-
employed optimization algorithm is only applicable for teria are combined to compute the lower bound and
globally identifiable cases, and the identification perfor- the upper bound of the probability of system failure at
mance depends on the selection of appropriate struc- future times. In Gobbato et al.,43 an experimental vali-
tural model. In Yuen et al.,36 model updating of dation is conducted to test the proposed probability
building structures is achieved through Bayesian meth- framework as in Gobbato et al.42 Multiple crack pro-
odology with noisy incomplete modal data. The pro- pagation trajectories recorded from a series of fatigue
posed approach uses a computationally efficient experiments are employed to estimate the remaining
iterative scheme to avoid the convergence difficulties fatigue life of the specimen. In Jaramilloa et al.,44 the
faced in non-linear optimization problem. System natu- health condition of a rotating machine is monitored by
ral frequencies, mode shapes, and stiffness are esti- means of a two-stage Bayesian inference data fusion
mated through the proposed method using numerical technique. Prior to data fusion, time domain features
examples. In Muto and Beck,37 the simulated accelera- and frequency-domain features are extracted from the
tion measurements are employed to perform the system signals of interest including acceleration, current, vol-
identification of buildings. The issue of high-dimension tage, and temperature. The overall health assessment is
optimization is relieved by the transition MCMC algo- achieved by first performing local data fusion that inte-
rithm, and the simulation results indicate decent perfor- grates the information from specific components, then
mances achieved by the proposed approach. However, perform global data fusion that combines the diagnos-
the identification results could be altered significantly tics results of the specific components.
due to poor knowledge about the damping in the struc- In Rabiei and Modarres,45 recursive Bayesian frame-
ture. In Ntotsios et al.38 and Papadimitriou and work is used to update the parameters for the crack
Ntotsios,39 modal data computed from vibration mea- growth model, as well as the probability distribution of
surements are used to quantify and localize the damage the crack size and crack growth rate. Data from acous-
in bridges. It is demonstrated through a laboratory test tic emission, periodic inspection, and the empirical
that the proposed approach depends on the model crack growth model are fused to update the crack
classes, damage magnitudes, and locations, as well information for better health assessment. In Zonta
as the configuration of sensor instrumentation. In Lin et al.,46 the accuracy of the load estimation for a cable-
et al.,40 a Naı̈ve Bayes–based approach is proposed to stayed bridge is enhanced by Bayesian inference. The
6 Structural Health Monitoring 00(0)

elongation recorded by the fiber optical sensors, the principal directions. In Au and colleagues,55–57 modal
load measured by the elasto-magnetic sensors, and the parameters and structural parameters of buildings are
thermal expansion information are fused to give a bet- identified through the aggregation of the ambient accel-
ter estimation of the cable load. In Lam et al.,47 a eration measurements. Various setups of sensor instru-
Bayesian-based model updating approach is proposed mentation are considered to test the proposed
to refine the finite element model of a coupled floor approach. Comprehensive experiments show that the
slab. In this study, the ambient acceleration measure- proposed approach fails to achieve good performance
ments collected through field test are used to update in some cases of sensor placement. Also, identification
the structural model. It is demonstrated that the model performance depends on other factors such as model-
with partition walls achieves better performances in ing error and the noise level. In Au and Ni,58 a
identifying the natural frequencies and mode shapes. In frequency-domain fast Bayesian fast Fourier transform
Ramos et al.,48 Bayesian fusion is employed in the (FFT) approach is proposed for modal identification
scope of masonry structures survey. The probability through force vibration tests. By utilizing the input
distribution for the elastic modulus of the granite excitation and the acceleration measurements, data
blocks is updated by integrating the information from from simulation and field tests have demonstrated the
prior knowledge, the data from sonic test, the data robustness of the proposed approach. However,
from ultrasonic test, as well as the data from compres- the input excitation must be large enough to eliminate
sive strength test. In Yan and Tang,49 the acoustic the bias in the identification results due to the ambient
emission source in plate-like structures is localized responses of the structure. In Yan and Katafygiotis,59
based on a Bayesian approach. Multiple 2D Gaussian the stiffness parameters of shear buildings are estimated
distributions obtained from time difference data at dif- through the incorporation of transmissibility matrix,
ferent frequencies are merged to provide the final prob- random matrix and Bayes’ theorem. Based on the
ability distribution of the acoustic emission source acceleration measurements, the proposed approach
location. In Wang et al.,50 a Bayesian computational requires no evaluation of the inverse of the covariance
sensor network is proposed to perform SHM in a matrix and hence is computationally efficient.
small-scale structure. The damage in an aluminum Although the proposed method only needs the infor-
panel is mapped by a small mobile robot equipped with mation from the structural responses, the number of
vision and ultrasonic sensors. It is indicated that the independent measurements must be larger than the
fusion of vision and ultrasonic information reduces the number of independent inputs (excitations). Chen et
uncertainty in damage localization. In Sun and Betti,51 al.60 use local binary patterns61 and support vector
Bayesian inference is employed to perform damage machine (SVM)62 to autonomously detect cracks on
detection and system identification for a 4-DOF shear metallic surfaces. The inspection videos of a nuclear
type building structure. In this study, only partial accel- power plant are recorded with an underwater camera
eration measurements are used to update the posterior system, and the frames of the videos are extracted to
PDF of the structural parameters (e.g. mass, damping perform crack detection. Next, Bayes theorem is
and stiffness). Experimental results have demonstrated adopted to aggregate the information from different
that the proposed approach accurately estimates the frames. It is shown that the performance of the detec-
parameter distributions in the presence of measurement tion algorithm is significantly enhanced with data
noise and incomplete measurement data. In Sun and fusion technique. In Huang et al.,63 Bayesian system
Büyüköztürk,52 a Bayesian inference–based regulariza- identification is employed to identify the reduction in
tion approach is proposed to identify the unknown structural stiffness through the use of modal data (e.g.
traffic loads through the vibration measurements of the mode shapes and mode frequencies). The Gibbs sam-
bridge. To this end, the accelerations, strains, and dis- pling algorithm30,64 is adopted to generate samples
placement measurements are fused through a state– from the posterior distribution of the stiffness scaling
space model. Results from a 27-bar truss bridge indi- parameters. It is claimed that the proposed approach is
cate the effectiveness and robustness of the proposed capable of incorporating all sources of uncertainties
approach against noisy data. In Au,53 an efficient based on given available data. In Minson et al.,65 a
approach for modal identification is proposed to relieve Bayesian-based approach is proposed to provide one
the problem of computation time and convergence sta- unified ground motion prediction by combining the
bility encountered by the method proposed in Yuen predictions from multiple earthquake early warning
and Katafygiotis54 when the number of measured (EEW) algorithms. The aggregation of information can
DOFs increases. Although fast computation is achieved be achieved even if these algorithms do not have com-
by the proposed approach, modal identification may patible source models. Results have demonstrated that
fail for structures with no separated modes, for exam- the proposed approach can operate in real-time and
ple, a structure with close stiffness in both the two evaluate whether the prediction from these algorithms
Wu and Jahanshahi 7

Table 1. Applications using Bayesian approaches in structure damage identification/quantification.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Vanik et al.32 and Beck et al.33 Modal parameters P No validation from real structures.
Papadimitriou et al.34 Acc. measurements and P The employed structural model is
ground excitations assumed to be identical to the actual
structure.
Yuen et al.35 Acc. measurements P Identification performance depends on
the selection of appropriate structural
model, and small damage may not be
detected.
Ntotsios et al.38 Modal data computed from P P Performance depends on the sensor
acc. measurements placements, damage magnitudes and
locations, as well as the model classes
and the modeling errors.
Papadimitriou and Ntotsios39 Modal data computed from P Damage with small magnitudes may
acc. measurements not be detected.
Lin et al.40 Acc. from ambient vibration P Less accurate for cases in strong
and strong motion motions.
Zárate et al.41 Crack length measurements P P Simple polynomial assumptions are
and fatigue cycle made for stress intensity range.
Gobbato et al.42 Ultrasonic-guided waves P Damage propagation is assumed to be
driven by fatigue only.
Gobbato et al.43 Crack propagation P P Uncertainties in loadings are not
trajectories considered.
Jaramilloa et al.44 Acc., current, voltage, and P Performance depends on the selection
temperature signals of optimal sensors and engineered
features.
Rabiei and Modarres45 Crack information from P Performance depends on the
acoustic emission, empirical correlation between the features and
model and digital imaging the crack growth rate.
Yan and Tang49 PDFs of time difference data P Modeling error and measurement
at multiple frequencies noise are assumed to be Gaussian.
Wang et al.50 Image and ultrasonic data P Damage distant from the robots is
poorly localized.
Chen et al.60 Video frames P Crack detection performance depends
on the environmental conditions
during data collection.
Chen and Jahanshahi21 Video frames P The proposed crack-detection
approach requires lots of training data
and GPU computation.

PDF: probability density function; GPU: graphics processing unit; acc.: acceleration.

is a false alarm. In Chen and Jahanshahi,21 the detec- accuracy is improved by the fusion of all the individual
tion of cracks for nuclear power plant reactors is data. Tables 1 and 2 summarize the above applications
enhanced through the Naı̈ve Bayes approach. By incor- using Bayesian approaches in damage identification/
porating the spatiotemporal coherence of cracks in the quantification and system identification/response estima-
adjacent video frames, the proposed approach outper- tion, respectively. The information of input data source,
forms other algorithms based on the fusion of multiple validation approach, and limitations/concerns are
video frames. presented.
Although the Bayesian inference–based approaches
seem to be more popular in nondestructive testing,66 it
requires reasonable assumptions to work properly. In State estimation methods
Brierley et al.,67 the classical inference–based approach, In SHM, the displacement of a structural system can be
which requires fewer assumptions, is adopted to aggre- the indicative of damage extent within the structure.68
gate the data from ultrasonic and radiographic sensors to Under controlled situations, the displacement of a
conduct nondestructive inspection. The detection structure can be easily measured through linear variable
8 Structural Health Monitoring 00(0)

Table 2. Applications using Bayesian approaches in structure system identification/response estimation.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Yuen et al.35 Acc. measurements P Identification performance


depends on the selection of
appropriate structural model,
and small damage may not be
detected.
Yuen et al.36 Modal data P Structural behavior is assumed
to be well approximated by
linear dynamics.
Muto and Beck37 Acc. measurements P Poor understanding about the
damping in the structure may
lead to unsatisfactory
identification results.
Zonta et al.46 Elongation, load, and thermal P Load estimate performance
expansion coefficient depends on appropriate data
measurements calibration and prior knowledge.
Lam et al.47 Ambient acc. measurements P Performance depends on the
selection of appropriate finite
element model.
Ramos et al.48 Sonic, ultrasonic, and P Subjective judgments are made
compressive strength test about the confidence interval of
measurements different test methods.
Sun and Betti51 Acc. measurements P P Gaussian and inverse Gamma
distributions are assumed to be
the prior PDFs of structural
parameters and prediction error
covariance, respectively.
Sun and Büyüköztürk52 Acc., disp., and strain P Noise levels are assumed to be
measurements identical for different sets of
noisy sequences.
Au53 Ambient acc. measurements P P The proposed approach only
applies to the structures with
separated modes.
Au and colleagues55–57 Ambient acc. measurements P P Performance depends on the
sensor setups, modeling error,
and signal-to-noise ratio.
Au and Ni58 Acc. measurements and input P P Ambient response is not
excitations modeled in the proposed
approach and hence may lead to
bias in identification results.
Yan and Katafygiotis59 Acc. measurements P P Number of independent
response measurements must
be larger than the number of
independent excitations.
Huang et al.63 Mode shapes and mode P P Parameter identification is
frequencies assumed to be performed with
low-amplitude vibration data.
Minson et al.65 Predictions from multiple P Ground motion intensity is
EEW systems assumed to be log-normal
distributed with fixed variance.

PDF: probability density function; acc.: acceleration; EEW: earthquake early warning.

displacement transducer (LVDT) or optical sensors. not reliable because of the amplification of low-
In practice, the acquisition of displacement infor- frequency noise. Therefore, state estimation tech-
mation is much more difficult than the acceleration niques are often dedicated to give a better estima-
information due to the difficulty in finding a refer- tion of the desirable information (e.g. displacement)
ence plane for the displacement sensor. Double inte- given only limited available information (e.g.
grating the acceleration to obtain displacement is acceleration).
Wu and Jahanshahi 9

Kalman filter concepts. Kalman filter69 is a powerful


mathematical tool that can be used to predict the state
of a dynamic system in the presence of uncertainty. It
takes advantage of the information regarding the previ-
ous state of the system, and it is suitable for real-time
implementation and embedded systems.
Consider a state vector xk is governed by linear sto-
chastic differential equation

xk = Ak xk1 + Bk uk + wk ð3Þ
with a measurement

zk = Hk xk + vk ð4Þ
where variables wk and vk denote the process noise and
measurement noise, respectively. Matrix Ak defines the
Figure 2. 1D example for the best estimation between the
relationship between the state at time step k and the
predicted and the observed measurements.
state at time step k  1 in the absence of driving func-
tion uk and process noise wk . Matrix Bk defines the
optional driving function uk . In equation (4), zk is the
actual measurement acquired from sensors, while Hk xk
is the measurement one expects to get based on the esti-
mation xk (i.e. the predicted measurement).
The core concept of Kalman filter is that it aims to
find the best weight between the predicted and the
observed measurements. Assume that the predicted and
the observed measurements belong to two Gaussian
distributions.70 The best estimation between these two
measurements would be the overlapping region of these
two Gaussian distributions. Figure 2 illustrates this
concept with 1D Gaussian distributions. In this exam-
ple, the best estimated state from Kalman filter would
lie in the overlapping region between the distributions
of the predicted and the observed measurements.
During each time step, Kalman filter recursively Figure 3. Flowchart for Kalman filter. See Faragher70 for
updates the state estimation and its uncertainty. derivation in detail.
Figure 3 illustrates the recursive update process.
Xk + 1 = AXk + BFk ð7Þ
Kalman filter in structural dynamics. In the context of struc- where A and B are the matrices that define the state–
tural dynamics, consider the second-order linear equa- space transition function (see Yuen24 for derivation in
tion of motion24 detail.)
It is noted that Kalman filter is an estimator of
M€x(t) + C x_ (t) + Kx(t) = T0 F(t) ð5Þ
Bayes’ theorem. The Kalman filter estimates the poster-
where M, C, K denote the mass, damping, and stiffness ior mean and covariance of the state vector (estima-
matrices, respectively. T0 denotes the force distribution tion) conditioned on all the previous measurements.
matrix, and F denotes the excitation. In this case, state For nonlinear systems, the extended Kalman filter
vector can be defined as follows (EKF), the unscented Kalman filter (UKF), or the par-
ticle filter (PF) should come into practice instead of
 
x(t) using the Kalman filter.
X (t) = ð6Þ
x_ (t)

Consider discrete time series, equation (5) can be for- EKF. EKF has been the standard state-estimation
mulated as a state–space representation method for dealing with nonlinear systems for the last
10 Structural Health Monitoring 00(0)

several decades.71,72 Consider the state vector xk gov- state vector can be computed through optimization
erned by nonlinear stochastic differential equation techniques (e.g. minimization of the mean squared error
(MMSE)).74 However, PF method could suffer from
xk = F(xk1 , uk , wk1 ) ð8Þ the computation issue since it may require to generate
lots of samples depending on the problem of interest.
with a measurement

zk = H(xk , vk ) ð9Þ The discontinuous EKF and the discontinuous UKF. Liu
where F and H stand for the functions that define the et al.79 have shown that the EKF and UKF algorithms
nonlinearity. For a nonlinear system, the EKF applies may fail to converge because of the unobservable states
the nonlinear transformations to the mean and covar- or parameters. This problem lies in many non-smooth
iance estimates through the Taylor series expansion. In systems such as the transition between the elastic and
EKF, only the first two moments of the conditional plastic responses, the initiation of a crack, or the initia-
PDF are updated. tion of a sliding. Switching behavior upon these dis-
For the EKF to be reliable, it is only suitable for the crete events makes the state–space equations non-
systems that are almost linear in the updating time differentiable. Recently, Chatzis and colleagues80,81
intervals since the high-order terms in the Taylor series propose a D-modification version of the EKF and
expansion are neglected. Several disadvantage are UKF, named as the discontinuous EKF and the dis-
addressed in Julier and Uhlmann73 and Smyth et al.:74 continuous UKF, to address the unidentifiability in
non-smooth systems. The D-modification ensures the
(a) The linearized transformation (i.e. Taylor series unidentifiable parameters remain invariant over such
expansion) is only applicable if the error propaga- time intervals, and therefore the estimates will not
tion can be well approximated by a linear function diverge. Table 3 summarizes the comparison between
during the updating process. various Kalman filter algorithms.
(b) The Taylor series expansion requires the compu-
tation of the Jacobian matrices. This could be dif-
Application. Smyth and Wu82 propose a multi-rate
ficult to compute and may induce additional
Kalman filter approach to estimate the displacement
computation efforts.
by the aggregation of acceleration and displacement
Therefore, the EKF may be inappropriate to be measurements. The multi-rate Kalman filter is used to
applied to higher order nonlinearities as it is often account for the lower sampling rate of the displacement
encountered in civil engineering applications. sensors (e.g. LVDT, GPS) compared with the accelera-
tion sensors. According to the simulation results in a
single degree of freedom (SDOF) system, it is demon-
UKF. Unlike the EKF method, the UKF algorithm do strated that the displacement estimates of the proposed
not require the computation of Jacobian matrices.74,75 method is superior to the single-rate Kalman filter.
A set of sigma points is first selected deterministically Also, a smoothing technique is introduced to improve
from the statistics of the transformation, and the mean the accuracy of the displacement estimates. However,
and covariance estimates of the state vector can be this smoothing technique can only by applied to offline
described by the weighted combination of these sigma estimation since it requires the filtering over the entire
points. After the nonlinear transformation, the sigma sequence of the available measurements. In Tan et
points are able to estimate the posterior mean and cov- al.,83 Kalman filter–based approaches82,98 are used to
ariance accurately to the second order (third order for determine the intrastride variation in speed for human
Gaussian inputs) for any nonlinearity.73,74 Although running. Data from GPS and the inertial measurement
the UKF is capable of dealing with higher order nonli- units (IMU) are fused to estimate the speed with fluc-
nearity than the EKF, it may not work well if the error tuation induced by cyclic pedal strokes. It is shown that
distribution is not Gaussian.73 the data fusion scheme achieves better estimates com-
pared with the GPS and IMU used alone. In Chang
PF. The PF (i.e. sequential Monte-Carlo) methods is and Xiao,84 the displacement measurements from a
capable of dealing with the nonlinear system with the camera and the acceleration signals from a acceler-
state vector described as a non-Gaussian posterior ometer are integrated through the multi-rate Kalman
PDF.76–78 In general, the PF method generates samples filter with smoothing technique proposed by Smyth
randomly to approximate the posterior PDF based on and Wu.82 It is demonstrated that data fusion using
the Monte-Carlo simulation. Once the samples are Kalman filter not only estimates the velocity response
obtained, the posterior mean and the variance of the but also improves the accuracy of the estimated
Wu and Jahanshahi 11

Table 3. Comparison between various Kalman filter algorithms.

Algorithm Properties Applications

Kalman filter 1. Real-time operation Smyth and Wu,82 Tan et al.,83 Chang and Xiao,84
2. Limited to linear system Bock et al.,85 Rodger,86 Kim et al.,87 Azam et al.,88
Naets et al.,89 Huang et al.,90 and Kim and Sohn91
EKF 1. Can be used in nonlinear system Wu and Smyth,92 Erazo and Hernandez,93
2. Not suitable for system with and Smyth et al.74
higher order nonlinearity
UKF 1. Applicable for system with high Chatzi and Smyth,15 Wu and Smyth,92 Erazo
nonlinearity and Hernandez,93 Smyth et al.,74 Chatzi
2. Not appropriate for system with and Fuggini,94 Chatzis et al.,95 and
non-Gaussian error distribution Asgarieh et al.96
3. Performance depends on initial
estimates of parameters
PF 1. Applicable when state vector is Chatzi and Smyth15 and Erazo and
non-Gaussian Hernandez93
2. Computationally intensive and
hence only for offline operation
Discontinuous 1. Applicable for non-smooth Chatzis et al.80
EKF systems
2. Capable of dealing with
unobservable states or parameters
3. Requires detection of transition
events in non-smooth system.
Discontinuous 1. Applicable for non-smooth Chatzis and Chatzi81,97
UKF systems
2. Capable of dealing with
unobservable states or parameters

EKF: extended Kalman filter; UKF: unscented Kalman filter; PF: particle filter.

displacements and enlarges the frequency bandwidth of against the measurement noise than the EKF. In
the displacement estimates. In Yang et al.,99 an adap- Chatzi and Smyth,15 the non-collocated acceleration
tive EKF is proposed to detect variations in the struc- and displacement measurements are fused to determine
tural parameters. Compared to the classical least- the displacement response and the hysteretic para-
squares estimation (LSE) method, the EKF approach meters of a 3-DOF system. The performance of the
requires only acceleration measurements and therefore UKF, generic PF, and the Gaussian mixture sigma
avoids the error induced from numerical integration. point particle filter (GMSPPF)102 are compared.
However, LSE is numerically more stable and easy to According to the simulation results, the UKF and
converge than the EKF approach. In Yang et al.,100 GMSPPF are the most efficient techniques in terms of
the EKF formulation proposed in Yang et al.99 is validation with the final identified hysteretic para-
modified to deal with systems with unknown excita- meters. Also, the GMSPPF method achieves the best
tions. Although the EKF-based approach is able to performance when estimating the time invariant model
perform system identification, it may suffer computa- parameters.
tion burden if the test structure has a large number of In Bock et al.,85 the Kalman filter approach devel-
DOFs. oped in Smyth and Wu82 is employed to estimate the
In Wu and Smyth,92 the ground excitation and the displacement time history of a full-scale seven-story
acceleration measurements are integrated to estimate reinforced concrete wall building based on collocated
the vibration responses and structural parameters of a GPS and accelerometers. It is shown that the adopted
SDOF and a 2-DOF system. To this end, the nonlinear data fusion scheme are able to estimate the displace-
hysteretic Bouc–Wen system101 is introduced to model ment time series with millimeter precision. In Rodger,86
the system nonlinearity. Based on the simulation the fuzzy Kalman filter is used to reduce the failure risk
results, it is demonstrated that the UKF yields more of an integrated vehicle health maintenance system. To
accurate estimation than the EKF when the system is this end, data from accelerometer, pressure sensor, tor-
highly nonlinear. Moreover, the UKF is more robust que sensor, and liquid quality sensor are integrated
12 Structural Health Monitoring 00(0)

through fuzzy logic and state estimation. In Kim the proposed approach outperforms the method in Wu
et al.,87 a new approach based on Kalman filter is and Smyth92 with a lower estimation error and can be
developed to estimate the displacement responses from operated in real time as well.
the fusion of acceleration and displacement measure- In Erazo and Hernandez,93 the seismic-induced dam-
ments. This method considers the acceleration measure- age in buildings is evaluated on the basis of a probabil-
ment bias explicitly in the system dynamics of the istic framework. Four Bayesian filters, including the
Kalman filtering. The acceleration error is modeled as EKF, UKF, ensemble Kalman filter,106 and the PF, are
a combination of an offset bias (e.g. mechanical hyster- used to estimate the structural response as well as the
esis), installation error (e.g. misalignment), and a zero- uncertainties of estimations and their performances are
mean stochastic noise process. Based on the data col- compared. The response estimations are achieved by
lected from a small cantilever beam, it is claimed that integrating the limited acceleration measurements and
the proposed approach is superior to the methods the information of a nonlinear model. It is pointed out
developed in Smyth and Wu.82 that the EKF is the most suitable filter among all the fil-
In Azam et al.,88 a dual implementation of the ters being compared. In Chatzi and Fuggini,94 a multi-
Kalman filter is proposed to estimate the displacements rate UKF is implemented to monitor the response of a
and the velocities of a structure with only a limited high rise building with the fusion of GPS and accelera-
number of noise-contaminated acceleration measure- tion data. An artificial zero-mean white noise measure-
ments. Floor acceleration responses are fused to obtain ment is added to avoid the drifting of the displacement
the displacements, velocities, and the input forces of estimates induced by low-frequency integration errors.
the structure. It is claimed that the proposed method Through the data obtained from a densely instrumen-
eliminates the numerical problems caused by the un- ted building, it is shown that the proposed approach
observability and rank deficiency of the augmented yields accurate estimates of the structural response. In
Kalman filter.103 In addition, the drift effect in the esti- Chatzis et al.,95 the dynamic response and physical rep-
mated displacements and input force observed in the resentation of a structural system are estimated through
Gillijn and De Moor104 filter is reduced by the pro- the transformed subspace state–space system identifica-
posed method with a good prior knowledge of the cov- tion (T-SSID)107 and the UKF. The acceleration and
ariance of the unknown input force. In Naets et al.,89 displacement signals are aggregated through T-SSID
an augmented Kalman filter is proposed to circumvent and UKF separately, and the performances of the two
the divergence problem encountered in the estimation algorithms are discussed. The results indicate that T-
of input force when only the acceleration measurements SSID method is vulnerable to noise with low-frequency
are available. It is claimed that by adding the dummy content. Also, the T-SSID method is unable to operate
displacement measurements in the Kalman filter updat- under a reduced observation scheme (i.e. it requires the
ing process, the drift in the estimated response can be information from every floor in the form of either accel-
eliminated. However, the proposed approach is only eration or displacement), and T-SSID needs offline
applicable when the steady-state position of the system implementation. Although the UKF method overcomes
is known. In Cho et al.,105 the displacement response of the limitations of the T-SSID method, the variation in
a bridge is estimated through Kalman filter. The accel- the identified structural parameters for the UKF
eration and strain measurements are incorporated into method is higher. This disadvantage is resolved by the
the state–space model. The proposed approach requires fusion of acceleration and displacement measurements.
no reference points, which is necessary for the conven- In Smyth et al.,74 it is pointed out that the UKF
tional displacement transducers. In Huang et al.,90 a appears to be more robust than the EKF when dealing
Kalman filter–based approach is proposed to identify with higher order nonlinearities. The EKF fails to give
the stiffness, mass, and damping of a structure simulta- accurate estimations when encountering non-Gaussian
neously. The input force and output acceleration mea- conditional PDFs. Finally, it is commonly seen that the
surements of a structure are aggregated to estimate the state estimation approaches are incorporated within the
structural parameters in real-time operation. Although probability framework. A typical example is the appli-
this approach requires complete measurements from cation in Rabiei and Modarres45 as aforementioned. In
each model DOF, numerical simulations have demon- Asgarieh et al.,96 the UKF is used to perform the model
strated that the proposed method is robust against calibration and response estimation for a 7-DOF shear
noisy data and can be operate online. In Kim and building model. Two modeling schemes are considered,
Sohn,91 a smoothing-based Kalman filter approach is namely, the cantilever and shear building models. The
proposed to estimate the displacement time history by cantilever model outperforms the shear building model
aggregating the velocity from a laser Doppler vibrom- in the identification process. It is claimed that the fusion
eter (LDV) and the displacement from a LiDAR sen- of acceleration and displacement measurements fails to
sor. Results from a series of lab-scale tests show that enhance the response estimation in the presence of large
Wu and Jahanshahi 13

modeling error. In Chatzis et al.,80 the discontinuous sensor 2. Then, the belief in outcome A being true can
extended Kalman filter (DEKF) is proposed to deal be expressed as follows
with the divergence problem in non-smooth dynamic P
systems. The impact problem, the nonlinear hysteretic mi (Ak )mj (Ak 0 )
Ak \Ak 0 = A
Bouc–Wen model, and the elasto-plastic system are Belief (A) = mi  mj (A) = ð10Þ
selected as case studies to validate the proposed algo- 1K
rithm. It is shown that the DEKF outperforms the where
EKF in all the selected non-smooth systems. When
X
compared with the UKF algorithm, the proposed K= mi (Al )mj (Al0 ) ð11Þ
DEKF performs on a par with or better than the UKF Al \Al0 = f
in some cases. However, the good performance of UKF
may depend on the initial estimates and the noise in the and mi  mj (A) stands for the combined proposition A,
measurement signals. More recently, Chatzis and that is, the intersection of the observed hypothesis Ak
Chatzi81 have proposed the discontinuous unscented from sensor 1 and the observed hypothesis Ak 0 from sen-
Kalman filter (DUKF) to enhance the parameter esti- sor 2.
mates in non-smooth dynamic systems. Similar case
studies in Chatzis et al.80 are used to test the proposed Conflicts. Conflicts may happen when the application’s
algorithm. Results indicate that the DUKF improves frame of discernment is not well defined. If the values
the estimates and behaves more consistently than the of mass function are strongly skewed, the belief of the
standard UKF for non-smooth systems. Also, the hypothesis computed from equation (10) would be
DUKF does not require the detection of the transition unreasonable.109 There are several methods to solve the
event, which is an advantage over the DEKF algorithm. conflict problem proposed by researchers. Yager
In Chatzis and Chatzi,97 the DUKF algorithm is et al.110 proposed a term as the ground probability
applied to the identification of a rocking body subjected mass assignment, that is the DS’ basic probability mass
to ground motions. The DUKF outperforms the UKF assignment without the normalization by (1  K) (i.e.
since the latter suffers from the divergence of the fric- the denominator of equation (10)). The probability
tion coefficient during the time periods of unidentifia- mass tends to be extremely small when new conflicting
bility. Table 4 summarizes the above applications using observations are being introduced. Inagaki111 intro-
KF approaches in system identification/response esti- duced an approach to make a balance between DS’
mation. The information of input data source, valida- method and Yager’s method by means of a factor k.
tion approach, and limitations/concerns are presented.
This factor helps to reduce conflicts and make DS the-
ory more reliable. However, how to choose the optimal
DS theory value of k becomes an issue, and it is hard to be gener-
alized for different applications. In Wu,109 a dynamic
DS theory is an approach for combining degrees of
weight calibration scheme is introduced to fuse the
belief based on evidence. It is a generalized form of the
information from multiple sensors. The calibration
Bayesian theory for considering multiple unions of
scheme tends to assign larger weights to the informa-
hypothesis which are mutually exclusive.108,109
tion that are acquired more recently.
Different from probability theory, DS allows the quan-
tification of ignorance. For example, in probability the-
ory, the probabilities of the occurrences of all possible Application. Bao et al.112 employed DS theory to identify
events sum to one. In the DS theory, a specified degree the damage locations on a 3D truss structure. The glo-
of belief can be assigned to the hypothesis where it is bal stiffness matrix can be expressed as the sum of stiff-
not knowing which event would occur. The elements ness matrices of its substructures. Zervas et al.113
for mathematical formulation of the DS theory are developed a multi-sensor data fusion scheme to per-
introduced in Shafer.108 form the early detection of fire events. In this study, the
In terms of data fusion using multiple sensors, the DS theory was used to estimate the probability of fire
DS theory allows us to integrate the beliefs derived event by means of information from temperature,
from different sensors.109 Suppose that the observa- humidity, and vision sensors. In Zhao et al.,114 the
tions from two sensors are available, and the two sen- information from multiple piezoceramic sensors located
sors could have different observations. Denote mi is the at different locations of a two-story concrete frame was
mass function assigned to the observations based on fused based on DS theory. A weighted fusion damage
sensor 1, mj is the mass function assigned to the obser- index was proposed to indicate the damage states of
vations based on sensor 2, Ak is an observed hypothesis different areas of the frame. In Zhou et al.,115 the dam-
from sensor 1, and Ak 0 is an observed hypothesis from age detection of a building structure was performed by
14 Structural Health Monitoring 00(0)

Table 4. Applications using KF approaches in structure system identification/response estimation.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Smyth and Wu82 Acc. and disp. measurements P The proposed smoothing technique is
only suitable for offline estimation.
Chang and Xiao84 Disp. from imaging and acc. P Reduction in the sampling rate of the
measurements disp. from imaging may induce slight
drift in the fused disp. estimates.
Yang et al.99 Acc. measurements P The proposed EKF approach requires
considerations in numerical stability.
Yang et al.100 Acc. measurements P The proposed approach is
computation intensive if the structure
has many DOFs.
Wu and Smyth92 Ground acc. and acc. responses P The ground acc. and structural acc.
response are assumed to be
measurable.
Chatzi and Smyth15 Acc. and disp. measurements P Measurement noise is not considered.
Bock et al.85 GPS and acc. measurements P Networks of densely collocated GPS
and seismic instruments are necessary.
Kim et al.87 Acc. and disp. measurements P Performance depends on the
displacement sampling rates.
Azam et al.88 Floor acc. measurements P Performance depends on the fine-
tuning of the covariance of the
unknown input.
Naets et al.89 Acc. measurements P Only applicable when the steady-state
position of the system is available.
Cho et al.105 Acc. and strain measurements P P Performance may depend on the
amplitude of displacement response.
Huang et al.90 Acc. and force measurements P Requires complete measurements
from each model DOF.
Kim and Sohn91 Vel. and disp. measurements P Time-varying bias is not considered in
acc. measurements.
Erazo and Limited acc. measurements and output P Optimized sensor placement is not
Hernandez93 from a nonlinear model discussed for the proposed approach.
Chatzi and Fuggini94 Acc. and GPS measurements P An artificial white noise observation is
required for on-line monitoring.
Chatzis et al.95 Acc. and disp. measurements P Performance depends on the selection
of noise matrices and the assumption
of known structured model.
Asgarieh et al.96 Acc. and disp. measurements P Performance depends on reasonable
modeling assumptions.
Chatzis et al.80 Ground acc. input and output of model P Performance depends on the initial
estimates of the parameters. the initial
condition requires a pre-estimate.
Chatzis and Chatzi81 Disp. and output of model P Parameters are assumed to be
invariant during unobservable time
intervals.
Chatzis and Chatzi97 Ground acc. and disp. measurements P Parameters are assumed to be
invariant during unobservable time
intervals.

Acc.: acceleration; disp.: displacement; vel.: velocity; EKF: extended Kalman filter; DOF: degrees of freedom; GPS: global positioning system.

the combination of posterior probability SVM and DS accuracy when data were subjected to disturbance. In
theory. SVM outputs the posterior probabilities from Moosavian et al.,116 the fault diagnosis of spark plug in
the recording of multiple sensors. These probabilities an internal combustion engine was performed using the
were fused to determine whether the structure was dam- combination of artificial neural network (ANN), least
aged or not based on DS theory. Also, the performance square SVM, and the DS theory. Acoustic signals and
of data fusion at different levels was discussed. It was vibration signals were fed into the two classifiers sepa-
shown that fusion at decision level achieved the best rately to classify different types of damages. The result
Wu and Jahanshahi 15

Table 5. Applications using DS approaches in structure damage identification/quantification.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Bao et al.112 Damage probability assignment P Damage localization is assumed


functions from different datasets to be completely characterized
by modal strain energy.
Zhao et al.114 Recordings from multiple P Requires active sensing.
piezoceramic sensors
Zhou et al.115 Acc. measurements P Damage is assumed to be
characterized by wavelet
features.
Grande and Mode shapes P Requires a finite element model.
Imbimbo118
Huang and 2D images and 3D P Performance depends on the
Liu119 laser scanning selection of thresholds.
Völker and IE, US, and GPR P Damage is assumed to be fully
Shokouhi120 measurements characterized by engineered
wave features.

IE: impact-echo; US: ultrasonic pulse echo; GPR: ground penetration radar; 2D: two-dimensional; 3D three-dimensional; acc.: acceleration.

indicated that using the DS theory to fuse the results of sensors are used to perform feature-level fusion.
the two classifiers achieved better classification accu- Results demonstrate that both data fusion approaches
racy than the result of each individual classifier alone. perform better than the best single sensor, and the
In Hou and Bergmann,117 the fault diagnosis of an Hadamard product method outperforms the DS theory
induction motor was performed based on neural net- when there is no nearly zero entries in the features.
work and the DS theory. The motor stator current sig- Table 5 summarizes the above applications using DS
nal and two acceleration signals were fed into three approaches in damage identification/quantification.
neural networks separately. The outcomes of the three The information of input data source, validation
classifier were integrated by the DS theory. The analy- approach, and limitations/concerns are presented.
sis results indicated that decision-level fusion enhanced
the robustness of fault diagnosis. In Grande and
Imbimbo,118 a multi-stage data fusion approach was
Fuzzy logic
proposed to detect damage in structural systems. Local Different from classical set theory, fuzzy logic is often
damage indicators were built based on the flexibility used to address vagueness and imprecision that exists
approach, which uses the change of the flexibility of the in real-world events. It is particularly useful when the
structural element as a way to quantify damage. The boundaries between sets of categories are not well-
DS theory was employed to integrate all local decisions defined, or the mathematical formulation of the system
and make the final decision. In Huang and Liu,119 the of interest is not fully understood.12,122 For instance,
accuracy of pavement crack detection was enhanced by whether a person is tall or short depends on subjective
the combination of 2D gray-scale image analysis and judgment of the observer, and there is no valid bound-
3D laser scanning method. The 2D method may fail to ary value between the definition of tall and short.
detect the cracks in the presence of shadows and tire Therefore, fuzzy logic is applicable when the noise con-
marks, while the 3D method is unable to quantify the tent of the data is high, or the measurement of the sen-
cracks when there is no obvious depth changes in the sor is not precise enough. The key elements in fuzzy set
pavement. As a result, the 2D and 3D crack character- theory are the membership functions that define the
istics were fused based on the DS combination rules to boundaries between the fuzzy set variables. The basic
perform more accurate crack detection. However, the concept and procedure for fuzzy logic is introduced in
proposed approach is highly dependent on the selection the following section.
of the thresholds (e.g. the mass value for the 2D detec-
tion results). In Völker and Shokouhi,120 the DS theory Analysis procedure. Typically, each member in a fuzzy set
and the Hadamard product121 approach are employed consists of the variable value as well as the associated
to detect the honeycomb defects in concrete specimens. membership functions of the variable in one or more
Measurements from the impact-echo (IE), ultrasonic sets. A membership function provides a graphical rep-
pulse echo (US), and ground penetrating radar (GPR) resentation of the boundaries between each set. The
16 Structural Health Monitoring 00(0)

Table 6. Applications using fuzzy logic approaches in structure damage identification/quantification.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Jiang et al.123 Outputs of three fuzzy neural P Comprehensive training dataset


networks is required to find the optimal
weights for fusion.
Heideklang and Images from eddy current test, P Combination rules are simple
Shokouhi124 magnetic flux leakage, and and straightforward, could be
thermography inadequate for complex data.

Table 7. Applications using fuzzy logic approaches in structure system identification/response estimation.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Teng et al.128 Strain P Requires transient time history analysis on the finite element model.
measurements

values for the membership function of the variable lie neural networks (FNN) with three types of inputs: nor-
in the interval between 0 and 1, while 0 means the vari- malized damage signature index (NDSI), normalized
able is not a member of the set and 1 means the variable frequency change ratio (NFCR), and normalized mode
is a member of the set. Values between 0 and 1 indicate change ratio (NMCR). In the second stage, the results
the variable may be partially a member of one or more of FNN models were fed into a fusion center to create a
sets. For instance, the temperature values could be a new output for damage pattern recognition. This was
variable in environmental monitoring, while the sets achieved by computing the weighting of the three FNN
consist of cool, medium, warm, and hot. The member- models based on the results in the first stage, and the
ship function is then used to point out the current tem- new output was determined based on the weighted com-
perature belongs to which set. Originally, bell-shaped bination of different FNN models. It was concluded
curves have been employed to define membership func- that combining data fusion and FNN techniques would
tion. However, they are replaced with triangle or trape- perform better than using single FNN models alone. In
zoid curves in many applications for simplicity.12 A Heideklang and Shokouhi,124 the false alarm rate for
general analysis procedure is described as follows: the detection of near-surface crack was reduced based
on the fusion of images from eddy current test, mag-
(a) Define the variable of interest and its netic flux leakage, and thermography test. The image
characteristics. fusion was achieved through the fuzzy AND opera-
(b) Define the membership functions associated with tor,125,126 fuzzy OR operator, and the shift-invariant
the variable. wavelet transform.127 In Teng et al.,128 fuzzy sets and
(c) Define the production rules for aggregating the the DS theory were employed to perform stress identifi-
output from the membership functions and com- cation of two real-world structures. It was shown that
pute the fuzzy values of samples based on the pro- the identification results obtained from data fusion
duction rules. were more robust against the effect of noise than the
(d) Convert the fuzzy values to a fixed and discrete results without data fusion. In Dempsey and Sheng,129
output that can be employed to make inference the damage detection of a wind turbine gearbox was
about the characteristic of the sample. conducted through fuzzy inference. The vibration data
and the oil-debris data served as inputs for fuzzy infer-
ence, and the output damage level was determined using
Application. Jiang et al.123 combined fuzzy logic and the defined membership functions. Tables 6 and 7 sum-
neural network to detect damage in a seven-story shear- marize the above applications using fuzzy logic
beam type building model. The damage detection in the approaches in damage identification/quantification and
first stage was performed separately by the use of fuzzy system identification/response estimation, respectively.
Wu and Jahanshahi 17

The information of input data source, validation


approach, and limitations/concerns are presented.

Machine learning approaches


Among all the machine learning techniques, ANN and
SVM are the most popular techniques employed to per-
form data fusion in SHM in the past decades. Usually,
this type of fusion is achieved through feature-level
fusion. Features extracted from raw data or pre-
processed data are fed into machine learning algo-
rithms to perform regression or classification tasks.
Recently, deep learning–based approaches have gain
lots of attention due to their capability in automatically
learning meaningful feature representations from the
raw data. The input to the learning model does not Figure 4. An illustration of ANN with one hidden layer.
need to be the handcrafted features as commonly seen
in the conventional machine learning algorithms, and
the generalization of the model is achieved through the
use of large amount of data. The basic concepts of
ANN, SVM, and deep learning are introduced in the
following sections.

ANN. ANN is originally inspired by the human ner-


vous system that consists of billions of connected neu-
rons.130 It is a supervised learning method that is
widely used in various disciplines. A supervised learn-
ing means the algorithm is first provided with a train-
Figure 5. SVM illustration: the original data are projected into
ing dataset and the corresponding answers, then the a higher dimensional feature space by a kernel function where a
parameters in the network are updated (i.e. learned by hyperplane separates the data.
means of the training data). Typically, an ANN con-
sists of multiple layers including an input layer, several
hidden layers, and an output layer. In each layer, there (b) Design the network structure (i.e. the number of
are a number of neurons (i.e. nodes, connected with hidden layers and the number of nodes in each
every nodes in the previous layer). Each node in the layer).
hidden layer and the output layer can be formulated as (c) Train the network with sufficient number of train-
a weighted combination of all the nodes (including a ing samples. Usually, the parameters of the net-
bias term) in the previous layer, and the combination is work are updated by backpropagation algorithms.
further passed into a activation function to model the This process involves the optimization of a cost
nonlinearity. In ANN, all the layers are fully con- function.
nected. Figure 4 shows a typical configuration for (d) Evaluate the network using the test dataset that is
ANN with one hidden layer. The inputs xi are weighted not used during the training stage.
summed, and a bias term is added to obtain yj , and yj is
passed through the activation function f to get the out-
put of node j (i.e. zj ). The network is established by
providing a set of training samples, and its performance SVM. SVM is a supervised machine learning algorithm
is verified by new test data. A general analysis proce- as well. The fundamental concept of SVM is that it aims
dure for ANN is described as follows: to find the best hyperplane to separate the data.131,132
The original data points are projected by a kernel func-
(a) Determine the input of the network. Usually, the tion into a higher dimension feature space. An illustra-
inputs to the network are the feature representa- tive example is provided in Figure 5. In this space,
tions of the raw data. SVM tends to find the hyperplane that separates the
18 Structural Health Monitoring 00(0)

Figure 6. A sample for CNN configuration.

Figure 7. A schematic configuration for CNN.

data with the largest margin. The hyperplane is defined handcrafted features is subjective and may not contain
by a subset of data points called support vectors that lie enough information for the network to perform well.
on the margin. A general analysis procedure for SVM is For instance, the deep convolution neural network
described as follows: (CNN) leads to a great success in object classification
using large amount of image data.133,134 The spatial
(a) Determine the input of SVM. Usually, the inputs invariant features such as edges/contrast of an object
are the feature representations of the raw data. are automatically learned during the training stage.
(b) Train the SVM model with sufficient number of Figure 6 illustrates a CNN configuration with one con-
training samples. The separation plane is deter- volution layer followed by activation and one pooling
mined by the optimization of a cost function. layer. In the convolution layer, the kernel w1ij performs
(c) Evaluate the model using the test dataset that is convolution operation on the input data to generate
not used during the training stage. the first feature map, the kernel w2ij convolves with the
input data and generates second feature map. These
feature map will be transformed using activation such
Deep learning. Different from the conventional machine as RELU function,135 and the pooling layer extracts
learning algorithms, a deep learning algorithm uses a the dominant features from the feature map by apply-
deeper (larger) network to conduct the regression or ing operations such as retaining only the maximum
classification tasks. Although it requires intensive com- value in the window that slides over the feature map.
putation to establish the network from a large amount These features can be adopted to other classifiers to
of training data, the network is able to learn the high- perform learning tasks. Figure 7 shows a schematic
level representations from the raw data autonomously. configuration of CNN that consists of multiple convo-
This is particularly useful since the selection of lution and pooling layers. The convolution and pooling
Wu and Jahanshahi 19

layers extract features from the input data, and the fea-
tures are employed to determine whether there is a
damage present in the input data.
Another deep learning algorithm, named as the deep
auto-encoder (DAE),136,137 is categorized as an unsu-
pervised learning approach. The DAE algorithm is par-
ticularly useful when there is no enough training
samples to conduct supervised learning. During the
training of a DAE, the output of the network is identi-
cal with the input, and the network tries to minimize
the difference between the input and the output. Figure
8 shows a sample of DAE configuration. Usually, the
DAE consists of fully connected layers, and the output
of the DAE is set to be the same as the input.
Therefore, the DAE aims to reconstruct the input data, Figure 8. A sample for DAE configuration.
and the hidden nodes in the network become a high-
level representation of the input data. These representa-
tions can be further applied to other classifiers or clus- correlation of two images, pixel to pixel difference, and
tering methods for learning tasks. the normalized difference vegetation index (NDVI)
play an important role in the classification process. It is
claimed that the combination of SAR and optical data
Application. Liu et al.138 employ the wavelet packet
leads to better classification results than using SAR or
transformation (WPT) algorithm139,140 and ANN to
optical data alone. Longbotham et al.154 compare the
perform the detection of damage occurrence, location,
performances of different fusion algorithms, such as
and extent. Data fusion is achieved by integrating the
supervised and unsupervised methods, when dealing
first 16 components of wavelet packet relative energy
with the detection of flooded regions. Data sources in
(WPRE) computed from vibration signals. It is indi-
this application are multi-temporal and multi-modal
cated that using several ANN models performs better
images. It is shown that the supervised algorithms (e.g.
than using only one single ANN model. In Pei and col-
ANN or SVM) do not always outperform the unsuper-
leagues,141–146 the use of ANN to approximate non-
vised method (fuzzy C-mean clustering algorithm). The
linear functions in engineering mechanics is discussed.
The nonlinear restoring force is estimated through the performance of the learning algorithm is highly depen-
displacement and velocity inputs. In Pei et al.147 and dent on the input features, and the unsupervised
Pei and Masri,148 ANN is used to estimate the displace- method may be more favorable when there is no
ment and acceleration transmissibility functions as well enough training samples available in disaster scenarios.
as the restoring force of a viscous fluid damper. In In Khazaeli et al.,155 a framework of data-driven
Ghiasi et al.,149 the ANN and the least square support model for real-time SHM is proposed. Various machine
vector machine (LS-SVM)150 are employed to detect learning algorithms including Naı̈ve Bayes and
the damage location and severity. The performance of AdaBoost are employed to perform damage detection
these two algorithms is evaluated separately, and the and localization. Damage in structure is simulated by
selection of input features is optimized by using particle changes in stiffness and mass. Statistical features such
swarm harmony search (PSHS) algorithm.151,152 as mean, standard deviation, and skewness are
Similar to Liu et al.,138 data fusion is conducted by fus- extracted from the vibration time history signals. These
ing the first six components of WPRE of vibration sig- features are then selected and used to monitor the dam-
nals as well as the time-domain statistical features. It is age state of a multi-story shear building structure. In
shown that the learning model with input features opti- Khuc and Catbas,156 a new damage indicator is pro-
mized by PSHS algorithm achieves a higher accuracy, posed for bridge health monitoring based on image
and LS-SVM performs better than ANN. Stramondo data. The vehicle types and the associated moving
et al.153 use maximum-likelihood criterion to detect loads are first determined by the AdaBoost157 tech-
damage area after earthquake. Two types of remote nique and the Cascade classifier,158 then the unit influ-
sensing sensors are employed to rapidly construct the ence surface of displacements is constructed based on
damage map of urban areas after a seismic event hap- images. The experimental results from a four-span
pened. One is synthetic aperture radar (SAR) and the bridge demonstrate the effectiveness of the proposed
other one is optical satellite data. Features such as the method in the damage detection and localization of the
complex coherence of two images, the intensity bridge. In Sun et al.,159 neural network and fuzzy
20 Structural Health Monitoring 00(0)

inference are combined to evaluate the structural condi- displacement, velocity, and ground acceleration infor-
tion of a cable bridge. Heterogeneous data fusion is mation. Damage detection is achieved based on the dif-
performed by integrating the GPS displacement data ference between the prediction of ANN and the
and wind velocity data. The GPS and wind velocity measured response. Although the proposed approach
data are first fed into the neural network to generate detects the damage accurately, it remains challenging
features, and then these features serve as inputs to to correlate the quantified detection results with
fuzzy inference system to output the health index of the the visual observation in the specimen. In Mokhtari
structure. However, this method highly relies on the et al.,166 the performances of various supervised learn-
user-defined fuzzy rules. For instance, the output ing algorithms are compared for vision-based pavement
would be risky if the wind is identified as very strong crack detection. Among the decision tree, the k-nearest
and the GPS signal is identified as normal. In Mohanty neighbor (KNN), the ANN, and the adaptive neuro-
et al.,160 the corrosion cracking of a U-bend pipe is fuzzy inference system (ANFIS)167 algorithms, the
monitored by means of Bayesian Gaussian process and ANN and ANFIS not only provide superior perfor-
multiple sensor fusion. The Bayesian Gaussian process mance but also are more flexible in terms of the output
aims to find the posterior distribution of the test output format than the other two algorithms. Although the
given a random test input, available training samples, ANFIS method requires longer training time, it
and the predefined likelihood and noise functions. The achieves comparable performances with the ANN and
degree of degradation of the structure element is deter- has higher interpretability over ANN. In Vanraj and
mined through Bayesian Gaussian process, and the Pabla,168 the KNN classifier is used to perform fault
results of multiple ultrasonic sensors are fused based on diagnosis of a fixed-axis gearbox. Features extracted
principal component analysis to compute the final from the vibration and sound signals are fed into the
damage index of the element. In Osornio-Rios et al.,161 KNN classifier to discriminate the normal and faulty
the health state of a truss-type structure is investigated conditions. Results indicate that the aggregation of
by using multiple signal classification (MUSIC) ANN. vibration and sound features achieves better perfor-
The MUSIC algorithm produces a high-resolution mance than using the individual signal alone. In Völker
spectral estimation even when the noise content of the et al.,169 the detection of honeycomb in concrete speci-
signal is high. The vibration signals acquired from mul- mens is achieved through the machine learning–based
tiple sensors installed at different locations of the truss approach. Features computed from the measurements
structure are first processed by the MUSIC algorithm. of IE, US, and GPR sensors are aggregated through
The output of the MUSIC algorithm is then inserted the density-based clustering algorithm (DBSCAN).170
into a neural network to evaluate the damage location Compared to the individual evaluation methods, it is
and damage severity of the structure. In Kesavan and shown that data fusion enhances the detectability of
Kiremidjian,162 ambient vibration signals from unda- the honeycomb.
maged and damaged structures are fused to perform Recently, Cha et al.171 have employed CNN to
damage detection and quantification. Wavelet features detect cracks on concrete surfaces. The deep CNN
are extracted from the vibration signals, and the domi- achieves good performance without the use of hand-
nant features are determined through principal compo- crafted features. Results demonstrate that CNN-based
nent analysis. The k-means clustering algorithm approach is capable of dealing with real-world chal-
adopted in this study is shown to be able to character- lenges such as the varying illumination conditions.
ize the damage patterns correctly. In Safari et al.,163 the Abdeljaber et al.172 have developed a CNN-based dam-
state vector of a linear dynamic system with multiple age detection approach for a planar steel frame. In this
sensors is estimated through the combination of multi- study, the 1D acceleration signals collected from 30
ple Kalman filters and one ANN. Each sensor mea- joints of the planar frame are used to train 30 CNN
surement is fed into one individual Kalman filter, and networks. Each CNN model determines whether the
the estimates from each Kalman filter serve as inputs corresponding joint is damaged or not. It is shown that
to the ANN to determine the final state vector of the the CNN-based approach accurately detect and localize
system. Based on the simulation results, the proposed the damage, although the proposed method fails to
network yields better performance than the fusion localize the damage when the joints along the struc-
scheme proposed in Yan et al.164. In Derkevorkian et ture’s line of symmetry were damaged. In 2017, Chen
al.,165 the ANN and an ordinary differential equation and Jahanshahi21 have proposed a deep learning–based
(ODE) solver are combined to predict the relative dis- data fusion framework to detect and localize cracks on
placement and velocity time history of a 12-story con- the metallic surfaces of nuclear power plant reactors.
crete frame. The ANN is adopted to model the soil– Deep CNN is used to detect cracks in the video frames,
structure interaction by the aggregation of the and the Naı̈ve Bayes method is incorporated to account
Wu and Jahanshahi 21

for the inherent spatiotemporal coherence of cracks in integration when there are multiple decision makers.
the adjacent video frames. It is shown that the pro- The rules of voting being applied are task-oriented and
posed method outperforms other approaches with a depend on the nature of the output from individual
higher hit rate by aggregating the information from decision makers. For instance, majority voting is com-
multiple video frames. monly seen when there are multiple classifiers used to
In terms of the applications using DAE, a deep con- perform classification task. If the user prefers a model
volutional selective auto-encoder is proposed to con- to make conservative decisions, then the voting rule
duct the early detection of combustion instability.173 may be designed to be dominated by the most conser-
The hi-speed video frames of the combustor are fed vative decision made by all the decision-makers.12
into the network to determine whether the combustion Although voting-based data fusion approach is
is in the stable state or not. In Sarkar et al.,174 the straightforward in concept and easy to implement, it is
DAE is employed to characterize cracks in composite noted that the algorithm designer should be careful to
coupon specimens that frequently used in aircraft appli- avoid the presence of dictatorship during the fusion
cations. To this end, the DAE is trained by using only process. In 1950, KJ Arrow177 proposed a theorem to
the images of intact surface to reconstruct the input address the difficulty in aggregating individual’s prefer-
image. When an test image with crack is fed into the ence for social welfare system. This theorem is known
DAE, the regions with higher reconstruction error indi- as Arrow’s impossibility theorem. Without going into
cate the crack locations. In Reddy et al.,175 DAE is the mathematical details, an explanation quoted from
used to perform the fault diagnosis for the flight data. Hazelrigg178 is provided as follows: ‘‘Any constitution
Signals from 13 sensors (e.g. time, measured load, (people get together to come to a conclusion of any
ambient temperature) are concatenated to form the sort) that respect the four conditions of unanimity,
input to the DAE. The network is trained using the sig- transitivity, independence of irrelevant alternatives, and
nals from the nominal (healthy) state of the flight, and no dictator compromise a contradictory set for which
the fault diagnosis is achieved through the clustering of there is no mathematical resolution.’’ This implies that
the reconstruction errors of the 13 attributes. In Yan when the first three conditions (i.e. unanimity, transitiv-
and Yu,176 the detection of anomaly in gas turbine is ity, and independence of irrelevant alternatives) are sat-
achieved through the use of DAE. The DAE serves as isfied, there must be a dictator existing among all the
a feature extractor for the input signals of exhaust gas individuals. Consider the following example where
temperature, and the features are fed into an ANN for there are three individuals given their preference over
anomaly detection. Results demonstrate that using the three events A, B, and C
DAE learned features achieves better performance than
the handcrafted features. Tables 8 and 9 summarize the 1. A . B . C
above applications using machine learning approaches 2. B . C . A
in damage identification/quantification and system 3. C . A . B
identification/response estimation, respectively. The
information of input data source, validation approach, The objective is to find the best social choice of the
and limitations/concerns are presented. event that satisfies all the above three preferences.
According to pairwise comparison, the social choice
Weighted combination and voting would prefer A over B (from preferences of voters 1
and 3), prefer B over C (from preferences of voters 1
A straightforward and simple approach for data fusion and 2), and prefer C over A (from preferences of voters
is the weighted combination of data. This is often not 2 and 3). In this situation, the majority vote fails to give
applicable to the original data, but is achieved in the best choice to satisfy the preference of all individu-
feature-level or decision-level data fusion. The simplest als. In other words, if the majority vote choose event B
method is averaging all the data with the same physical as the best choice, it is basically ignoring voter 3 and
quantity from all the sources, which means each data hence result in dictatorship. Back to the discussion of
source has equal weight contribution. However, some- data fusion, suppose there are three models that try to
times there are several health states inferred from mul- infer the potential damage state of a structure. In this
tiple sensors with different resolutions and precisions. case, the fusion result could be unreliable if the algo-
The decision made from the sensor with less precision
rithm designer attempts to aggregate the decisions from
and confidence might be assigned smaller weight con-
the three models through majority voting scheme.
tribution prior to the fusion. For classification tasks,
the selection of appropriate thresholds is needed to
assign the predicted damage pattern. Another method Application. Lu and Michaels179 have applied feature
is voting scheme that could be applied for data and sensor fusion to detect damage in aluminum
22 Structural Health Monitoring 00(0)

Table 8. Applications using machine learning approaches in structure damage identification/quantification.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Liu et al.138 Acc. measurements P Damage assessment is assumed


to be characterized by wavelet
energies.
Ghiasi et al.149 Acc. measurements P Damage assessment is assumed
to be characterized by wavelet
energies and time-domain
statistical features.
Khazaeli et al.155 Acc. and disp. measurements P Damage is assumed to be
characterized by time-domain
statistical features.
Khuc and Catbas156 Images P Performance may depend on the
camera resolution.
Sun et al.159 GPS and wind vel. P Performance depends on user-
defined fuzzy rules.
Mohanty et al.160 Ultrasonic measurements P Damage index is determined by
the first principal component
only.
Osornio-Rios et al.161 Acc. measurements P Damage index is assumed to be
characterized by the amplitudes
of natural frequencies.
Kesavan and Kiremidjian162 Acc. measurements P The proposed approach
requires a database of baseline
measurements for optimal signal
selection.
Mokhtari et al.166 Images P ANFIS algorithm is more
interpretable but requires
longer training time than ANN.
Völker et al.169 IE, US, and GPR P Requires contact sensing.
measurements
Cha et al.171 Images P Requires sufficient training data
and computing capability.
Abdeljaber et al.172 Acc. measurements P Damage localization may fail
when damage is presented in
symmetric joints.
Chen and Jahanshahi21 Video frames P Requires sufficient training data
and computing capability.
Sarkar et al.174 Images P Detection performance depends
on the selection of threshold for
reconstruction error.
Reddy et al.175 13 time series data P Requires sufficient dataset for
(e.g. time, temperature, loads) undamaged structure.

GPS: global positioning system; vel.: velocity; acc.: acceleration; ANFIS: adaptive neuro-fuzzy inference system; ANN: artificial neural network; IE:
impact-echo; US: ultrasonic pulse echo; GPR: ground penetration radar.

specimens. To this end, different features are computed employed fusion scheme if there is no damage shown in
from the diffuse ultrasonic signals, and the overall the individual images. In Guan et al.,181 the maximum
detection performance is improved by adopting voting and the average data fusion strategy are used to post-
fusion schemes on multi-sensors. In Michaels and process the phased-array ultrasonic inspection data.
Michaels,180 images generated from guided wave signals Results indicate that the average fusion scheme pro-
are used to localize the damage in plates. Multiple duces a clearer visualization for the ultrasonic data. Bai
images are obtained by passing the wave signals et. al.182 propose an approach for detecting crack loca-
through filters with various frequency bands, and the tions in beams by obtaining the overall Katz’s fractal
images are fused at pixel-level. Although the results dimension (KFD) curve.183 It is demonstrated that the
indicate the fused image achieves better localization overall KFD curve, that is computed from averaging all
performance than the individual images, the fused the single KFD curves, leads to a more reliable results
image may fail to detect the damage under the for locating cracks compared to a single KFD curve. In
Wu and Jahanshahi 23

Table 9. Applications using machine learning approaches in structure system identification/response estimation.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Pei and colleagues141–145 Disp. and vel. measurements P Only applicable for simple ANN
configuration.
Pei et al.146 Disp. and vel. measurements P Only applicable for simple ANN
configuration.
Pei et al.147 Disp. and vel. measurements P The proposed approach is based
on heuristic methods rather
than rigorous mathematical
formulation.
Pei and Masri148 Disp. and vel. measurements P P The proposed prototype is only
adequate for specific nonlinear
function mapping.
Derkevorkian et al.165 Disp., vel., and ground acc. P Hard to correlate between the
estimated response and the
observed damage.

Disp.: displacement; vel.: velocity; ANN: artificial neural network; acc.: acceleration.

Heideklang and Shokouhi,184 the cracks on a metallic level) are investigated to find the best separability
specimen are detected through the aggregation of between the undamaged and damaged regions of a
images. The images taken from the eddy current sensor, fuselage rib. Fisher’s188 discriminant ratio and simple
the giant magnetoresistance (GMR) sensor, and the voting schemes are utilized in this study. It is found that
thermography sensor are fused at pixel-level to deter- the fusion technique with the best performance varies
mine the presence of a crack. The performance of vari- with different specimens and features selected. In
ous fusion schemes including the wavelet coefficient Salahshoor et al.,189 the fault detection and the diagno-
fusion, singular value decomposition fusion, and the sis of an industrial steam turbine are achieved based on
simple average method is compared and discussed. It is SVM and an ANFIS (i.e. the combination of ANN and
shown that the simple average method achieves the best fuzzy logic).190 The results of these two systems are
performance among all the fusion schemes being con- integrated by ordered weighted averaging operator. It
sidered. In Zhu et al.,185 the damage diagnosis for a is demonstrated that the performance of the fusion
long-span suspension bridge is conducted through the scheme is better than each individual system. In Chen
fusion of damage indices from changes in modal fre- et al.,191 the information from an RGB sensor and a
quency, eigenvalues of principal component, wavelet- depth sensor are aggregated to measure the 3D displa-
packet energy, and changes in the covariance of covar- cement time history of a moving target. The coordi-
iance matrix of accelerations. The fusion techniques nates of the desirable target points are first extracted
including the weighted average method, Bayesian the- from the color image based on the corner detection
ory, and DS theory are used and their performances are algorithm developed by Geiger et al.,192 and then they
investigated. It is claimed that the weighted average are mapped to the depth image to acquire the depth
method and the DS theory are not sensitive to the con- measurement. Once the depth and the pixel coordinates
flicting evidences and hence achieve more robust dam- are obtained, the 3D world coordinates of the target
age indicators. In Haynes and Todd,186 the damage points relative to the sensor can be computed based on
localization for complex structures is enhanced through the pinhole camera model.193 In this study, the weights
sensor fusion and statistical modeling. It is demon- for the information from the color image and the depth
strated that using maximum-likelihood estimate to image are identical. By comparing with the measure-
localize the damage may fail due to the sparsity of sen- ment of an LVDT sensor from the shaking table test, it
sor instrumentation in large structures. The localization is shown that the proposed method is able to measure
is improved by means of the Neyman–Pearson criter- the displacements accurately. Also, the effects of vari-
ion, and the contribution of each sensor pair is ous test conditions (i.e. amplitude, frequency, sampling
weighted according to the uncertainty (described by a rate, spatial distortion, and the relative motion between
Rayleigh-distributed random process) in the estimate of the target and the sensor) are comprehensively studied.
localization. In Chen et al.,194 a cost-effective autonomous data
In Haynes et al.,187 the performance of fusion tech- acquisition system is developed to detect, localize, and
niques at different levels (i.e. data, feature, and decision quantify the defects (e.g. potholes) on the road surfaces.
24 Structural Health Monitoring 00(0)

This system consists of four RGB-D cameras, a GPS the dense displacements of the bridge are computed
sensor, and two triaxial accelerometers. The RGB-D using mode shape approximation and modal expan-
cameras aim to capture features (e.g. edges, corners, or sion. Numerical and experimental results indicate a
texture) for recognizing the defects as well as compute good agreement between the estimated dense displace-
the 3D world coordinates of the target points. The GPS ments and the measured displacements. In Heideklang
sensor is used to record the location of defects and the and Shokouhi,203 a kernel density estimation–based
speed of the vehicle, while the accelerometers are approach204 is proposed to perform crack localization
employed for the alignment of the sensor platform. for ferromagnetic metals. The joint density of crack
Furthermore, the defect detection and quantification location is obtained by aggregating the partial density
method is adopted from the approach proposed in that includes only the detected crack location from
Jahanshahi et al.2 Based on several field tests, it is three individual test (i.e. eddy current test, magnetic
pointed out that the developed system is able to moni- flux leakage, and thermography test). Results have
tor the pavement surfaces more frequently. However, shown that the proposed fusion approach substantially
factors like sunlight interference, motion blur, and roll- reduces false alarms under various test conditions. In
ing shutter distortion may affect the performance of the Park et al.,205 the estimation of displacements is
proposed system. In Park et al.,195 the acceleration- enhanced by aggregating the displacements obtained
based method developed by Lee et al.196 and the strain- from time-synchronized image data and acceleration
based method proposed by Shin et al.197 are integrated measurements. Complimentary filters are used to
to estimate the displacement responses of beam-like achieve a better frequency response for the combined
structures. According to the results of the numerical displacements. It is demonstrated that the proposed
simulation and a full-scale field test, the proposed approach is able to deal with various noise level and
method is able to estimate the nonzero mean displace- signals with different frequency characteristics. Tables
ment and reduce the high-frequency noise. Moreover, 10 and 11 summarize the above applications using
there is no calibration technique required for the pro- weighting approaches in damage identification/quan-
posed method since the scaling factor for the strain– tification and system identification/response estima-
displacement relationship can be derived directly from tion, respectively. The information of input data
the power spectral densities of the estimated displace- source, validation approach, and limitations/concerns
ments from the acceleration and strain at the first mode are presented.
frequency. In Shin et al.,197 the information from the
acceleration and strain sensor are equally weighted. In
Park et al.,198 the data fusion method developed in An illustrative example for data fusion in structural
Park et al.195 is adopted in a wireless sensing network
response estimation and damage detection
to monitor the displacement of a bridge. According to
the laboratory and field tests, the proposed sensing sys- Consider a linear SDOF system
tem enables the time synchronization of the strain and
acceleration measurements and further enhances the m€x(t) + c_x(t) + kx(t) = f (t) ð12Þ
sensing precision of strain. In Sim,199 the flexibility
matrix of a simply supported beam is estimated where the mass m = 1, the damping c = 1:5, and the stiff-
through the aggregation of the acceleration and angular ness k = 1. f (t) is the ground excitation. x(t), x_ (t), and
velocity measurements. The proposed formulation is an €x(t) are the displacement, velocity, and acceleration
extension of the single measurement-based method response, respectively.
developed by Bernal.200 Results from the numerical In this example, Kalman filter is employed to esti-
simulations indicate that the proposed approach esti- mate the displacement response of the system through
mates the flexibility matrix within a reasonable error the noise-contaminated displacement measurements
tolerance. In Kim et al.,201 model updating of a simply and known ground excitation. Following the procedure
supported beam is conducted through the incorpora- described in section ‘‘Kalman filter in structural
tion of acceleration and angular velocity measurements. dynamics’’ with a discrete time step Dt = 0:005 (s), a
The identifications of elastic modulus and boundary state–space representation can be formulated as follows
conditions of the beam demonstrate the superior per-
formance of the proposed data fusion approach than Xk + 1 = AXk + Bfk ð13Þ
using acceleration alone. In Park et al.,202 the safety    
evaluation of railway bridges is achieved through the 1 0:005 1:253105
where A = , B= . The
aggregation of acceleration and strain measurements. 0:005 0:9925 0:005
The displacements at every sensor location are first esti- process noise and the measurement noise are assumed
mated using acceleration and strain information. Then, to be zero mean Gaussian white noise with covariance
Wu and Jahanshahi 25

Table 10. Applications using weighting approaches in structure damage identification/quantification.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Lu and Michaels179 Ultrasonic measurements P Additional considerations may


be required if the wetting area
of aluminum specimens is large.
Michaels and Michaels180 Images P Fusion scheme may fail if
individual images fail to detect
damage.
Bai et al.182 Operating deflection shapes P P Equal weighting is assigned to
each data due to simple
averaging.
Heideklang and Shokouhi184 Images P Equal weighting is assigned to
each data due to simple
averaging.
Zhu et al.185 Mode frequency and acc. P No cases of strong motion, only
measurements ambient vibration is considered.
Haynes and Todd186 Ultrasonic measurements P Damage pattern is assumed to
be point-like scatters.
Haynes et al.187 Ultrasonic measurements P The most appropriate statistical
feature may depend on problem
of interest.
Chen et al.194 Images, GPS and acc. P Performance may depend on
measurements environmental factors, for
example, illumination condition.
Heideklang and Shokouhi203 Images from eddy current test, P Performance depends on fusion
magnetic flux leakage, and rules.
thermography test

acc.: acceleration; GPS: global positioning system.

Table 11. Applications using weighting approaches in structure system identification/response estimation.

Literature Data source Numerically Experimentally Limitations/concerns


validated validated

Park et al.195 Acc. and strain measurements P P Heuristic approach is employed


to investigate the influence of
sensor quantity.
Park et al.198 Acc. and strain measurements P Displacement estimation is
achieved through contact
sensing.
Sim199 Acc. and angular vel. P Measurement noise is not
measurements considered.
Kim et al.201 Acc. and angular vel. P P Only concentrated point
measurements excitation is considered.
Park et al.202 Acc. and strain measurements P P Large estimation errors may
occur for beams with fixed-end
boundary condition.
Park et al.205 Image and acc. measurements P P Performance depends on the
design of complementary filters.

acc.: acceleration; vel.: velocity.

0.01. The input excitation is zero mean Gaussian white the estimated response is 0.0031, which indicates a
noise with variance 1. decent performance achieved by Kalman filter. Figure
Figure 9(a) shows the Kalman filter estimation ver- 9(b) depicts the estimated displacement response using
sus the true displacement response. The root-mean- k = 1 and k = 0:8 to simulate the undamaged and dam-
square-error (RMSE) between the true response and aged structure, respectively. The maximum absolute
26 Structural Health Monitoring 00(0)

Figure 9. An illustrative example for data fusion: (a) displacement estimation using Kalman filter and (b) damage detection using the
estimated response.

displacement of the damaged structure is 0.1360, which


is 8% larger than the undamaged structure (0:1259).
This demonstrates that the damage detection can be
achieved through the statistical measures of the discre-
pancy between the structural behavior of the damaged
and undamaged structure. Moreover, features extracted
from time-domain and frequency-domain responses can
be further fed into clustering or machine learning algo-
rithms to detect the existence of damage, providing that
sufficient training data are generated either numerically
or experimentally. As an example for damage detection,
the Kalman filter simulation is repeated 1000 times for
both the k = 1 and k = 0:8 cases. Damage detection is
achieved through the fusion of frequency-domain fea-
tures computed from the 2000 estimated displacement
responses of the undamaged and damaged structure.
The estimated response from Kalman filter is decom- Figure 10. Damage detection using SVM with frequency-
posed by the wavelet analysis with the Haar wavelet at domain features: the ROC curve of the validation data (AUC:
level 4, and the 16 wavelet energy components are fed 0.9704).
into an SVM classifier with linear kernel for damage
detection. The training data are selected based on ran-
dom permutation with 70% out of the 2000 samples, space representation is unknown, data-driven-based
and the rest of the samples (not being seen by the classi- approaches could be more favorable than the model-
fier during the training process) are used for validation. based approaches. In addition, when comparing the
Figure 10 shows the receiver operating characteristic performance of different approaches/algorithms, one
(ROC) curve of the SVM classifier tested on the valida- should consider the following aspects but not limited
tion data. The area under curve (AUC) is 0.9704, which to:
indicates a satisfactory performance in detecting the
damage. According to the ROC curve, a detection accu-  Robustness against noise;
racy of 92:43% is achieved if the user-defined tolerance  The underlying assumptions in each approach;
of false positive rate is 10%.  Algorithm’s sensitivity against any modification in
Note that in this particular example, state estimation the assumption;
approach is applicable since the state–space relation-  Computation efficiency, which points to the applic-
ship is well understood. For cases in which the state– ability in real-time practice;
Wu and Jahanshahi 27

 Whether the hyper-parameters of the algorithm are characteristics of the system. Additional test dataset is
tuned properly; also necessary to avoid overfitting in the training data-
 Any fundamental limitation of the selected set. Finally, even though the weighted combination and
algorithm; voting approaches are easy to be implemented and fast
 Algorithm’s scalability toward real-world problems. for real-time computation, how to assign appropriate
weights to each sensor or decision maker is the key
issue of these approaches. So far, most of the applica-
Challenges in data fusion and concluding tions use constant weights to aggregate information
remarks from sources. But a dynamic weight calibration scheme
As more inexpensive sensors and data acquisition sys- is necessary when dealing with the long-term health
tems come into practice, the availability of data and monitoring. Also, the selection of appropriate thresh-
various categories of data source bring many possibili- olds for making inference from the fused result is essen-
ties to SHM. Although there have been several research tial. A fixed threshold may become unreliable when the
efforts in data science, how to aggregate useful infor- input data vary a lot.
mation from data and make robust decisions still With the issues and challenges in data fusion tech-
remain challenging problems. There is no universal niques, the road map for future research is discussed as
data fusion technique that works fine in all the applica- follows. Within the recent few years, deep learning–
tions across disciplines.206 The performance of data based approaches have been quite popular across vari-
fusion approaches varies with different applications ous research fields due to the enhancement of comput-
and depends on the format and quality of the data as ing capability. However, for the deep learning–based
well as the objective of fusion. During the data acquisi- approaches to work well, it is essential to have suffi-
tion stage, the noise content in the signal may be differ- cient training data. This is sometimes intractable for
ent even if all the sensors come from the same real-world problems. For instance, the damage scenar-
manufacturer and are measuring the same physical ios for a 30-story building are hard to define, not to
quantities. Possible conflicts between types of data, the mention the difficulties in collecting the data for all the
incompleteness of data, and the lack of conciseness are damage scenarios from the real structure. Therefore,
the common problems when the data source is hetero- the learning task should be conducted in an unsuper-
geneous. Misleading information may also present in vised manner to avoid the need of large amount of
the recorded data if the sensors suffer from inappropri- labeled data. This means that the learning model
ate usage or need to be maintained. These are all should be established using only the data from the
related issues that exist in the nature of data. nominal (healthy) state of the system, and later on the
In terms of the challenges in data fusion techniques, model can be used to determine the damage scenario
the Bayesian probability framework and the state esti- based on the reconstruction performance. A typical
mation method are capable of accounting for the example is the use of DAE to diagnose the fault in
uncertainties of the parameters during the analysis pro- flight data.175 In terms of the network configuration
cess. However, these approaches often require subjec- for deep learning, the generative models like Restricted
tive decisions about the prior probabilities and model Boltzmann Machines (RBM)207 and deep belief net-
selections. The performance may vary with the selec- work208 may outperform the discriminative models if
tion of models and hence a sensitivity analysis is crucial the learning process is performed in unsupervised man-
for examination of robustness. Similar to Bayesian ner.209 The long short term memory (LSTM) unit210,211
approach, the DS theory needs a subjective choice for could also be adopted in the network and may have the
the frame of discernment. Poor selection of frame of potential to enhance the performance due to its ability
discernment and inappropriate combination rules may in learning long-term dependencies. More recently, a
lead to unreasonable fusion results. Although Fuzzy new configuration named Capsule network is proposed
logic is suitable for addressing vagueness and impreci- to enhance the performance in object recognition.212
sion between each judgment, it requires well-defined Unlike the pooling layer in CNN, which only extracts
membership functions and production rules to achieve the dominant feature based on the maximum opera-
acceptable inference. The lack of mathematical founda- tion, the routing between capsules extracts the features
tion could lead to pros and cons when the domain of more effectively. The dynamic routing mechanism
application changes. In regard to machine learning between the lower and the higher level of capsules
approaches, the performance of the learning model is allows the learning model to recognize multiple objects
highly related to the training data being used. The even if the objects overlap with each other. This could
number of training samples needs to be sufficient, and potentially enhance the performance of vision-based
the training data should be able to capture the general data fusion approaches.
28 Structural Health Monitoring 00(0)

Although machine learning–based approaches often the techniques presented in this article. For instance,
benefit from data augmentation, one should be aware Game theory could be potentially applicable for data
of the ‘‘curse of dimensionality’’213 that may occur fusion in SHM. In Bruce and Reynolds,218 Game the-
when the dimension of the data, that is, the number of ory is adopted to aggregate the hyperspectral images
variables, increases rapidly. As the number of variables for groundcover classification in precision agriculture.
increases, the complexity in algorithm grows exponen- It is expected to see more extensive research in adopt-
tially, which not only leads to the difficulty in finding ing data fusion techniques in SHM.
the optimal solution but also requires more training
data to achieve convergence. Moreover, as the dimen-
sion increases, some distance metrics such as Euclidean Summary
measure become uniform among the data samples, This article presents a comprehensive review of state-
resulting the poor performance of the similarity-based of-the-art data fusion approaches used in SHM. The
algorithms such as clustering or nearest neighbor classi- basic concepts of the current popular data fusion tech-
fication. One possible solution to this issue is to search niques are introduced, and the applications in each
the optimal solution in a small subset of the full space. fusion technique are reviewed. The recent advances in
To this end, dimensionality reduction algorithms such sensor and information technologies have led to the
as principal component analysis214 or singular value adoption of data fusion approaches in various research
decomposition215 may be helpful, but one should be fields. However, there is no ubiquitous data fusion
careful to select the subspace appropriately to consider technique that works well in all disciplines. How to
the situation where different clusters may lie in differ- aggregate information efficiently and make robust deci-
ent subspaces.216 sions remain challenging tasks in data fusion. As a
Another related issue that needs further investigation result, this article discusses about the advantages and
is the quality of data. Whether the data quality is good disadvantages of different fusion techniques used in
or not often requires subjective judgment and is task- SHM. Furthermore, the challenges of data fusion are
oriented.217 The quality metrics could be in terms of presented that can be used as a road map for future
sensor resolution, signal-to-noise ratio, correctness, research in this area.
completeness, accessibility, robustness, or timeliness.
For instance, an image that is perfect for one specific
Declaration of conflicting interests
task may be inappropriate for another. Most data
fusion applications in the past do not consider the effect The author(s) declared no potential conflicts of interest with
respect to the research, authorship, and/or publication of this
of data quality, which could be problematic if the input
article.
data contain wrong information. An example for con-
sidering the data quality is,81 where the unidentifiability
of the parameters in non-smooth systems is addressed Funding
to solve the divergence issue in parameter estimates. The author(s) received no financial support for the research,
Also, the early detection of sensor malfunction needs to authorship, and/or publication of this article.
be incorporated into the data fusion framework and
should be prior to any data processing. Although the ORCID iDs
completeness in data is usually beneficial for decision
Rih-Teng Wu https://orcid.org/0000-0002-9632-7109
making, this is not always true in certain situations. For
Mohammad Reza Jahanshahi https://orcid.org/0000-0001-
instance, in disaster response system, how to use partial 6583-3087
information to make decisions is crucial since the time
constraint is critical for saving lives. In this case, the
data fusion algorithms that are capable of operating in References
real-time are more favored, and the data transmission 1. Farrar CR and Worden K. An introduction to structural
between the sensor and the base station needs to be effi- health monitoring. Philos T R Soc A 2007; 365: 303–315.
cient. To this end, a reliable data reduction scheme are 2. Jahanshahi MR, Jazizadeh F, Masri SF, et al. Unsuper-
necessary. This may be achieved by the DAE estab- vised approach for autonomous pavement-defect detec-
lished through off-line operations, and then the DAE tion and quantification using an inexpensive depth
sensor. J Comput Civ Eng 2013; 27(6): 743–754.
automatically extracts the dominant feature representa-
3. Mizuno Y, Abe M, Fujino Y, et al. Development of inter-
tions in lower dimension for data transmission. active support system for visual inspection of bridges.
In summary, there is room for improvement of data Proc SPIE 2001; 4337: 155–166.
fusion techniques as well as the unresolved problems. 4. Hall DL and Llinas J. An introduction to multisensor
Note that the data fusion approaches are not limited to data fusion. Proc IEEE 1997; 85: 6–23.
Wu and Jahanshahi 29

5. Waltz E and Llinas J. Multisensor data fusion (vol. 685). 24. Yuen KV. Bayesian methods for structural dynamics and
Boston: Artech House, 1990. civil engineering. Singapore: John Wiley & Sons, 2010.
6. Liggins ME, Hall DL and Llinas J. Multisensor data 25. Jalayer F, Risi RD and Manfredi G. Bayesian cloud anal-
fusion: theory and practice. 2nd ed. Boca Raton, FL: ysis: efficient structural fragility assessment using linear
CRC Press, 2009. regression. B Earthq Eng 2015; 13: 1183–1203.
7. Faouzi NEE, Leung H and Kurian A. Data fusion in 26. Beck JL and Katafygiotis LS. Updating models and their
intelligent transportation systems: progress and chal- uncertainties. I: Bayesian statistical framework. J Eng
lenges: a survey. Inform Fusion 2011; 12: 4–10. Mech 1998; 124(4): 455–461.
8. Hall DL and Llinas J. Handbook of multisensor data 27. Lindley DV. Introduction to probability and statistics from
fusion. Boca Raton, FL: CRC Press, LLC, 2001. a Bayesian viewpoint. Cambridge: Cambridge University
9. Lopez I and Sarigul-Klijn N. A review of uncertainty in Press, 1965.
flight vehicle structural damage monitoring, diagnosis 28. Beck JL and Au SK. Bayesian updating of structural
and control: challenges and opportunities. Prog Aerosp models and reliability using Markov chain Monte Carlo
Sci 2010; 46: 247–273. simulation. J Eng Mech 2002; 128(4): 380–391.
10. Llinas J and Hall DL. An introduction to multi-sensor 29. Ching J, Muto M and Beck JL. Bayesian linear structural
data fusion. In: Proceedings of the international sympo- model updating using Gibbs sampler with modal data.
sium on circuits and systems, Monterey, CA, 31 May–3 In: Proceedings of the 9th international conference on
June 1998. New York: IEEE. structural safety and reliability, Rome (Italy), 19–23 June
11. Hu J. Data fusion: a first step in decision informatics. Ann 2005, pp. 2609–2616. Rotterdam: Millpress.
Arbor: ProQuest/UMI Dissertation Publishing, 2011. 30. Ching J, Muto M and Beck JL. Structural model updat-
12. Klein LA. Sensor and data fusion: a tool for information ing and health monitoring with incomplete modal data
assessment and decision making. 2nd ed. Bellingham, using Gibbs sampler. Comput-Aided Civ Inf 2006; 21(4):
Washington: SPIE Press, 2012. 242–257.
13. Chatzis MN, Chatzi EN and Smyth AW. On the obser- 31. Beck JL. Bayesian system identification based on prob-
vability and identifiability of nonlinear structural and ability logic. Struct Control Hlth 2010; 17: 825–847.
mechanical systems. Struct Control Hlth 2015; 22(3): 32. Vanik MW, Beck JL and Au SK. Bayesian probabilistic
574–593. approach to structural health monitoring. J Eng Mech
14. Kogan MG, Kim WY, Bock Y, et al. Load response on 2000; 126(7): 738–745.
a large suspension bridge during the NYC marathon 33. Beck JL, Au SK and Vanik MW. Monitoring structural
revealed by GPS and accelerometers. Seismol Res Lett health using a probabilistic measure. Comput-Aided Civ
2008; 79(1): 12–19. Inf 2001; 16: 1–11.
15. Chatzi EN and Smyth AW. The unscented Kalman filter 34. Papadimitriou C, Beck JL and Katafygiotis LS. Updat-
and particle filter methods for nonlinear structural system ing robust reliability using structural test data. Probabilist
identification with non-collocated heterogeneous sensing. Eng Mech 2001; 16: 103–113.
Struct Control Hlth 2009; 16: 99–123. 35. Yuen KV, Au SK and Beck JL. Two-stage structural
16. Freedman DD and Smyton PA. Overview of data fusion health monitoring approach for phase I benchmark stud-
activities. In: Proceedings of the American control confer- ies. J Eng Mech 2004; 130(1): 16–33.
ence, San Francisco, CA, 2–4 June 1993. New York: 36. Yuen KV, Beck JL and Katafygiotis LS. Efficient model
IEEE. updating and health monitoring methodology using
17. Gonzalez RC and Woods RE. Digital image processing. incomplete modal data without mode matching. Struct
Reading, MA: Addison-Wesley, 1993, p. 302. Control Hlth 2006; 13: 91–107.
18. Brown LG. A survey of image registration techniques. 37. Muto M and Beck JL. Bayesian updating and model class
ACM Comput Surv 1992; 24(4): 325–376. selection for hysteretic structural models using stochastic
19. Liu Z, Forsyth DS, Komorowski JP, et al. Survey: state simulation. J Vib Control 2006; 14(1–2): 7–34.
of the art in NDE data fusion techniques. Instrumentation 38. Ntotsios E, Papadimitriou C, Panetsos P, et al. Bridge
and Measurement 2007; 56(6): 2435–2451. health monitoring system based on vibration measure-
20. Liggins ME, Hall DL and Llinas J. Handbook of multi- ments. B Earthq Eng 2009; 7: 469–483.
sensor data fusion: theory and practice. Boca Raton, FL: 39. Papadimitriou C and Ntotsios E. Bayesian methodology
CRC Press, 2017. for structural damage identification and reliability assess-
21. Chen FC and Jahanshahi MR. NB-CNN: deep learning- ment. In: Proceedings of the 3rd international operational
based crack detection using convolutional neural network modal analysis conference, Portonovo, Ancona, Italy, 4–6
and Naı̈ve Bayes data fusion. IEEE T Ind Electron 2018; May 2009. Starrylink Editrice.
65: 4392–4400. 40. Lin TK, Yu LC, Ku CH, et al. Implementation of a bio-
22. Bernardo J and Smith A. Bayesian theory. Chichester: inspired two-mode structural health monitoring system.
Wiley, 1994. Smart Struct Syst 2011; 8(1): 119–137.
23. Ghosh JK, Delampady M and Samanta T. An introduc- 41. Zárate BA, Caicedo JM, Yu J, et al. Bayesian model
tion to Bayesian analysis theory and methods. New York: updating and prognosis of fatigue crack growth. Eng
Springer, 2006. Struct 2012; 45: 53–61.
30 Structural Health Monitoring 00(0)

42. Gobbato M, Conte JP, Kosmatka JB, et al. A reliability- 58. Au SK and Ni YC. Fast Bayesian modal identification of
based framework for fatigue damage prognosis of com- structures using known single-input forced vibration data.
posite aircraft structures. Probabilist Eng Mech 2012; 29: Struct Control Hlth 2014; 21: 381–402.
176–188. 59. Yan WJ and Katafygiotis LS. Application of transmissi-
43. Gobbato M, Kosmatka JB and Conte JP. A recursive bility matrix and random matrix to Bayesian system iden-
Bayesian approach for fatigue damage prognosis: an tification with response measurements only. Smart Mater
experimental validation at the reliability component level. Struct 2016; 25: 105017.
Mech Syst Signal Pr 2014; 45: 448–467. 60. Chen FC, Jahanshahi MR and Wu RT. A texture-based
44. Jaramilloa VH, Ottewilla JR, Dudek R, et al. Condition video processing methodology using Bayesian data fusion
monitoring of distributed systems using two-stage Baye- for autonomous crack detection on metallic surfaces.
sian inference data fusion. Mech Syst Signal Pr 2016; 87: Comput-Aided Civ Inf 2017; 32: 271–287.
91–110. 61. Ojala T, Pietikäinen M and Mäenpää T. Multiresolution
45. Rabiei M and Modarres M. A recursive Bayesian frame- gray-scale and rotation invariant texture classification
work for structural health management using online mon- with local binary patterns. IEEE T Pattern Anal 2002;
itoring and periodic inspections. Reliab Eng Syst Safe 24(7): 971–987.
2013; 112: 154–164. 62. Cortes C and Vapnik V. Support-vector networks. Mach
46. Zonta D, Bruschetta F, Zandonini R, et al. Sensor fusion Learn 1995; 20(3): 273–297.
on structural monitoring data analysis: application to a 63. Huang Y, Beck JL and Li H. Bayesian system identifica-
cable-stayed bridge. Key Eng Mater 2013; 569–570: tion based on hierarchical sparse Bayesian learning and
812–819. Gibbs sampling with application to structural damage
47. Lam HF, Peng HY and Au SK. Development of a practi- assessment. Comput Method Appl M 2017; 318: 382–411.
cal algorithm for Bayesian model updating of a coupled 64. Geman S and Geman D. Stochastic relaxation, Gibbs dis-
slab system utilizing field test data. Eng Struct 2014; 79: tribution and the Bayesian restoration of images. IEEE T
182–194. Pattern Anal 1984; 6: 721–741.
48. Ramos LF, Miranda T, Mishra M, et al. A Bayesian 65. Minson SE, Wu S, Beck JL, et al. Combining multiple
approach for NDT data fusion: the Saint Torcato church earthquake models in real time for earthquake early
case study. Eng Struct 2015; 84: 120–129. warning. B Seismol Soc Am 2017; 107(4): 1868–1882.
49. Yan G and Tang J. A Bayesian approach for localization 66. Gross XE. NDT data fusion. Butterworth-Heinemann:
of acoustic emission source in plate-like structures. Math Oxford, 1997.
Probl Eng 2015; 2015: 247839. 67. Brierley N, Tippetts T and Cawley P. Data fusion for
50. Wang W, Joshi A, Tirpankar N, et al. Bayesian computa- automated non-destructive inspection. P Roy Soc Lond
tional sensor networks: small-scale structural health mon- A: Math 2014; 470(2167): 20140167.
itoring. Procedia Comput Sci 2015; 51: 2603–2612. 68. BSSC. NEHRP guidelines for the seismic rehabilitation of
51. Sun H and Betti R. A hybrid optimization algorithm buildings (FEMA-273). Washington, DC: FEMA, 1997.
with Bayesian inference for probabilistic model updating. 69. Kalman RE. A new approach to linear filtering and pre-
Comput-Aided Civ Inf 2015; 30: 602–619. diction problems. J Basic Eng 1960; 82(1): 35–45.
52. Sun H and Büyüköztürk O. Identification of traffic- 70. Faragher R. Understanding the basis of the Kalman filter
induced nodal excitations of truss bridges through het- via a simple and intuitive derivation. IEEE Signal Proc
erogeneous data fusion. Smart Mater Struct 2015; 24: Mag 2012; 29(5): 128–132.
075032. 71. Jazwinski AH. Stochastic processes and filtering theory.
53. Au SK. Fast Bayesian FFT method for ambient modal San Diego, CA: Academic Publishing, 1970.
identification with separated modes. J Eng Mech 2011; 72. Sorenson HW. Kalman filtering: theory and application.
137(3): 214–226. Piscataway, NJ: IEEE, 1985.
54. Yuen KV and Katafygiotis LS. Bayesian fast Fourier 73. Julier SJ and Uhlmann JK. Unscented filtering and non-
transform approach for modal updating using ambient linear estimation. Proc IEEE 2004; 92(3): 401–422.
data. Adv Struct Eng 2003; 6(2): 81–95. 74. Smyth AW, Kontoroupi T and Brewick PT. Efficient
55. Au SK and Zhang FL. Fast Bayesian ambient modal data fusion and practical considerations for structural
identification incorporating multiple setups. J Eng Mech identification. CISM Int Cent Mech Sci 2016; 567: 35–49.
2012; 138(7): 800–815. 75. Julier SJ and Uhlmann JK. A new extension of the Kal-
56. Zhang FL, Au SK and Lam HF. Assessing uncertainty in man filter to nonlinear systems. In: Proceedings of the
operational modal analysis incorporating multiple setups 11th international symposium on aerospace/defense sen-
using a Bayesian approach. Struct Control Hlth 2015; 22: sing, simulation and controls, Orlando, FL, April 21–25,
395–416. 1997, pp. 182–193 Bellingham, Washington: SPIE press.
57. Zhang FL and Au SK. Fundamental two-stage formula- 76. Moral PD. Non linear filtering: interacting particle solu-
tion for Bayesian system identification, part II: applica- tion. Markov Process Relat 1996; 2(4): 555–580.
tion to ambient vibration data. Mech Syst Signal Pr 2016; 77. Liu JS and Chen R. Sequential Monte Carlo methods for
66–67: 43–61. dynamic systems. J Am Stat Assoc 1998; 93(443): 1032–1044.
Wu and Jahanshahi 31

78. Khan T, Ramuhalli P and Dass SC. Particle-filter-based 94. Chatzi EN and Fuggini C. Structural identification of a
multisensor fusion for solving low-frequency electromag- super-tall tower by GPS and accelerometer data fusion
netic NDE inverse problems. IEEE T Instrum Meas 2011; using a multi-rate Kalman filter. In: Alfred Strauss, Dan
60(6): 2142–2153. Frangopol, Konrad Bergmeister (eds) Life-cycle and sus-
79. Liu PT, Li F and Xiao H. A state decoupling approach to tainability of civil infrastructure systems: proceedings of
estimate unobservable tracking systems. IEEE J Oceanic the 3rd international symposium on life-cycle civil engineer-
Eng 1996; 21(3): 256–259. ing. Boca Raton, FL: CRC Press 2012.
80. Chatzis MN, Chatzi EN and Triantafyllou S. A discon- 95. Chatzis MN, Chatzi EN and Smyth AW. An experimen-
tinuous extended Kalman filter for non-smooth dynamic tal validation of time domain system identification meth-
problems. Mech Syst Signal Pr 2017; 92: 13–29. ods with fusion of heterogeneous data. Earthq Eng Struct
81. Chatzis MN and Chatzi EN. A discontinuous unscented D 2015; 44: 523–547.
Kalman filter for non-smooth dynamic problems. Front 96. Asgarieh E, Moaveni B, Barbosa AR, et al. Nonlinear
Built Environ 2017; 3(56): 1–15. model calibration of a shear wall building using time and
82. Smyth A and Wu M. Multi-rate Kalman filtering for the frequency data features. Mech Syst Signal Pr 2017; 85:
data fusion of displacement and acceleration response 236–251.
measurements in dynamic system monitoring. Mech Syst 97. Chatzis MN and Chatzi EN. Online Bayesian identifica-
Signal Pr 2007; 21(2): 706–723. tion of non-smooth systems. Procedia Eng 2017; 199:
83. Tan H, Wilson AM and Lowe J. Measurement of stride 918–923.
parameters using a wearable GPS and inertial measure- 98. Caron F, Duflos E, Pomorski E, et al. GPS/IMU data
ment unit. J Biomech 2008; 41(7): 1398–1406. fusion using mutli-sensor Kalman filtering: introduction
84. Chang CC and Xiao XH. Accurate displacement mea- of contextual aspects. Inform Fusion 2006; 7: 221–230.
surement from fusion of vision-based displacement and 99. Yang JN, Lin S, Huang H, et al. An adaptive extended
acceleration with Kalman filter. In: Proceedings of the Kalman filter for structural damage identification. Struct
20th international conference on adaptive structures and Control Hlth 2006; 13(4): 849–867.
technologies, Shatin, Hong Kong, 20–22 October 2009. 100. Yang JN, Pan S and Huang H. An adaptive extended
85. Bock Y, Melgar D and Crowell BW. Real-time strong- Kalman filter for structural damage identifications II:
motion broadband displacements from collocated GPS unknown inputs. Struct Control Hlth 2007; 14(3):
and accelerometers. B Seismol Soc Am 2011; 101(6): 497–521.
2904–2925. 101. Wen YK. Method for random vibration of hysteretic
86. Rodger JA. Toward reducing failure risk in an integrated systems. J Eng Mech Div 1976; 102(2): 249–263.
vehicle health maintenance system: a fuzzy multi-sensor 102. Van der Merwe R and Wan E. Gaussian mixture sigma-
data fusion Kalman filter approach for IVHMS. Expert point particle filters for sequential probabilistic inference
Syst Appl 2012; 39: 9821–9836. in dynamic state-space models. In: Proceedings of the
87. Kim J, Kim K and Sohn H. Autonomous dynamic dis- international conference on acoustics, speech, and signal
placement estimation from data fusion of acceleration processing, Hong Kong, China, 6–10 April 2003. New
and intermittent displacement measurements. Mech Syst York: IEEE.
Signal Pr 2014; 42: 194–205. 103. Lourens E, Reynders E, De Roeck G, et al. An augmen-
88. Azam SE, Chatzi E and Papadimitriou C. A dual Kalman ted Kalman filter for force identification in structural
filter approach for state estimation via output-only accel- dynamics. Mech Syst Signal Pr 2012; 27: 446–460.
eration measurements. Mech Syst Signal Pr 2015; 60–61: 104. Gillijns S and De Moor B. Unbiased minimum-variance
866–886. input and state estimation for linear discrete-time sys-
89. Naets F, Cuadrado J and Desmet W. Stable force identi- tems with direct feedthrough. Automatica 2007; 43:
fication in structural dynamics using Kalman filtering 934–937.
and dummy-measurements. Mech Syst Signal Pr 2015; 105. Cho S, Park JW, Palanisamy RP, et al. Reference-free
50–51: 235–248. displacement estimation of bridges using Kalman filter-
90. Huang XH, Dyke S, Sun Z, et al. Simultaneous identifica- based multimetric data fusion. J Sensors 2016; 201:
tion of stiffness, mass, and damping using an on-line model 3791856.
updating approach. Struct Control Hlth 2017; 24: e1892. 106. Evensen G. The ensemble Kalman filter: theoretical for-
91. Kim K and Sohn H. Dynamic displacement estimation mulation and practical implementation. Ocean Dyn
by fusing LDV and LiDAR measurements via smoothing 2003; 53(4): 343–367.
based Kalman filtering. Mech Syst Signal Pr 2017; 82: 107. Van OP and De MB. Subspace algorithms for the iden-
339–355. tification of combined deterministic-stochastic systems.
92. Wu M and Smyth AW. Application of the unscented Kal- Automatica 1994; 30(1): 75–93.
man filter for real-time nonlinear structural system identi- 108. Shafer G. A mathematical theory of evidence. Princeton,
fication. Struct Control Hlth 2007; 14: 971–990. NJ: Princeton University Press, 1976.
93. Erazo K and Hernandez EM. Bayesian model–data 109. Wu H. Sensor data fusion for context-aware computing
fusion for mechanistic postearthquake damage assess- using Dempster-Shafer theory. PhD Thesis, The
ment of building structures. J Eng Mech 2016; 142(9): Robotics Institute, Carnegie Mellon University, Pitts-
04016062. burgh, PA, 2003.
32 Structural Health Monitoring 00(0)

110. Yager RR, Kacprzyk J and Fedrizzi M. Advances in the 129. Dempsey PJ and Sheng S. Investigation of data fusion
Dempster-Shafer theory of evidence. Hoboken, NJ: John applied to health monitoring of wind turbine drivetrain
Wiley & Sons, 1994. components. Wind Energy 2013; 16: 479–489.
111. Inagaki T. Interdependence between safety-control pol- 130. Basheer IA and Hajmeer M. Artificial neural networks:
icy and multiple-sensor schemes via Dempster-Shafer fundamentals, computing, design, and application. J
theory. IEEE T Reliab 1991; 40(2): 182–188. Microbiol Meth 2000; 43(1): 3–31.
112. Bao Y, Li H, An Y, et al. Dempster-Shafer evidence 131. Boser BE, Guyon I and Vapnik V. A training algorithm
theory approach to structural damage detection. Struct for optimal margin classifiers. In: Proceedings of the 5th
Health Monit 2011; 11(1): 13–26. annual workshop on computational learning theory, Pitts-
113. Zervas E, Mpimpoudis A, Anagnostopoulos C, et al. burgh, PA, 27–29 July 1992, pp. 144–152. New York:
Multisensor data fusion for fire detection. Inform Fusion ACM.
2011; 12: 150–159. 132. Vapnik VN. The nature of statistical learning theory.
114. Zhao X, Wang R, Gu H, et al. Innovative data fusion New York: Springer-Verlag, 1995.
enabled structural health monitoring approach. Math 133. LeCun Y, Bengio Y and Hinton G. Deep learning.
Probl Eng 2014; 2014: 369540. Nature 2015; 521: 436–444.
115. Zhou Q, Zhou H, Zhou Q, et al. Structural damage 134. Krizhevsky A, Sutskever I and Hinton GE. ImageNet
detection based on posteriori probability support vector classification with deep convolutional neural networks.
machine and Dempster-Shafer evidence theory. Appl In: Proceedings of the 26th annual conference on neural
Soft Comput 2015; 36: 368–374. information processing systems, 3–6 December 2012,
116. Moosavian A, Khazaee M, Najafi G, et al. Spark plug Lake Tahoe, Nevada: NIPS 2012; 1106–1114.
fault recognition based on sensor fusion and classifier 135. Hahnloser R, Sarpeshkar R, Mahowald MA, et al.
combination using Dempster-Shafer evidence theory. Digital selection and analogue amplification coexist in a
Appl Acoust 2015; 93: 120–129. cortex-inspired silicon circuit. Nature 2000; 405:
117. Hou L and Bergmann NW. Novel industrial wireless sen- 947–951.
sor networks for machine condition monitoring and fault 136. Hinton G and Salakhutdinov R. Reducing the dimen-
diagnosis. IEEE T Instrum Meas 2012; 61(10): 2787–2798. sionality of data with neural networks. Science 2006:
118. Grande E and Imbimbo M. A multi-stage approach for 313: 504–507.
damage detection in structural systems based on flexibil- 137. Bengio Y, Lamblin P, Popovici D, et al. Greedy layer-
ity. Mech Syst Signal Pr 2016; 76–77: 455–475. wise training of deep networks. In Proceedings of
119. Huang J and Liu W. A pavement crack detection advances in neural information processing systems 19,
method combining 2D with 3D information based on 4–7 December 2006, British Columbia, Canada: Adv
Dempster-Shafer theory. Comput-Aided Civ Inf 2014; Neur In 2007; 19(153): 153–160.
29: 299–313. 138. Liu YY, Ju YF, Duan CD, et al. Structure damage diag-
120. Völker C and Shokouhi P. Multi sensor data fusion nosis using neural network and feature fusion. Eng Appl
approach for automatic honeycomb detection in con- Artif Intell 2011; 24(1): 87–92.
crete. NDT E Int 2015; 71: 54–60. 139. Ogden R. Essential wavelets for statistical applications
121. Zhang F. The Schur complement and its applications. and data analysis. Boston, MA: Birkhäuser, 1997.
New York: Springer, 2005. 140. Coifman RR and Wickerhauser M. Entropy-based algo-
122. Zadeh LA. Fuzzy logic. IEEE Comput 1988; 21: 83–93. rithms for best basis selection. IEEE T Inform Theory
123. Jiang SF, Zhang CM and Zhang S. Two-stage struc- 1992; 38: 713–718.
tural damage detection using fuzzy neural networks 141. Pei JS, Wright JP and Smyth AW. Neural network initi-
and data fusion techniques. Expert Syst Appl 2011; 38: alization with prototypes—a case study in function
511–519. approximation. In: Proceedings of the international joint
124. Heideklang R and Shokouhi P. Multi-sensor image conference on neural networks, Montreal, QC, Canada,
fusion at signal level for improved near-surface crack 31 July–4 August 2005. New York: IEEE.
detection. NDT E Int 2015; 71: 16–22. 142. Pei JS and Smyth AW. New approach to designing mul-
125. Lou X and Loparo KA. Bearing fault diagnosis based tilayer feedforward neural network architecture for
on wavelet transform and fuzzy inference. Mech Syst modeling nonlinear restoring forces. I: formulation. J
Signal Pr 2004; 18: 1077–1095. Eng Mech 2006; 132(12): 1290–1300.
126. Sugeno M. Fuzzy measures and fuzzy integrals: a sur- 143. Pei JS and Smyth AW. New approach to designing mul-
vey. In: Gupta MM, Saridis GN and Gains BR (eds) tilayer feedforward neural network architecture for
Fuzzy automata and decision processes. Amsterdam: modeling nonlinear restoring forces. II: applications. J
North Holland Publishing Company, 1977, pp. 89–102. Eng Mech 2006; 132(12): 1301–1312.
127. Amolins K, Zhang Y and Dare P. Wavelet based image 144. Pei JS, Mai EC, Wright JP, et al. Neural network
fusion techniques: an introduction, review and compari- initialization with prototypes—function approxima-
son. ISPRS J Photogramm 2007; 62: 249–263. tion in engineering mechanics applications. In: Pro-
128. Teng J, Lu W, Wen R, et al. Instrumentation on struc- ceedings of the international joint conference on neural
tural health monitoring systems to real world structures. networks, Orlando, FL, 12–17 August 2007. New
Smart Struct Syst 2015; 15(1): 151–167. York: IEEE.
Wu and Jahanshahi 33

145. Pei JS and Mai EC. Constructing multilayer feedfor- 159. Sun D, Lee VC and Lu Y. An intelligent data fusion
ward neural networks to approximate nonlinear func- framework for structural health monitoring. In: Pro-
tions in engineering mechanics applications. J Appl ceedings of the 11th conference on industrial electronics
Mech 2008; 75: 061002. and applications, Hefei, China, 5–7 June 2016. New
146. Pei JS, Wright JP, Masri SF, et al. Toward constructive York: IEEE.
methods for sigmoidal neural networks—function 160. Mohanty S, Jagielo B, Bhan CB, et al. Online stress cor-
approximation in engineering mechanics applications. rosion crack monitoring in nuclear reactor components
In: Proceedings of the international joint conference on using active ultrasonic sensor networks and nonlinear
neural networks, San Jose, CA, 31 July–5 August 2011. system identification: data fusion based big data analy-
New York: IEEE. tics approach. In: Proceedings of the pressure vessels and
147. Pei JS, Mai EC, Wright JP, et al. Mapping some basic piping conference, Boston, MA, 19–23 July 2015. New
functions and operations to multilayer feedforward York: IEEE.
neural networks for modeling nonlinear dynamical sys- 161. Osornio-Rios RA, Amezquita-Sanchez JP, Romero-
tems and beyond. Nonlinear Dynam 2013; 71: 371–399. Troncoso RJ, et al. MUSIC-ANN analysis for locating
148. Pei JS and Masri SF. Demonstration and validation of structural damages in a truss-type structure by means of
constructive initialization method for neural networks vibrations. Comput-Aided Civ Inf 2012; 27: 687–698.
to approximate nonlinear functions in engineering 162. Kesavan KN and Kiremidjian AS. A wavelet-based
mechanics applications. Nonlinear Dynam 2015; 79: damage diagnosis algorithm using principal component
2099–2119. analysis. Struct Control Hlth 2012; 19: 672–685.
149. Ghiasi R, Torkzadeh P and Noori M. Structural dam- 163. Safari S, Shabani F and Simon D. Multirate multisensor
age detection using artificial neural networks and least data fusion for linear systems using Kalman filters and a
square support vector machine with particle swarm har- neural network. Aerosp Sci Technol 2014; 39: 465–471.
mony search algorithm. Int J Sustain Mater Struct Syst 164. Yan LP, Liu BS and Zhou DH. Multirate multisensor
2014; 1(4): 303–320. data fusion for linear systems using Kalman filters and
150. Suykens JA, Brabanter JD, Lukas L, et al. Weighted a neural network. Aerosp Sci Technol 2006; 39: 465–471.
least squares support vector machines: robustness and 165. Derkevorkian A, Hernandez-Garcia M, Yun HB, et al.
sparse approximation. Neurocomputing 2002; 48: Nonlinear data-driven computational models for
85–105. response prediction and change detection. Struct Con-
151. Eberhart RC and Kennedy J. A new optimizer using trol Hlth 2015; 22: 273–288.
particle swarm theory. In: Proceedings of the 6th interna- 166. Mokhtari S, Wu L and Yun HB. Comparison of super-
tional symposium on micro machine and human science, vised classification techniques for vision-based pave-
Nagoya, Japan, 4–6 October 1995, pp. 39–43. New ment crack detection. J Transp Res Board 2016; 2595:
York: IEEE. 119–127.
152. Geem ZW, Kim JH and Loganathan GV. A new heuris- 167. Terzi S. Modeling for pavement roughness using the
tic optimization algorithm: harmony search. Simulation ANFIS approach. Adv Eng Softw 2013; 57: 59–64.
2001; 76(2): 60–68. 168. Vanraj SD and Pabla B. Hybrid data fusion approach
153. Stramondo S, Bignami C, Chini M, et al. Satellite radar for fault diagnosis of fixed-axis gearbox. Struct Health
and optical remote sensing for earthquake damage Monit 2017; 17: 936–945.
detection: results from different case studies. Int J 169. Völker C, Kruschwitz S, Boller C, et al. Feasibility
Remote Sens 2006; 27(20): 4433–4447. study on adapting a machine learning based multi-
154. Longbotham N, Glenn T, Zare A, et al. Multi-modal sensor data fusion approach for honeycomb detection
change detection, application to the detection of flooded in concrete. In: Proceedings of the NDE/NDT for high-
areas: outcome of the 2009–2010 data fusion contest. ways & bridges: structural materials technology, Port-
IEEE J Sel Top Appl 2012; 5(1): 1–12. land, OR: Analytical Sciences. 29–31 August 2016.
155. Khazaeli S, Ravandi AG, Banerji S, et al. The applica- 170. Ester M, Kriegel HP, Sander J, et al. A density-based
tion of data mining and cloud computing techniques in algorithm for discovering clusters in large spatial data-
data-driven models for structural health monitoring. bases with noise. In: Proceedings of the 2nd international
Proc SPIE 2016; 98052M pp. 1-18. conference on knowledge discovery and data mining
156. Khuc T and Catbas FN. Structural identification using (KDD-96), 1996, https://www.aaai.org/Papers/KDD/
computer vision–based bridge health monitoring. J 1996/KDD96-037.pdf
Struct Eng: ASCE 2018; 144(2): 04017202. 171. Cha YJ, Choi W and Büyüköztürk O. Deep learning-
157. Freund Y and Schapire RE. A decision-theoretic gener- based crack damage detection using convolutional
alization of on-line learning and an application to boost- neural networks. Comput-Aided Civ Inf 2017; 32:
ing. J Comput Syst Sci 1997; 55(1): 119–139. 361–378.
158. Viola P and Jones M. Rapid object detection using a 172. Abdeljaber O, Avci O, Kiranyaz S, et al. Real-time
boosted cascade of simple features. In: Proceedings of vibration-based structural damage detection using one-
the computer vision and pattern recognition, Kauai, HI, dimensional convolutional neural networks. J Sound
8–14 December 2001, p. 511. New York: IEEE. Vib 2017; 388: 154–170.
34 Structural Health Monitoring 00(0)

173. Akintayo A, Lore KG, Sarkar S, et al. Prognostics of 188. Fisher RA. The use of multiple measurements in taxo-
combustion instabilities from hi-speed flame video using nomic problems. Ann Eugenic 1936; 7: 111–132.
a deep convolutional selective autoencoder. Int J Progn 189. Salahshoor K, Kordestani M and Khoshro MS. Fault
Health Manag 2016; 7: 1–14. detection and diagnosis of an industrial steam turbine
174. Sarkar S, Reddy KK, Giering M, et al. Deep learning using fusion of SVM (support vector machine) and
for structural health monitoring: a damage characteriza- ANFIS (adaptive neuro-fuzzy inference system) classi-
tion application. In: Proceedings of the annual conference fiers. Energy 2010; 35: 5472–5482.
of the prognosis and health management society, October 190. Jang JS. ANFIS: adaptive network-based fuzzy infer-
3–6, 2016, Denver, Colorado: phmsociety 2016. ence systems. IEEE T Syst Man Cyb 1993; 23(3):
175. Reddy KK, Sarkar S, Venugopalan V, et al. Anomaly 665–685.
detection and fault disambiguation in large flight data: a 191. Chen YL, Abdelbarr M, Jahanshahi MR, et al. Color
multi-modal deep auto-encoder approach. In: Proceed- and depth data fusion using an RGB-D sensor for inex-
ings of the annual conference of the prognosis and health pensive and contactless dynamic displacement-field mea-
management society, 2016, https://pdfs.semanticscholar.- surement. Struct Control Hlth 2017; 24: e2000.
org/4386/72b86456cbdc959848a9cece34b79fec6cb9.pdf 192. Geiger A, Moosmann F, Car O, et al. Automatic cam-
176. Yan W and Yu L. On accurate and reliable anomaly era and range sensor calibration using a single shot. In:
detection for gas turbine combustors: a deep learning Proceedings of the international conference on robotics
approach. In: Proceedings of the annual conference of the and automation, Saint Paul, MN, 14–18 May 2012, pp.
prognosis and health management society, 2015, https:// 3936–3943. New York: IEEE.
www.phmsociety.org/sites/phmsociety.org/files/ 193. Bradski G and Kaehler A. Learning OpenCV: computer
phm_submission/2015/phmc_15_025.pdf vision with the OpenCV library. Sebastopol, CA: OReilly
177. Arrow KJ. A difficulty in the concept of social welfare. J Media Inc, 2008.
Polit Econ 1950; 58(4): 328–346. 194. Chen YL, Jahanshahi MR, Manjunatha P, et al. Inex-
178. Hazelrigg GA. Fundamentals of decision making for engi- pensive multimodal sensor fusion system for autono-
neering design and systems engineering. NEILS CORP mous data acquisition of road surface conditions. IEEE
2012. Sens J 2016; 16(21): 7731–7743.
179. Lu Y and Michaels JE. Feature extraction and sensor 195. Park JW, Sim SH and Jung HJ. Displacement estima-
fusion for ultrasonic structural health monitoring under tion using multimetric data fusion. IEEE/ASME T
changing environmental conditions. IEEE Sens J 2009; Mechatron 2013; 18(6): 1675–1682.
9(11): 1462–1471. 196. Lee HS, Hong YH and Park HW. Design of an FIR fil-
180. Michaels JE and Michaels TE. Guided wave signal pro- ter for the displacement reconstruction using measured
cessing and image fusion for in situ damage localization acceleration in low-frequency dominant structures. Int J
in plates. Wave Motion 2007; 44: 482–492. Numer Method Eng 2010; 82(4): 403–434.
181. Guan X, Zhang J, Zhou SK, et al. Post-processing of 197. Shin S, Lee SU, Kim Y, et al. Estimation of bridge dis-
phased-array ultrasonic inspection data with parallel placement responses using FBG sensors and theoretical
computing for nondestructive evaluation. J Nondestruct mode shapes. Struct Eng Mech 2012; 42(2): 229–245.
Eval 2014; 33(3): 342–351. 198. Park JW, Sim SH and Jung HJ. Wireless displacement
182. Bai R, Song X, Radzieński M, et al. Crack location in sensing system for bridges using multi-sensor fusion.
beams by data fusion of fractal dimension features of Smart Mater Struct 2014; 23: 045022.
laser-measured operating deflection shapes. Smart 199. Sim SH. Estimation of flexibility matrix of beam struc-
Struct Syst 2014; 13(6): 975–991. tures using multisensor fusion. J Struct Integr Maint
183. Katz M. Fractals and the analysis of waveforms. Com- 2016; 1(2): 60–64.
put Biol Med 1988; 18(3): 145–156. 200. Bernal D. Extracting flexibility matrices from state-
184. Heideklang R and Shokouhi P. Application of data space realizations. COST F3 Conf 2000; 1: 127–135.
fusion in nondestructive testing (NDT). In: Proceedings 201. Kim H, Cho S and Sim SH. Data fusion of acceleration
of 16th international conference on information fusion, and angular velocity for improved model updating.
Istanbul, Turkey, 9–12 July 2013. New York: IEEE. Measurement 2016; 91: 239–250.
185. Zhu J, Wang L and Jiang H. Study on damage diagno- 202. Park JW, Lee KC, Sim SH, et al. Traffic safety evalua-
sis indices fusion for long-span suspension bridges under tion for railway bridges using expanded multisensor data
ambient vibration. In: Hong Hao, Chunwei Zhang (eds) fusion. Comput-Aided Civ Inf 2016; 31: 749–760.
Mechanics of structures and materials: advancements and 203. Heideklang R and Shokouhi P. Decision-level fusion of
challenges. Milton Park, UK: Taylor & Francis 2016. spatially scattered multi-modal data for nondestructive
186. Haynes C and Todd M. Enhanced damage localization inspection of surface defects. Sensors 2016; 16: 105.
for complex structures through statistical modeling and 204. Parzen E. On estimation of a probability density func-
sensor fusion. Mech Syst Signal Pr 2015; 54–55: 195–209. tion and mode. Ann Math Stat 1962; 33: 1065–1076.
187. Haynes C, Todd MD, Flynn E, et al. Statistically-based 205. Park JW, Moon D, Yoon H, et al. Visual–inertial displa-
damage detection in geometrically-complex structures cement sensing using data fusion of vision-based displa-
using ultrasonic interrogation. Struct Health Monit cement with acceleration. Struct Control Hlth 2018; 25:
2012; 12(2): 141–152. e2122.
Wu and Jahanshahi 35

206. Khaleghi B, Khamis A, Karray FO, et al. Multisensor papers.nips.cc/paper/6975-dynamic-routing-between-cap


data fusion: a review of the state-of-the-art. Inform sules.pdf
Fusion 2013; 14(1): 28–44. 213. Bellman RE. Dynamic programming. Princeton, NJ:
207. Hinton GE. A practical guide to training restricted Boltz- Princeton University Press, 1957.
mann machines. Department of Computer Science, Uni- 214. Wold S, Esbensen K and Geladi P. Principal component
versity of Toronto 2010. analysis. Chemometr Intell Lab 1987; 2(1–3): 37–52.
208. Tamilselvan P and Wang P. Failure diagnosis using deep 215. Golub GH and Reinsch C. Singular value decomposi-
belief learning based health state classification. Reliab tion and least squares solutions. Numer Math 1970;
Eng Syst Safe 2013; 115: 124–135. 14(5): 403–420.
209. Ng AY and Jordan MI. On discriminative versus gen- 216. Steinbach M, Ertöz L and Kumar V. The challenges of
erative classifiers: a comparison of logistic regression clustering high dimensional data. In: Wille L.T. (eds)
and Naı̈ve Bayes. Neural Inform Process Syst 2001; 14: New Directions in Statistical Physics Berlin, Heidelberg:
605–610. Springer 2004; 273–309.
210. Hochreiter S and Schmidhuber J. Long short term mem- 217. Batini C and Scannapieco M. Data and information qual-
ory. Neural Comput 1997; 9(8): 1735–1780. ity. Principles and Techniques, New York: Springer,
211. Malhotra P, Ramakrishnan A, Anand G, et al. LSTM- 2016.
based encoder-decoder for multi-sensor anomaly detec- 218. Bruce LM and Reynolds D. Game theory based
tion. In: Proceedings of the ICML anomaly detection data fusion for precision agriculture applications. In:
workshop, 2016, https://arxiv.org/pdf/1607.00148.pdf Proceedings of the international geoscience and remote
212. Sabour S, Frosst N and Hinton GE. Dynamic routing sensing symposium, Beijing, China, 10–15 July 2016.
between capsules. In: Proceedings of the 31st conference New York: IEEE.
on neural information processing systems, 2017, https://

S-ar putea să vă placă și