Sunteți pe pagina 1din 16

Structural Safety 72 (2018) 1–16

Contents lists available at ScienceDirect

Structural Safety
journal homepage: www.elsevier.com/locate/strusafe

A machine learning framework for assessing post-earthquake structural


safety
Yu Zhang ⇑, Henry V. Burton, Han Sun, Mehrdad Shokrabadi

a r t i c l e i n f o a b s t r a c t

Article history: A machine learning framework is presented to assess post-earthquake structural safety. The concepts of
Received 26 March 2017 response and damage patterns are introduced and incorporated into a systematic methodology for gen-
Received in revised form 29 November 2017 erating a robust dataset for any damaged building. Incremental dynamic analysis using sequential ground
Accepted 4 December 2017
motions is used to evaluate the residual collapse capacity of the damaged structure. Machine learning
algorithms are used to map response and damage patterns to the structural safety state (safe or unsafe
to occupy) of the building based on an acceptable threshold of residual collapse capacity. Predictive mod-
Keywords:
els including classification and regression tree and Random Forests are used to probabilistically identify
Machine learning
Random forests
the structural safety state of an earthquake-damaged building. The proposed framework is applied to a
Damage pattern 4-story reinforced concrete special moment frame building. Distinct yet partially overlapping response
Post-earthquake structural safety and damage patterns are found for the damaged building classified as safe and unsafe. High prediction
Performance-based assessment accuracies of 91% and 88% are achieved when the safety state is assessed using response and damage pat-
Seismic resilience terns respectively. The proposed framework could be used to rapidly evaluate whether a damaged build-
ing remains structurally safe to occupy after a seismic event and can be implemented as a subroutine in
community resilience evaluation or building lifecycle performance assessment and optimization.
Ó 2017 Elsevier Ltd. All rights reserved.

1. Introduction seismic resilience of a residential community, Burton et al. [12]


proposed a set of building-level limit states, which are explicitly
Assessing building structural and nonstructural component- linked to post-earthquake recovery and functionality. Fragility
level damage is a key step in the performance-based earthquake curves were developed to link ground shaking intensity to the
engineering (PBEE) framework [1–8]. Component damage charac- probability of exceedance of these limit states, which include func-
terization serves as the link between building structural response tional loss, unsafe to occupy, demolition and collapse. Building-
and the performance metrics (death, dollars and downtime) that level limit states have also been used for lifecycle seismic perfor-
are relevant to end-users. More recently, the importance of mance assessment and optimization for structures, where retrofit
building-level limit states has been highlighted due to the growing strategies, repair cost, time and salvage value are considered [13].
emphasis on quantifying seismic resilience at the individual infras- Post-earthquake structural safety is key to determining
tructure and community scales [9,10]. Iervolino [11] defined post- whether a damaged building is safe to re-occupy, which is one of
earthquake building-level limit states based on the level of func- the key pieces of information needed by stakeholders immediately
tionality (relative to before the event) that a damaged structure after an earthquake. ATC-20 [14,15] provides guidelines for post-
could support following an earthquake. These limit states were earthquake visual inspection to rapidly evaluate building struc-
used to model the probabilistic recovery of functionality using tural safety and assign corresponding green, yellow and red tags
state- and time-dependent Markov Chains. In the assessment of to buildings that are deemed safe to occupy, occupiable with
restrictions and unsafe to occupy respectively. It has been widely
used after U.S. earthquakes such as Loma Prieta, Landers, North-
Abbreviations: CART, Classification and regression tree; CP, Complexity param- ridge [15] and Hawaii [16] and adapted for use in many other
eter; DM, Damage measure; EDP, Engineering demand parameter; IDA, Incremental countries around the world. Building on the work of Porter et al.
dynamic analysis; NRHA, Nonlinear response history analysis; PBEE, Performance-
based earthquake engineering; RC, Reinforced concrete; ROC, Receiver operating
[7], Mitrani-Reiser [17] used fragility curves to map continuous
characteristic; SMF, Special moment frame. engineering demand parameters (EDPs) to discrete component-
⇑ Corresponding author. level damage states that have similar descriptions to the ones used
E-mail address: y.zhang@ucla.edu (Y. Zhang).

https://doi.org/10.1016/j.strusafe.2017.12.001
0167-4730/Ó 2017 Elsevier Ltd. All rights reserved.
2 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

in ATC-20. A ‘‘virtual inspector” was then used to probabilistically including classification and regression tree (CART) and Random
estimate building safety and assign corresponding tags based on Forests are used to build predictive models, which can probabilis-
the criteria in the first two tiers of an ATC-20 evaluation. The tically identify the post-earthquake structural safety state of the
Mitrani-Reiser methodology was adopted in FEMA P58 [18,19] building based on its residual collapse capacity, given any unique
using a slightly different approach to link component-level damage response or damage pattern. To illustrate the overall methodology,
to the likelihood that a building will be assigned an unsafe placard. a case study is conducted using a 4-story RC SMF building. Several
For each structural and non-structural element, the median frac- applications are envisioned for the proposed framework. The
tion of components (based on the total number in the building or model can be embedded in an electronic tool that can be used to
a single story) in a particular damage state is estimated. The dam- supplement the judgement of field inspectors conducting post-
aged building is assigned an unsafe placard if any of these values earthquake building safety assessments. Observations of the distri-
exceed a pre-defined triggering ratio. For example, the unsafe plac- bution of component-level damage (or damage pattern) can serve
ard for steel special concentrically braced frames with wide flange as inputs into the model, which will provide probabilistic predic-
braces could be triggered by any of the three scenarios: more that tions of the safety state based on the reduced collapse capacity.
60% of the components are in damage state 2 (brace has lost signif- For buildings instrumented to record, process and transmit struc-
icant axial capacity), more than 40% of the components are in dam- tural response demands, the machine learning algorithm can be
age state 3 (brace and gusset are severely damaged with significant used to provide preliminary rapid estimates of the safety state of
loss in stiffness and resistance), and more than 20% of the compo- the building. The methodology can also be used to generate fragi-
nents are in damage state 4 (brace or gusset have fractured). It is lity curves for the ‘‘unsafe to occupy” building-level limit state,
important to note that the triggering ratios in FEMA-P58 are lar- which can be incorporated into building or community resilience
gely based on judgement and are not explicitly linked to the reduc- and lifecycle performance assessments and optimization.
tion in collapse safety of the damaged building.
Incremental dynamic analysis (IDA) [20–22] has been widely 2. Post-earthquake structural safety assessment
used to probabilistically assess the most critical building-level
limit state of collapse. More recently, the reduction in the collapse 2.1. Overview of methodology
capacity of mainshock damaged buildings has been used as a met-
ric for assessing the post-earthquake structural safety and occupi- A schematic overview of the methodology used to assess post-
ability of damaged buildings. IDAs are performed using sequential earthquake structural safety is shown in Fig. 1. Starting with an
ground motions to quantify the reduction in collapse capacity. intact structure, five distinct yet fully integrated steps are used
Maffei et al. [23,24] proposed four post-earthquake occupiability to illustrate the assessment framework. The outcome of this
criteria, which differ based on the metric used to quantify the assessment is the predicted structural safety state conditioned
reduction in collapse safety of the damaged building. Yeo and Cor- on the structural response demands (from instrumentation)
nell [25] used the time-varying aftershock hazard at a given site to and/or available observed physical damage (through field
compute an equivalent constant collapse rate, which decreases inspections).
with time after the occurrence of the mainshock. The time- The first step describes the process of using a set of ‘‘damaging”
varying tagging scenarios are established based on the evolving ground motions to create samples of the damaged structure from
collapse risk in the aftershock environment. The time-dependent which response and damage patterns will be extracted. The
tag could be changed from red to yellow and even green as time response patterns or distribution of EDPs is obtained directly from
elapses. In the Maffei et al. and Yeo and Cornell studies, no direc- NRHA. Subjecting the intact structure to a single damaging ground
tion link was made between component-level damage and the motion scaled to a specific spectral intensity will produce a single
safety state of the building. Raghunandan et al. [26] quantified distinct response pattern. Multiple response patterns with differ-
the increase in vulnerability to collapse of mainshock-damaged ent levels and distributions of response demands are obtained by
modern ductile reinforced concrete (RC) special moment frame using a suite of damaging ground motions scaled to incrementally
(SMF) buildings. They also evaluated the extent to which different increasing spectral intensities. Damage patterns are simulated
system- (transient and residual story drifts) and component-level using structural component damage fragility functions which
damage indicators (e.g. beam and column plastic rotation) can relate local EDPs to the probability of exceeding a given damage
serve as a proxy for the reduced collapse capacity. Single variable state. A single damage pattern is described by each structural com-
linear regression was used to link individual damage indicators ponent assigned a single discrete damage state. Monte Carlo Sim-
to the residual collapse capacity. Burton and Deierlein [27] ulation is used to generate multiple damage patterns for a single
extended the Mitrani-Reiser and FEMA P58 approach by explicitly ground motion and spectral intensity. More details on generating
linking the component-level damage ratios that trigger an unsafe the response and damage patterns and their relationship to the
placard to the increase in collapse risk of the damaged building. safety state of the building are provided in Section 2.3.
However, the interaction between damage indicators was not con- The collapse capacity of the damaged structure is assessed
sidered in either of these two (Raghunandan et al., and Burton and through the application of IDAs using sequential ground motions
Deierlein) studies. in the second step (Step 2). Each damaging record, which is used
Presented in this paper is a novel approach to assessing post- as the first ground motion in the sequence, is followed by an IDA
earthquake structural safety. Central to the newly proposed using a set of ‘‘collapsing” ground motions. The median first-
methodology is a machine learning framework for mapping build- mode spectral acceleration corresponding to the collapse point
ing response and observable damage patterns to the residual col-
ð^
Sacol;DMG Þ is used as the measure of residual collapse safety of the
lapse capacity of the structure, which is used as the criterion for
damaged building. In the third step, the collapse capacity of the
assessing its safety state. The term ‘‘response pattern” is used to
intact structure is assessed by conducting single-record IDAs using
describe the distribution of peak global (e.g. residual and transient
the collapsing ground motions. The median collapse capacity
drifts) and local response (component deformations) demands
obtained from nonlinear response history analysis (NRHA). Simi- ð^
Sacol;INT Þ is also used as the measure of collapse safety for the intact
larly, the term ‘‘damage pattern” describes the distribution of structure. Note that the dispersion or log-standard deviation of the
observable states of physical damage to key structural components collapse capacities, bcol;DMG and bcol;INT , are also obtained but not
obtained from damage simulation. Machine learning algorithms directly used. The ratio of ^
Sacol;DMG to ^
Sacol;INT ðjÞ is used as a quan-
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 3

Fig. 1. Schematic of post-earthquake structural safety assessment methodology.

titative measure of the increased collapse vulnerability or the expected, the median collapse capacity decreases as SDRDG
reduction in the collapse capacity of the damage structure. The increases. For example, the median collapse capacity is 1.12 g
damaged building is classified as safe or unsafe to occupy by com- ðj ¼ 0:93Þ when SDRDG ¼ 2% and reduces to 0.43 g ðj ¼ 0:36Þ
paring j to a pre-established threshold ðjmin Þ, which represents when SDRDG ¼ 5%.
the minimum acceptable reduction in collapse safety. For each damaging ground motion, log-normal collapse fragility
The fourth step uses 1) the response and damage patterns gen- curves are fitted for the ten target damage levels, while accounting
erated in step 1 and 2) their associated post-earthquake structural for record-to-record variability and structural modeling uncer-
safety states determined based on the reduction in collapse safety tainty [18,19]. Examples of these fragility curves and the relation-
(comparing j to jmin ) obtained from steps 2 and 3 to establish a ship between j and SDRDG for a single damaging ground motion are
classification problem. The machine learning algorithm is then shown in Fig. 3. It shows that there is no measurable reduction in
used to construct the predictive models. Subsequently, any arbi- collapse safety until SDRDG ¼ 2%, where j drops to 0.92, after
trary response and/or damage pattern can be used to probabilisti- which j decreases almost linearly with SDRDG . The results shown
cally predict the safety state of an earthquake-damaged structure in Fig. 3b confirms the effectiveness of SDRDG as a proxy for the
(Step 5). severity of structural damage. Moreover, only for this specific dam-
aging ground motion, it appears that j corresponding to
2.2. Establishing post-earthquake safety state criteria SDRDG ¼ 2% might be an appropriate minimum threshold ðjmin Þ
to distinguish the safe and unsafe structural states. Further discus-
As noted in Section 2.1, the reduction in the collapse capacity of sion of this issue is presented later in this section.
the damaged structure (as defined by j) is used as the basis for Establishing an acceptable value of jmin is partly a policy deci-
assessing its post-earthquake safety state. For each of the damag- sion and is not fully addressed in this study. However, as noted ear-
ing ground motions, ten incrementally-increasing maximum story lier, the observed trend between j and SDRDG can inform the choice
drift ratio ðSDRDG Þ levels are targeted, ranging from 0.5% to 5% at of jmin . To further illustrate this point, Fig. 4a shows a plot of j and
increments of 0.5%. These SDRDG values serve as proxies for the SDRDG that was generated by using a suite of damaging ground
possible states of building damage under a damaging ground motions. The details of the damaging and collapsing ground
motion. The number and increment of SDRDG used to generate motions used this study are presented in Section 3. Fig. 4b shows
the response and damage patterns is selected to balance the need a statistical summary of the relationship shown in Fig. 3b including
to cover a broad range of damage levels and the computational plots based on the median ðj ^ Þ and 90% confidence interval for j.
expense associated with performing IDA to collapse of the dam- Here it can be observed that the absolute value of the slope of
aged structure (described in step 2). Fig. 2 shows a series of IDA the median line remains roughly constant up to about
plots for the intact structure and three instances of the damaged SDRDG ¼ 2%, after which there is a sharp increase. Based on this
structure for the 4-story RC SMF building used to illustrate the observation, one approach could be to take jmin to be the median
methodology. The design, modeling and ground motion details value of j corresponding to SDRDG ¼ 2% in this study. Note how-
are provided in Section 3.1. Fig. 2a shows that the median collapse ever that, using this approach, there is a 50% probability that the
capacity for the intact building, which was obtained by applying value of j is less than jmin (based on the definition of the median)
IDAs using the collapsing ground motions, is 1.20 g. at SDRDG ¼ 2%. An alternative approach is to choose a value of j at
Fig. 2b, c and d show IDA plots for the damaged structure gener- SDRDG ¼ 2% that has a higher probability of being exceeded (e.g.
ated using a single damaging ground motion scaled to produce 90% probability that j P jmin ). A third approach would be to con-
SDRDG values of 2%, 3.5% and 5% respectively. Note that, for the sider the uncertainty in the safety state threshold by assuming a
damaged structure, the peak SDR corresponding to a collapsing parametric (e.g. lognormal) probability distribution for jmin and
ground motion spectral acceleration level ðSaT1 Þ of zero is the same using the mean/median and dispersion corresponding to
as the residual drift demand from the damaging ground motion. As SDRDG ¼ 2%.
4 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

Fig. 2. Example IDAs using safety-testing ground motions for the (a) intact building and the building damaged by damage-generating ground motions with peak SDR of (b)
0.01, (c) 0.03 and (d) 0.05.

Fig. 3. Example (a) collapse fragility curves and (b) median Sacol versus SDR under the DG ground motion.
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 5

Fig. 4. Median collapse capacity for (a) individual samples and (b) sample subsets by target damage levels.

2.3. Classified structural response and damage patterns learning approach is the availability of powerful algorithms that
can isolate the most critical response and damage points and
As described in Section 2.1, structural response and damage establish more clear distinctions between the two safety states.
patterns are used in combination with the safety states to con-
struct the prediction model using an appropriate machine learning 2.4. Safety state prediction using machine learning
algorithm. Fig. 5 shows some examples of response patterns
obtained from NRHA using the ‘‘damaging” ground motions and In a classification problem, machine learning algorithms utilize
the corresponding safety state. The global response patterns an observed dataset that is divided into a training subset, which is
include peak transient and residual SDRs along building height used to construct the predictive models, and testing subset, which
(Fig. 5a and b). Profiles of peak plastic hinge rotations in the frame is used to evaluate the performance of those models. Each observa-
beams and columns (Fig. 5c and d) are considered local response tion has two attributes: a set of predictors and their associated
patterns. A single profile represents the response pattern for a sin- classes. In this study, the response and damage patterns serve as
gle ‘‘damaging” ground motion and value of SDRDG . The response the predictors, which are always assumed to be available. The
patterns are classified into the safe and unsafe states based on structural safety states (safe or unsafe) serve as the corresponding
jmin ¼ 0:95 (median j at SDRDG ¼ 2%) from Fig. 4b. A noteworthy binary categorical classes, which are assumed to be known for the
observation is that there is a clear demarcation between the peak training data and unknown for the testing data.
and residual drift response patterns of the safe and unsafe states
in the first two stories. However, in the upper stories, there is a fair 2.4.1. Classification and regression tree
amount of overlap between the response patterns of the two safety CART is a machine learning algorithm that can be used to pre-
states. A similar observation can be made for the response patterns dict both categorical classes (classification tree) and ordered values
based on the beam and column plastic hinge rotations. This obser- (regression tree) [37]. This study is focused on categorical class
vation highlights the importance of considering the spatial varia- prediction. By recursively partitioning the predictor space (P),
tion of EDPs (or response pattern) and not just the absolute CART gradually explores and learns the structure of the observed
maximum in the entire building in structural safety state dataset with the goal of creating mutually exclusive subspaces
assessment. with better purity. The ratio between the sample number of the
Examples of safe and unsafe damage patterns for the beam dominant class, which is the one with the most samples, and the
hinges simulated using the peak plastic rotations in Fig. 5(c) are size of the subset, defines the purity of a subspace.
shown in Fig. 6. Damage patterns are described by each element Fig. 7 shows an example of a simple binary tree that is used to
being assigned one of four discrete damage states, where state 0 represent a CART. It comprises of a root node (node number 1 in
represents no visual damage and state 3 represents the most sev- Fig. 7), three interior nodes (numbered 2, 4 and 8) and five leaf
ere damage requiring complete beam replacement. Comparing nodes (numbered 3, 5, 9, 16 and 17). Except for the root node
Fig. 6a and b, the damage pattern corresponding to the unsafe state which consists of the full sample space, each node represents a
has 8 beam hinges in higher damage states, 10 in the same damage subspace of observed predictors and classes. Each interior (or deci-
states and 5 in lower damage states. This observation again rein- sion) node is generated when a parent node is split into two child
forces the need to consider the spatial distribution of physical nodes based on the observed values of one of the predictors (p 2 P).
damage (or damage patterns) to assess the severity of system- For example, the split at the root node is based on the damage state
level damage and the corresponding building safety state. The in one of the beam hinges (e.g. damage state 0.5 or >0.5). The
response and damage patterns in Figs. 5 and 6 suggest that only edges from a parent node to two child nodes represent a split
a subset of the EDPs and damage states would significantly affect (sp 2 Sp ), where Sp is the space of all the split points in predictor
the structural safety states. It is important to recall that these p at the parent predictor. The simple tree shown in Fig. 7 has a
results represent patterns obtained for a single damaging record depth (the number of splits) of four and five leaf (terminal) nodes.
and their spatial variation and severity could be very different for Each leaf node provides a predicted class (c 2 C), where C is the
other distinct ground motions. A major advantage of the machine space of all the classes under consideration, given the inputs of
6 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

Fig. 5. Example response patterns of (a) peak and (b) residual SDRs, and peak plastic hinge rotations of exterior (c) beams and (d) columns.

Fig. 6. Example damage patterns at beam hinges for a structure classified as (a) safe
and (b) unsafe.

all the predictors represented by the complete path from the root
to that leaf.
There are three main steps involved in building a CART model.
First, the binary tree is grown following the greedy algorithm,
which always tries to find a predictor, p, and its split point, sp , that
maximizes the purity (or minimizes the impurity) of the resulting
Fig. 7. Example tree structure, where FiBjHk denotes the damage states at Hinge k
subspaces. The Gini Index (GI) [28] is introduced as a measure of of Beam j at Floor i, and SiCjHk denotes the damage states at Hinge k of Column j in
the impurity in Eq. (1). Story i.
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 7

X X X X
GI ¼ n2N c2C
pn qc ð1  qc Þ ¼ n2N
pn ð1  c2C
q2c Þ ð1Þ producing the optimal predictive model. Bagging CART can result
in highly correlated tree structures due to the application of the
where N is the set of all leaf nodes in a given tree; pn is the pro- greedy algorithm, where all of the original predictors are consid-
portion of the dataset falling into a leaf node, n 2 N, and qc denotes ered at every split of every tree [32]. In order to overcome this
the probability of class c at a given node. For the predictors with problem, the Random Forests [33] algorithm was developed with
ordered values such as the response patterns, each split point, sp , a modification to Bagging to reduce such tree correlation. At each
creates a hyperplane perpendicular to the axis of the predictor, p, split point during the growth of the tree, Random Forests only
at location sp , to divide the predictor subspace of the current parent applies the greedy algorithm to a randomly selected proportion
node. For predictors defined by categorical classes, such as the of the original predictors.
damage patterns, each split point, sp , partitions the predictor space
into two sets of classes: one contains only a selected class c and the
other consists of all other classes, ðC  cÞ. Starting with the root 2.4.3. Evaluating the performance of the prediction model
node, all the predictors are assembled and almost all their possible Since the unsafe state is unfavorable and critical, it is defined in
split points are tested by computing the GI for each split. The pre- this paper as the ‘‘positive class” while the safe state as the ‘‘nega-
dictor and split point with the lowest GI is then used to create two tive class”. This section introduces some important performance
child nodes. This process is repeated to recursively partition the measures, which are used for testing and selecting the predictive
dataset into subspaces with boundary surfaces parallel to the pre- models built using different machine learning algorithms and cor-
dictor axes. responding tuning parameters.
The second main step involves establishing an appropriate cri- The sensitivity or true positive rate, is defined as the proportion
terion for stopping the growth of the tree. Examples of stopping of the unsafe (positive) samples in the testing dataset that are also
criteria include a minimum number of samples needed at a given predicted to be unsafe. The specificity of a prediction model, which
node for further split, a maximum tree depth, a limit on the total is also referred to as the true negative rate, is expressed as the pro-
number of leaf nodes in a tree. An alternative stopping criterion portion of known safe (negative) samples that is correctly pre-
that balances the tree size and its associated GI could be achieved dicted. The accuracy, which combines the sensitivity and
by introducing a penalty term. The concept of cost-complexity tun- specificity, is computed as the ratio between the sum of all the true
ing [28] is then adopted, which penalizes the GI. positive and true negative samples and the number of the total
X X samples. The false positive rate represents the proportion of the
GI ¼ n2N
pn ð1  c2C
q2c Þ þ cp jNj ð2Þ safe samples which are recognized as unsafe by the predictive
model, and is equal to one minus the specificity. More importantly,
where cp is the complexity parameter (CP) and jNj is the size of
the false negative rate measures how many of the unsafe samples
N for the given tree. Eq. (2) stops the growth of the tree when the
are misclassified as safe by the predictive model. This is the most
gain from reducing the overall impurity is not large enough to
unfavorable situation as it poses the highest risk to injury or fatal-
compensate for the cost of an excessively complex tree. It should
ity of the building occupants and therefore needs to be carefully
be noted that the optimal value of cp could be very different for dif-
maintained at an acceptably low rate.
ferent datasets. If it is too large, the model will be unable to ade- The receiver operating characteristic (ROC) describes the rela-
quately capture the characteristics of the dataset. On the other tionship between sensitivity and specificity [34,35]. The discrimi-
hand, a cp that is too small can lead to overfitting of the training nation threshold of the ROC, which can take on a numerical
data and have poor performance on the testing data. value between 0 and 1, is used to relate the probability distribution
The third and final step involves pruning the tree to reduce the of the safety state to single state. A sample is classified as unsafe if
possibility of overfitting. This could be achieved by randomly the probability of being positive obtained from the predictive
selecting a hold-out set from the training data at the very begin- model is larger or equal to this threshold; otherwise, the safe clas-
ning, passing it through the generated tree, and testing all splits sification is assigned. A threshold of 0 guarantees the maximum
from the bottom-up, deleting those whose removals would reduce sensitivity because all the samples are predicted to be unsafe and
the overall impurity for the hold-out set. The pruning process con- 1 provides the best specificity because it produces only safe predic-
tinues recursively until there are no further improvements. tions. When the threshold increases, the sensitivity drops while the
specificity grows. An ROC curve is obtained by incrementally vary-
2.4.2. Random forests ing the discrimination threshold from 0 to 1 and plotting its corre-
Although powerful, a CART is very sensitive to the specific data- sponding sensitivity and specificity. The area under the ROC curve
set on which it is trained and is therefore a high-variance model. As is then used as a balanced measurement of the performance of a
a result, the tree and corresponding predictions generated for dif- particular model.
ferent subsets of the same training dataset could be quite different. K-fold cross-validation is a resampling technique that is also
This problem can be addressed by assembling bootstrap-sampled used to reduce possible overfitting and select the optimal tuning
subsets of the training data. Bootstrap is a statistical technique that parameters for a predictive model [30]. The initial sample space
involves randomly sampling from a dataset to create a series of is partitioned into k roughly equally-sized subsets, and a model
sub-datasets [29,30]. Each random sample is placed back into the is trained using all but one held-out subset, which is used to assess
original dataset such that multiple (or no) instances of a particular the model performance [32]. This procedure is repeated k times by
sample can be included in the sub-dataset. Based on these resam- holding out the subsets one at a time and fitting the model to the
pled datasets, bootstrap aggregation, also known as bagging [31], is remaining ðk  1Þ subsets. The overall performance for these k
an ensemble method, which seeks to achieve better predictions by resampled datasets is evaluated for each predictive model with
combining multiple less accurate models. Bagging CART is used to specific tuning parameters, such as the CP for CART, the number
train the predictive models using all these resampled datasets and of total trees and number of randomly selected predictors to
generate aggregated predictions by using the prediction that is choose from at each split for Random Forests. The k-fold cross-
generated by the majority of models. validation procedure could also be repeated using a different set
While bagging addresses some of the limitations of a basic CART of partitioned subsamples to further reduce possible bias from
model, the method still has an inherent defect that prevents it from the partition.
8 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

3. Case study: Post-earthquake structural safety of a modern capacity, the ratio of maximum to yield moment and the energy-
reinforced concrete special moment-frame building based cyclic degradation parameter.
The ground motions used for NRHA are selected based on the
3.1. Building description, ground motion selection, structural modeling conditional mean spectra, which are computed for the building site
and response simulation (118.162° W, 33.996° N) using probabilistic seismic hazard deag-
gregation and the ground motion prediction equations developed
The post-earthquake structural safety of a 4-story perimeter by Campbell and Bozorgnia [44,45]. The approach suggested by
frame building is assessed to demonstrate the proposed frame- Baker and Cornell [46,47] is used to select both the damaging
work. This building was developed by Haselton et al. [36] (identi- and collapsing ground motions, where a mean e of 1.47 for large-
fied with building design ID 1013) as part of a larger study on the magnitude short distance events is used for both sets. A total of
seismic collapse safety of modern code conforming reinforced con- 98 ground motions is included in the damaging set by randomly
crete moment frame buildings. The building was designed accord- sampling one of the horizontal components of each record. This
ing to the provisions of the International Building Code [37], ASCE is done to avoid having ground motions with highly correlated
7-02 [38], and ACI 318 [39], which includes requirements for spectral shapes that would induce similar (and redundant)
strength, stiffness, capacity design, and detailing. The building response patterns. 32 ground motion pairs are selected for the col-
has an 8-in flat slab floor system with a square plan of 120 ft by lapsing set. The collapsing and damaging ground motion spectra
120 ft. A height of 15 ft is used for the first story and 13 ft for all are shown in Fig. 9.
the upper stories. The seismic design is based on the mapped haz- The collapse performance of the intact building is evaluated by
ard for a Los Angeles site with SS ¼ 1:5 g, S1 ¼ 0:6 g, and soil site performing IDAs using the two components in each pair of the col-
class D. lapsing ground motions and taking the lower of the two collapse
A two-dimensional model of the three-bay moment frame is intensities. To obtain the collapse capacity of the damaged build-
built in OpenSees [40] (Fig. 8) using expected gravity loads ings, sequential NRHAs are performed, whereby the building is
(D þ 0:25L). A leaning column is used to account for P-Delta effects subjected to each of the 98 damaging ground motions, followed
resulting from the loads on the gravity system, which is not explic- by an IDA to collapse using the 32 pairs of collapsing records. Like
itly modeled. The leaning column is axially rigid, provides no lat- the intact building, the lower of the two collapse intensities for
eral restraint and the horizontal translational degrees of freedom each collapsing record-pair is used for the damaged buildings.
of the end nodes are constrained to the floor nodes. It is assumed The response and damage patterns are obtained from the damag-
that damage to the slab-column connections of the gravity system ing analysis. Using the ten states of building-level damage
will not result in a vertical collapse of the slab. The foundation described in Section 2.2 for each damaging ground motion pro-
rotation stiffness is calculated from typical grade-beam design duces 980 samples, which include response patterns, damage pat-
and soil stiffness properties. The moment frame columns are fixed terns and j0 s. However, 55 of those samples were excluded
at the base. Rayleigh damping corresponding to 5% of critical because collapse occurred during the damaging analysis. As a
damping in the first and third modes is applied. result, the prediction model was constructed using 935 samples.
The moment frame is defined using elastic beam-column ele-
ments with flexural plastic hinges at the ends. The nonlinear 3.2. Safety-state-based distribution of response demands and damage
behavior of the flexural hinges in the frame beams and columns patterns
is based on the peak oriented hysteretic model [41] and the predic-
tive equations developed by Panagiotakos and Fardis [42] and The probability distributions of the global response patterns
Haselton et al. [43] are used to obtain the backbone parameters, (peak and residual SDRs) for all the 935 samples corresponding
such as the plastic rotation capacity, post-capping deformation to each safety state is shown in Fig. 10. We are particularly

Fig. 8. Schematic illustration of the RC SMF OpenSees model.


Y. Zhang et al. / Structural Safety 72 (2018) 1–16 9

Fig. 9. Response spectra for (a) damaging and (b) collapsing ground motions.

Fig. 10. Histogram of (a) peak and (b) residual SDRs for samples classified as safe and unsafe (the dashed and dot lines show the 90% confidence intervals for the safe and
unsafe states respectively).
10 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

interested in the amount of overlap (or lack thereof) in the distri- unsafe states, having 90% confidence intervals of 1:09  104 to
bution of peak and residual drifts between the safe and unsafe 2:07  104 and 1:34  104 to 2:52  104 respectively. The distri-
states. The 90% confidence interval for peak 1st story drift demands butions are more separated in the top hinge of the 2nd and 3rd
(Fig. 10a) ranges from 0.4% to 2.1% for the safe state and 1.5% to story column, where the rotation demands are concentrated
4.6% for the unsafe state. For peak 2nd story drift demands, the
around 104 for the safe samples and generally range from 103
range is 0.5% to 2.9% for the safe state and 1.6% to 5.0% for the
unsafe state, which indicates that the distribution for the two to 5  102 for the unsafe ones. For the bottom hinges, only those
safety states is quite different. In contrast, the 3rd and 4th stories in the 1st story have distributions that are clearly separated, where
have significant overlap in the distribution of peak story drift ratios a possible split point (recall in Section 2.4) could be located within
for the safe and unsafe samples, for example, the 4th story has 90% the range between 103 and 102 .
confidence intervals of 0.2–1.5% and 0.3–1.6% respectively. A sim- For each of the 935 samples, 100 damage patterns are generated
ilar trend can be observed for the distribution of residual drifts. using Monte Carlo simulation by combining the local response pat-
There is relatively less overlap in the residual drift distributions terns with fragility curves that relate EDPs to the probability of
between the unsafe and safe samples in the first two stories. For exceeding discrete damage states. A single damage pattern is
example, only unsafe samples have residual drift demands greater described by each component being assigned a damage state. Each
than 2.5% in the 1st and 2nd stories. The opposite is generally true damage measure (DM) (or mode of component-level damage) con-
for the 3rd and 4th stories where the residual drift distributions are sists of a pre-defined number of unique damage states [18]. The
very similar for the two safety states. The separation of the global damage states for beam (DM1) and column (DM2) flexural damage
EDP distribution in the first two stories suggests that they are more are described in Table 1. The median EDP and dispersion that
likely to be used as split boundaries that distinguish between safe defines the lognormal fragility function for each damage state is
and unsafe states. In other words, the global EDPs in the first two also shown in Table 1. These fragility parameters are computed
stories are the potential strong predictors for classification. using the methodology outlined in Section 3.8.4 of FEMA P58-1
Fig. 11 shows the histograms of the peak plastic hinge rotations [18] and exercised in Section 6.3 of FEMA P58-2 [19]. The median
at the ends of a typical exterior column. From the figure, it can be rotation demand for the flexural damage states is defined relative
seen that the plastic rotation demand distributions for the column to the (capping) rotation demand at peak strength, hc .
hinges at the top of Stories 1 and 4 are very similar for safe and In Fig. 12, the distribution of different damage states in beam
and column hinges for the safe and unsafe states is compared using

Fig. 11. Histogram of peak plastic hinge rotations at the (a) top and (b) bottom ends of a typical column (the dashed and dot lines show the 90% confidence intervals for the
safe and unsafe states respectively).
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 11

Table 1
Structural component damage measures and median EDP and dispersion associated with each damage state.

Damage Measure ID Damage Measure Damage State ID Damage State Description Median EDP Dispersion
DM1 Beam Flexural Damage DM1-1 Residual crack widths >0.06 in 0.3hc 0.4
DM1-2 Concrete spalling and exposed rebar 0.7hc
DM1-3 Concrete crushing and buckling or fracture of rebar 1.0hc
DM2 Column Flexural Damage DM2-1 Residual crack widths >0.06 in 0.25hc 0.4
DM2-2 Concrete spalling and exposed rebar 0.55hc
DM2-3 Concrete crushing and buckling or fracture of rebar 0.8hc

Fig. 12. Distribution of damage states for safe and unsafe samples at (a) exterior beam end, (b) interior beam end, (c) top column end and (d) bottom column end.

the 93,500 samples with distinct damage patterns. As observed in generally less severe than the exterior ones. Fig. 12(c) shows that,
Fig. 12a and b, beam flexural damage is highest in the two lower for both the safe and unsafe states, observable damage to the top
floors. For the safe state of the exterior beams, 14.0% and 16.0% column hinges only occurs in the 2nd and 3rd story. For the safe
of the samples corresponding to the 2nd and 3rd floors respectively state, DM2-1 occurs in 1.1% and 4.0% of the samples corresponding
are in the lowest damage state (DM1-1). For those at the 3rd and to the 2nd and 3rd stories respectively. Much more damage occurs
4th floors, only 5.2% and 2.9% respectively are in DM1-1. Almost in the samples associated with the unsafe state. 33.4%, 7.0% and
no higher damage states occur among the safe state samples. For 2.8% of top column hinges in the 2nd story and 18.2%, 2.7% and
the unsafe state, 54.7%, 15.2% and 7.8% of exterior beams at the 0.7% in the 3rd story, are in DM2-1, DM2-2 and DM2-3, respec-
2nd floor are in damage states DM1-1, DM1-2 and DM1-3, respec- tively. As indicated in Fig. 12(d), less than 2.5% of the safe samples
tively. The proportions of those damage states in the 3rd floor are for the bottom column hinges in the first two stories are in DM2-1
32.4%, 5.6% and 2.1%, respectively. Similar trends are observed for and there is no observable damage elsewhere. In general, the dis-
the interior beams at each floor, however, the damage was tinct damage patterns between the safe and unsafe states provides
12 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

the basis for building the advanced prediction models using tions can be made with more than 90% probability in the cases
machine learning algorithms. where peak SDR is less than 0.012 or larger than 0.018, but poor
performance with only 59% probability is found when the SDR falls
between those two values. Therefore, the tree keeps growing until
3.3. Machine learning-based safety state prediction models
the reduction of the overall GI is less than the predefined CP.
Finally, a 7-layer tree with 11 leaf nodes is constructed after a total
Predictive models are built using two different datasets: 1) 935
of 9 partitions, which has more than 90% classification accuracy for
classified response patterns that comprise only global EDPs and 2)
approximately 84% of the training subsets, and only 2% of all the
93,500 damage patterns sampled from classified local EDPs. Each
predictions have an accuracy less than 70%.
dataset is partitioned into two subsets: 75% of the samples are
Fig. 14 illustrates the parameter tuning process for the CART
used for training and the remaining 25% are held-out for testing
and Random Forests models trained using the response patterns.
the performance of the proposed models. 10-fold cross-
In Fig. 14a, the area under the ROC curve, sensitivity and specificity
validations are performed using training subsets to find the opti-
are plotted versus CP ranging from 0.5 to 1  106. It can be
mal predictive models with the least amount of overfitting given
observed that the area under the ROC curve and specificity increase
specific tuning parameters. To further reduce the possible bias in
as the CP decreases, while the sensitivity decreases. All three per-
dividing the training data into 10 folders, the cross-validation is
formance measures remain almost unchanged after the CP exceeds
repeated 3 times with random partitions. Basic CART models are
0.001. Since the unsafe state is more critical, sensitivity is more
built as well as advanced Random Forests models.
valued than specificity in this study so the optimal CP is chosen
An example CART model constructed using the training subset
as 0.05. Fig. 14b demonstrates how the number of predictors con-
of the response pattern dataset is presented in Fig. 13. The root
sidered at each split and the number of trees are used to tune the
node has 38% safe samples and 62% unsafe samples, which would
Random Forests model. The former ranges from 2 to 10 with an
result in an unsafe prediction based on the majority vote, if no
increment of 2 and the latter is defined by the set {10, 20, 30, 40,
more splits are created in the model. At this initial step, jNj ¼ 1,
50, 100, 150, 200, 250, 300}. As shown in Fig. 14(b), the area under
pn ¼ 1, and the Gini Index computed using Eq. (1) is 0:47, indicat- ROC curve tends to increase as the number of trees increases. The
ing a very high impurity when compared with the maximum GI of
model that allows 10 predictors for further splits has the lowest
0.5 for a binary dataset. By testing the possible splits associated
area while the models allowing to 2, 4, 6 and 8 predictors have
with the predictors, the algorithm finds the optimal point of peak
comparable ones. Fig. 14(c) and (d) show that the models that
SDR ¼ 0:018 and partitions the original sample space into two sub-
allows more predictors for selecting each split generally have bet-
spaces. 43% of the samples have peak SDR < 0:018 and the remain-
ter performance in terms of the sensitivity, but are less competitive
ing 57% have peak SDR P 0:018. The overall GI after the split
for specificity. By synthesizing the above observations, the optimal
decreases to 0:197, which represents a substantial reduction of
tuning parameters are selected as 6 predictors considered at split
0.274. Next, within each subspace, the algorithm is repeated in
points and 250 trees for bagging.
search of the next optimal split point, which is again peak SDR
Parameter tuning is also performed for the models trained using
but at a different point, 0.012. After this split, the GI is computed
the damage patterns and the results are summarized in Fig. 15. A
as 0:166, which represents a 0.031 decrease. If the model stops
CP value of 1  106 is chosen as optimal as it maximizes the area
here, it would only have two splits and three leaf nodes. Predic-

Fig. 13. Example CART model with CP of 0.01 and trained using response patterns, where PSDRi denotes the peak SDR at story i, PRDR denotes the peak RDR, RSDRj denotes
the residual SDR at story j and RRDR denotes the residual RDR.
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 13

Fig. 14. Parameter tuning for (a) CART and (b)–(d) Random Forests models trained using response patterns.

under the ROC curve and balances the sensitivity and specificity, as gle tree might be able to capture the key characteristic of the
shown in Fig. 15a. Fig. 15(b) shows that the area under ROC curve response patterns. 22,900 testing damage patterns are fed into
increases as the number of trees increases. The model with 10 pre- the final CART model, of which 7219 and 12,646 correct predic-
dictors at each split is the best and is clearly distinguished from the tions are found for the safe and unsafe states respectively. The false
other four. Fig. 15(c) and (d) show that the model with only 1 pre- positive rate is 15.1% and the more critical false negative rate is
dictor at each split has the lowest performance for sensitivity but 12.2%. Considerably better performance is observed for the Ran-
highest performance for specificity, while all the other models dom Forests model. The improvements of 1.46%, 1.85% and 0.8%
are comparable. Based on these observations, the optimal model on accuracy, sensitivity and specificity seem small, but in fact,
allows 5 predictors to be considered at split points and uses 150 there is a 15.2% (267 out of 1754) reduction in the critical false
trees. negative rate compared to the CART model.
Table 2 summarizes the performance measures including sensi- The predictive models based on the response patterns have an
tivity, specificity and accuracy for the final CART and Random For- overall slightly better performance than the damage pattern mod-
ests models using 25% held-out testing datasets. Predictions els. A possible explanation is that additional randomness is intro-
obtained from all the final models are compared with their corre- duced by using component fragility curves and Monte Carlo
sponding reference (real) states. 240 testing samples with response simulation to generate the damage pattern from the EDPs. More-
patterns are employed in the final CART model, where 74 true neg- over, the precision of the predictor information is somewhat
ative and 143 true positive predictions are observed. Only 19 cases reduced when continuous EDPs are converted to discrete damage
are recognized as unsafe (negative) when they are actually safe states. With that being said, it should be noted that exact or real
(positive), while 4 unsafe ones are misclassified as safe. An overall EDP values are assumed in the response pattern models even
accuracy of 90.4% is achieved, and more importantly, the false neg- though the field instruments used to record these EDPs may
ative rate is only 1.7%, giving an acceptably low rate of risky predic- introduce measurement errors, which could potentially lower the
tions. The Random Forests model has similar behavior and only performance. This issue is not addressed in the current study. More
performs slightly better than the CART model, indicating that a sin- importantly, unlike the response patterns which require pre-
14 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

Fig. 15. Parameter tuning for (a) CART and (b)–(d) Random Forests models trained using damage patterns.

Table 2
Performance measures of final models based on testing datasets.

Response Dataset Damage Dataset


CART Random Forests CART Random Forests
Actually Actually Actually Actually Unafe Actually Actually Actually Actually
Safe Unafe Safe (Positive) Safe Unafe Safe Unafe
(Negative) (Positive) (Negative) (Negative) (Positive) (Negative) (Positive)
Predicted Safe (Negative) 74 4 76 4 7219 1754 7287 1487
Predicted Unsafe (Positive) 19 143 17 143 1281 12,646 1213 12,913
Sensitivity 0.97 0.97 0.88 0.90
Specificity 0.80 0.82 0.85 0.86
Accuracy 0.90 0.91 0.87 0.88
Sensitivity 95% Confidence 0.86–0.94 0.87–0.95 0.86–0.87 0.88–0.89
Interval

installed instruments, the damage patterns can be more easily assessments of the residual collapse capacity. Multiple instances
obtained after an earthquake through visual inspection. of a damaged structure are created by subjecting a nonlinear struc-
tural model to a suite of ‘‘damaging” ground motions scaled to
4. Conclusion achieve pre-defined target drift demands. The spatial distribution
of engineering demand parameters from a single damaging analy-
A machine learning framework for assessing post-earthquake sis is described as a response pattern. The term damage pattern is
structural safety is developed by integrating probabilistic seismic used to describe the spatial variation of component-level damage,
demand analysis, component-level damage simulation and robust which is generated through Monte Carlo Simulation, by coupling
Y. Zhang et al. / Structural Safety 72 (2018) 1–16 15

local engineering demand parameters with damage fragility func- References


tions. The collapse capacity of the intact and damaged structures
is assessed by performing incremental dynamic analysis using a [1] SEAOC, Vision. Performance based seismic engineering of
buildings. Sacramento, CA: Structural Engineers Association of California;
suite of ‘‘collapsing” ground motions. For the damaged structure, 2000. p. 1995.
the model is subjected to a single damaging record followed by a [2] ATC, 40. Seismic evaluation and retrofit of concrete buildings. Redwood,
collapsing ground motion whose spectral intensity is incremen- CA: Applied Technology Council; 1996.
[3] FEMA, 273/274. NEHRP guidelines for the seismic rehabilitation of
tally increased in order to cause collapse. The ratio of the median buildings. Washington, DC: Federal Emergency Management Agency; 1996.
collapse capacity of the damaged and intact structure is used as [4] Cornell CA, Krawinkler H. Progress and challenges in seismic performance
the measure of the reduced collapse safety and the criterion for assessment. PEER Center News 2000;3(2):1–4.
[5] Deierlein GG, Krawinkler H, Cornell CA. A framework for performance-based
the post-earthquake safety state (safe or unsafe) of the building.
earthquake engineering. Pacific Conf. Earthq. Eng. 2003;273(140):140–8.
A robust dataset comprising of 935 response patterns, 93,500 dam- [6] Moehle J, Deierlein GG. A framework methodology for performance-based
age patterns and their corresponding safety state is used to define a earthquake engineering. In: Proc. 13th World Conf. Earthq. Eng.; 2004, no. 679,
classification problem. pp. 3812–4.
[7] Porter KA, Kiremidjian AS, LeGrue JS. Assembly-based vulnerability of
Machine learning algorithms including CART and Random For- buildings and its use in performance evaluation. Earthquake Spectra 2001;17
ests are implemented to map the response and damage patterns (2):291–312.
to their classified structural safety states, where the predictor [8] Porter KA. An overview of PEER’s performance-based earthquake engineering
methodology. In: 9th Int. Conf. Appl. Stat. Probab. Civ. Eng.; 2003, vol. 273, no.
space is recursively partitioned to capture the underlying relation- 1995, pp. 973–80.
ship. Previously developed post-earthquake structural evaluation [9] Cimellaro GP, Reinhorn AM, Bruneau M. Framework for analytical
methods have based the safety state criteria on the exceedance quantification of disaster resilience. Eng. Struct. 2010;32(11):3639–49.
[10] Cimellaro GP, Reinhorn AM, Bruneau M. Performance-based metamodel for
of pre-defined response demand levels or damage state ratios healthcare facilities. Earthq. Eng. Struct. Dyn. Sep. 2011;40(11):1197–217.
within individual structural component groups. In contrast to prior [11] Iervolino I, Giorgio M. In: Stochastic modeling of recovery from seismic shocks.
approaches, the outcome of the proposed framework consists of p. 1–9.
[12] Burton HV, Deierlein G, Lallemant D, Lin T. Framework for incorporating
classification trees, each comprising a high-dimension space of probabilistic building performance in the assessment of community seismic
response and damage patterns partitioned into multiple subspaces. resilience. J. Struct. Eng. Aug. 2016;142(8):C4015007.
Given an arbitrarily observed pattern for a damaged structure, each [13] Fereshtehnejad E, Shafieezadeh A. Multiple hazard incidents lifecycle cost
assessment of structural systems considering state-dependent repair times
tree is intelligently searched to find the matching subspace, which
and fragility curves. Earthq. Eng. Struct. Dyn. 2016.
will serve as the basis for classifying the safety state of the struc- [14] ATC. ATC-20 Procedures for Post-earthquake building safety evaluation
ture. The result is a discrete probability distribution of the struc- procedures. Redwood, CA: Applied Technology Council; 1995.
tural safety state, which serves as an indicator of the confidence [15] ATC. ATC-20-2, Addendum to the ATC-20 postearthquake building safety
evaluation procedures, 1995. Redwood, CA.
of the prediction. [16] Robertson I, Kindred T, Lau E. Compilation of observations of the October 15;
The proposed methodology is applied to a 4-story reinforced 2006.
concrete special moment frame building located in a region of high [17] Mitrani-Resier J, Wu S, Beck JL. Virtual Inspector and its application to
immediate pre-event and post-event earthquake loss and safety assessment of
seismicity. The results show that, although different response and buildings. Nat. Hazards Apr. 2016;81(3):1861–78.
damage patterns exist for the safe and unsafe structural states, [18] FEMA. Seismic performance assessment of buildings volume 1 –
they cannot be separated by assigning a certain criterion for any methodology. Washington, DC: Federal Emergency Management Agency;
2012.
individual engineering demand parameter or damage measure. [19] FEMA. Seismic performance assessment of buildings volume 2 –
Such a criterion would result in considerable overlap between implementation guide. Washington, DC: Federal Emergency Management
the two safety states. The predictive models presented in this Agency; 2012.
[20] Vamvatsikos D, Allin Cornell C. Incremental dynamic analysis. Earthq. Eng.
study, which are trained using machine learning algorithms, are Struct. Dyn. 2002;31(3):491–514.
able to provide predictions with high performance in terms of [21] Vamvatsikos D, Cornell CA. The incremental dynamic analysis and its
accuracy, sensitivity and specificity. The prediction accuracy is application to performance-based earthquake engineering. Eur. Conf. Earthq.
Eng. 2002:10.
above 90% for both the CART and Random Forests models when
[22] Dolsek M. Incremental dynamic analysis with consideration of modeling
applied to response patterns with unknown safety states. A mini- uncertainties. Earthq. Eng. Struct. Dyn. 2009;38(6):805–25.
mum accuracy of 86% is achieved for damage patterns with [23] Joe M, Karl T, Mohr D, William H. Test applications of advanced seismic
unknown safety states. Relatively high sensitivity is also observed assessment guidelines. Pacific Earthq. Eng. Res. Cent. 2006.
[24] Maffei J, Telleen K, Nakayama Y. Probability-based seismic assessment of
in the prediction models developed for the case study, which is buildings, considering post-earthquake safety. Earthq. Spectra 2008;24
critical for reducing the rate of high-risk erroneous predictions (3):667–99.
i.e. model predicts safe when building is unsafe. [25] Yeo G.L., Cornell CA. Building tagging criteria based on aftershock PSHA. In:
Proceeding 13WCEE, Vancouver, Canada. Pap.; 2004; vol. 3283, no. 3283.
The proposed framework could be used for rapid probabilistic [26] Raghunandan M, Liel AB, Luco N. Aftershock collapse vulnerability assessment
assessment of whether a damaged building is safe to reoccupy fol- of reinforced concrete frame structures. Earthq. Eng. Struct. Dyn. 2015;44
lowing an earthquake. Additionally, the trees generated by the (3):419–39.
[27] Burton HV, Deierlein GG. Integrating visual damage simulation, virtual
machine learning algorithms could be used to prioritize field inspection, and collapse capacity to evaluate post-earthquake structural
inspections following an earthquake. Moreover, the probabilistic safety of buildings. Earthq. Eng. Struct. Dyn. 2017 (in press).
safety state predictions could be used in community resilience [28] Breiman RA, Friedman L, Stone J, Olshen CJ. Classification and regression
trees. CRC Press; 1984.
evaluations and individual building life-cycle performance assess- [29] Efron B, Tibshirani R. Bootstrap methods for standard errors, confidence
ment and optimization. intervals, and other measures of statistical accuracy. Stat. Sci. 1986:54–75.
[30] R. Kohavi et al., A study of cross-validation and bootstrap for accuracy
estimation and model selection. In Proceedings of the Fourteenth
Acknowledgements International Joint Conference on Artificial Intelligence; 1995; vol. 14, no.
2, pp. 1137–45.
[31] Breiman L. Bagging predictors. Mach. Learn. 1996;24(2):123–40.
The research presented in this paper is supported by the National [32] Kuhn M, Johnson K. Applied predictive modeling. Springer; 2013.
Science Foundation CMMI Research Grant No. 1538866. The author [33] Breiman L. Random forests. Mach. Learn. 2001;45(1):5–32.
[34] Hanley JA, McNeil BJ. The meaning and use of the area under a receiver
is grateful for the high-performance computing resources at operating characteristic (ROC) curve. Radiology Apr. 1982;143(1):29–36.
UCLA’s Hoffman2 Cluster which were utilized for the large [35] Bradley AP. The use of the area under the ROC curve in the evaluation of
numbers of NRHAs in this study. machine learning algorithms. Pattern Recognit. 1997;30(7):1145–59.
16 Y. Zhang et al. / Structural Safety 72 (2018) 1–16

[36] Haselton CB, Deierlein GG. Assessing seismic collapse safety of modern [43] Haselton CB, Liel AB, Lange ST. Beam-column element model calibrated for
reinforced concrete moment-frame buildings. Berkeley, CA: Pacific Earthquake predicting flexural response leading to global collapse of rc frame buildings,
Engineering Research Center; 2008. PEER Report 2007/08. PEER Report 2007/03. Berkeley, CA: Pacific Earthquake Engineering Research
[37] International Code Council Inc. 2015 International building code. Birmingham, Center; 2008.
AL: International Code Council Inc; 2014. [44] Campbell KW, Bozorgnia Y. Campbell-Bozorgnia NGA empirical ground
[38] American Society of Civil Engineers. Minimum design loads for buildings and motion model for the average horizontal component of PGA, PGV, PGD and
other structures. Reston, VA: American Society of Civil Engineers; 2010. SA at selected spectral periods ranging from 0.01–10.0 Seconds (Version 1.1);
[39] ACI Committee 318. Building code requirements for structural concrete (ACI 2006.
318-08), 2008. [45] Campbell KW, Bozorgnia Y. NGA-West2 ground motion model for the average
[40] Mazzoni S, Mckenna F, Scott MH, Fenves GL. Open System for Earthquake horizontal components of PGA, PGV, and 5% damped linear acceleration
Engineering Simulation (OpenSees) OpenSees Command Language Manual, response spectra. Earthq. Spectra 2014;30(3):1087–115.
2007. [46] Baker JW, Cornell CA. Spectral shape, epsilon and record selection. Earthq. Eng.
[41] Ibarra LF, Medina RA, Krawinkler H. Hysteretic models that incorporate Struct. Dyn. 2006;35(9):1077–95.
strength and stiffness deterioration. Earthq. Eng. Struct. Dyn. 2005;34 [47] Baker JW. Conditional mean spectrum: tool for ground-motion selection. J.
(12):1489–511. Struct. Eng. 2010;137(3):322–31.
[42] Panagiotakos TB, Fardis MN. Deformation of reinforced concrete at yielding
and ultimate. ACI Struct. J. 2001;98(2):135–47.

S-ar putea să vă placă și