"I Knew It Would Happen" - Remembered Probabilities of Once-Future Things

^ ''/ I
AD-780 971
"I KNEW IT WOULD HA PPEN" - - REME MBERE D

PROBABILITIES OF ONCE-FUTURE THINGS
Baruch Fischhoff, et al
Oregon Research Institute
Prepared for:
Office of Naval Research

Advanced Research Projects Agency
May 1974
DISTRIBUTED BY:
LKJU
National Technical Information Service
U. S. DEPARTMENT OF COMMERCE
5285 Port Royal Road, Springfield Va. 22151
J
ii nl
'*
"P" ^^
Unclassified
Security Claifi_caUon_ /?/) vro 9 7/
DOCUMENT CONTROL DATA -R&D
fSgcuflty clmtiltlctllon ot llll; body ol tOtltmcl mnd Indtnlnj onnolmllon muH bt nfnd mhn Ihm ovimtt rmporl IM clmstUfdj
I ORIGINATING C T I VI T Y fCoipor! Oul/lOf) 2. RCPOMT tCCURITY CLASSIFICATION
Oregon Research Institute Unclassified

Eugene, Oregon lh. GROUP
3 REPORT TITL E
"I Knew It Would Happen'^..Remembered Probabilities of Once- Future Things
4 DESCRIPTIVE NOTES (Typ* ot report and Ineluilv dmli)

Technical Report
s AUTHORISI (Flrtl nmm, mlddlm Inlllml, latl nmma)
Baruch Fischhoff and Ruth Beyth
< REPORT DATE 7a. TOTAL NO. Of PACES 7b. NO. OF RKFS
May, 1974 8
ia. CONTRACT OR GRANT NO. a. ORIGINATOR'S REPORT NUMVCRdt
N0001A-73-C-0438 Oregon Research Institute Research Bulleti

6. PROJEC T NO. Vol. U, No. 4.
NR (197-026)
e. ARPA Order No. 24A9 9b. OTH-H REPORT NOISI (Any otfiar nunbara thai may bo aaalgnad
Utl* r iportj
10. DISTRIBUTION STATEMENT
This document has been approved for public release and sale, Its distribution
is unlimited.
II. SUPPLEMENTARY NOTES 12. SPONSORING MILITARY ACTIVITY
Office of Naval Research

Code 455
Arlington, Virginia 22217
13. ABSTRACT
Judges who had estimated the likelihood of various possible outcomes of President
Nixon's trips to Peking and Moscow were unexpectedly asked to remember, or reconstru t
in the event that they had forgotten, their own predictions some time after the
visits were completed. In addition, they indicated whether or not the., thought that
each event had ii fact occurred, lemembered-reconstructed probabilities were
generally higher than the originally assigned prcbabilities for events believed to
have occurred and lower for those which had not (although the latter effect was
less pronounced). In their original predictions, subjects overestimated low pro-
babilities and underestimated high probabilities, although they were generally
accurate. Judging by their reconstructed-remembered probabilities, however, subject;
seldom perceived having been very surprised by what had or had not happened. These
results are discussed in terms of cognitive "anchoring" and possible detrimental
etfects of outcome-feedback.
'"NATIONALTFCHNICAL
INFORMATION SERVICE
U S DopaMmont of Commerce
SpriiiKfinld VA 22151
nf\ fOMtn 4 M TfO 'LAC DO FOMM I4T. t J AN , WHICH I*

LIU MOVS.I**/? OMOLBTB FOI. ARMY Ui.
:i. Security lasslflcatlon
i J
J
-stai ' -* ^L. mm
T ^
BS'J^J _..,,.-
uncxassitled
Security Cl>>inctloti
LSNK A LINK C
KBV WONOt
NOLK WT MOLK HOL
Judgment
Prediction
Decision Making
Uncertainty
Information Processing
\tU Sscuritr CUttlflceUon
-i_^Mai^
T$
-1.
"T KNEW IT WOULD HAPPEN"
REMEMBERED PROBABILITIES OF OKCE-FUTURE THINGS
Baruch Fischhoff Ruth Beyth

The Hebrew University of Jerusalem The Hebrew University of Jerusalem
Most judges engaged in predictive tasks are presumably interested in
improving their own performance. A logical first step in this direction is
ro evaluate the accuracy of their own past predictions in the light of what
has subsequently happened. In order to be evaluated, these predictions must,
of course, either be remembered per se or reconstructed on the basis of what
judges remember having known about the event at the time of the original pre-
diction or estimated on the basis of the event's post facto likelihood. The
effectiveness of the evaluation process depends in part upon the veracity of
these remembered or reconstructed predictions. Little, if anything, however,
is known about the extent of systematic or random error in remembere-1. and re-
constructed predictions
For some time, we have been studying the differences between predictive
and postdictive (post facto) judgment (Fischhoff, 1974). Some of our results
suggest that remembered predictions may be consistently biased in a manner
r-plored in the study reported here. In particular, we have found that events
reported to have happened tend to be assigned higher postdictive than predictive
probabilities; i.e., reporting an event's occurrence increases Its perceived
inevitability. This tendency was named "creeping determinism" as it expresses
a tendency toward determinism which is nonetheless short of that advocated by
at I M> ArfBrm
Fischhoff-Beyth 2
theories of historical inevitability (Berlin, 1954; Carr, 1961).
To summarize briefly the more detailed discussion appearing in Fischhoff
(1974), creeping determinism seems most readily understood through consideration
of the demand characteristics of the retrospective judge's task. Typically,
judges are called upon to predict the future and to "make sense" out of the
past. Attempting to understand why a particular outcome occurred seems, among
other things, to increase the salience of data and reasons which can be
integrated into coherent explanatory patterns. Unintegratable data tend to
be forgotten, deemphasized, or reinterpreted to fit the dominant explanation.
Postdicted probabilities are estimated on the basis of such "updated" sets
of event-descriptive data. Given this mode of outcome knowledge processing,
the judgmental heuristics for probability estimation described by (Tversky &
Kahneman fl973a), imoly that oostdictive probabilities will be higher than the
corresponding predictive estimates.
Although the name creeping determinism has clear pejorative connotations,
in many cases the postdictive probability of events which have happened is
justifiably higher than the corresponding predictive probability. Consider
/ sampling with replacement from an urn containing an unspecified proportion of

r/
red and blue balls. Of the first four balls drawn, two are red and two are
blue. The fifth ball drawn is blue. Prior to the fifth drawing, the probability
of a blue ball was 50%, following the drawing, that probability is properly
evaluated as greater than 50%, i.e., the postdicted probability is higher than
the predicted probability. It is, however, our conviction that in real-life
such retrospective increases frequently constitute little more than facile
reductions in the "surprisingness" of what has happened. Rather than reflecting
some "wisdom of hindsight", they seem to reflect what might be called a
J
_.i i i>\fc nag ^.i-
I III I
Fischhoff-Beyth 3
"knew It all along" attitude.
VeriTication of such suspicions is only possible in the relatively rare
(for real-life judges) instances in which a well-defined model of the data-
generating process is available. A model allows the calculation of predictive
and postdictive probabilities, as well as actual data-diagnosticity. As our
primary interest is the judgment of unique events, we have been studying an
interesting side effect of creeping determinism whose non-normative status is
readily established and whost; consequences are of considerable interest in
their own right. In particular, we have found that judges appear to be generally
incapable of assessing the changes in their judgments induced by possession of
outcome knowledge. A further experiment showed that subjects who were provided
with outcome knowledge regarding various events and asked to respond as they
would have "had they not known what happened" responded more like subjects
who knew what had happened than those who did not; i.e., they believed that with-
out outcome knowledge they would have assigned significantly higher probabilities
to events reported to have happened than did other, truly outcome-ignorant
subjects.
Extrapolating these results of between-subject comparisons, we hypothesize
that judges may also tend to remember having assigned higher probabilities
than they actually did to events which they subsequently found to have happened
(and vice versa for events which did not). That is to say, the "remembered
or reconstructed probability" of an event will tend to be larger than the
probability originally assigned to it if the event is believed to have occurred,
and smaller if it is believed not to have occurred. The present study tests
this hypothesis.
J
*sm JhL
***^mm*i*me
'i ; IIPMWIIIIMI mi
"^

Fischhoff-Beyth
Method
Design: The effect of outcome knowledge on prediction recall-reconstruction
was tested in the following fashion: subjects estimated the probability of a
number of events whose outcome would be known within a fixed period of time
(Prediction). Sometime after the time period had elapsed, these same subjects
were asked to remember or reconstruct their own predictions as accurately as
possible (Prediction Memory). No mention was made of the Prediction Memory
task at the time of the original prediction. Finally, subjects indicated whether
they thought that each event had or had not occurred on an Information question-
naire which was listributed immediately after the collection of the Prediction
Memory questionnaire. The purpose of the Information questionnaire was to ascer-
tain what each subject believed had happened. It was a fortuitous inclusion,
as subjects frequently disagreed with one another and with "usually reliable"
press reports. The order of the Prediction Memory and Information questionnaires
was such as to obscure the purpose of the experiment. Reversing their order
might be expected to heighten the hypothesized effect by increasing the salience
of what had and had not occurred. Events used were possible outcomes of
President Nixon's visits to China and the USSR in the first half of 1972.
Subjects: Participants in the present experiment were students in an
Advanced Methodology class and an Introductory Psychology class at the Hebrew
Liivcsity of Jerusalem, au 1 an Intermediate Statistics class at the University
of the Nep,ev, Beer Sheba, Israel. All responses were collected on questionnaires
distributed in classrooms. Each class was visited twice, once to distribute
the Prediction (Before) Questionnaire, and once to distribute the Prediction
HA I A aJU
~
^
Fischhoff-Beyth 5
Memory and Information (After) Questionnaires. Due to absenteeism, only 53%
of the subjects who completed Prediction questionnaires were present at the
administration of the Prediction Memory and Information questionnaires. Pre-
diction Memory and Information questionnaires were mailed to all subjects who
had completed the Prediction questionnaires but had not been present for the
second questionnaire administration (and who had provided addresses). Their
responses are designated as Group V. Although most subjects were Hebrew-
speaking, English versions of all questionnaires were available for those who
requested them.
The five experimental groups were:
I. Predictions relating to the China trip made shortly before the
visit (2.20.72. N = 53); recollection shortly after (3.5.72. N = 26).
Subjects: Advanced Methodology class.
II. Predictions relating to the China trip made shortly before the
visit (2,20,72; N = 87); recollection long after (6.11.72; N = 41).
Subjects: Introductory Psychology class.
III. Predictions relating to the USSR trip made shortly before the visit
(5.23.72; N = 34); recollection shortly after (6.6.72; N = 26).
Subjects: students in Intermediate Statistics class.
IV. Predictions relating to the USSR trip long before the visit; recollec-
tion shortly after. Subjects and dates: same as II.
V. Predictions relating to th USSR trip long before the visit (2.20.72;
N = 52); recollection long after (approximately 10.15.72 - week of
mailing, N = 23). Subjects in Advanced Methodology and Introductory
Psychology classes not present in class during the administration of
.*. " hhi .^M^MMBOMflfal

Fischhoff-Beyth
After questionnaires.
Instructions: The following is an example of Prediction instructions:
President Nixon is currently on the eve of his visit to

China. The possible outcomes of this visit are still in
doubt. Commentators have offered a number of possible
outcomes, some of which are presented below. We would
like to have you estimate the probability of each of
these eventualities coming to pass. That is to say, we
would like you to give each outcome a probability value
from 0-100%.
0% there is no chance of the outcome happening.
100% the outcome is certain to happen.
These instructions were appropriately adapted for each group.
The following is an example of Prediction Memory instructions:
As you remember, about two weeks ago, on the eve of

President Nixon's trip to China, you completed a
questionnaire by providing probabilities for the
occurrence of a number of poasible outcomes of the
trip. We are presently interested in the relation
between the quality of peopls's predictions and their
ability to remember their predictions. For this reason,
we would like to have you fill out once again the same
questionnaire which you completed two weeks ago, giving
the same probabilities which you gave then. If you
cannot remember the probability which you then assigned,
give the probability which you would have given to each
of the various outcomes on the eve of President Nixon's
trip to China.
/
The answer sheets of the Prediction and Prediction Memory questionnaires
differed only in the order of the possible outcomes. This was done to prevent
the intrusion of possible incidental memory (e.g., a subject might just happen
to recall what he predicted for the first item of the Prediction questionnaire,
or recall that his last three estimates had been 0%).
Instructions for the Information questionnaires read:
One of our hypotheses is that memory for probability

judgments is influenced by information relating to what
'
J
W "^
w
Fischhoff-Beyth
actually did or did not happen. The enclosed question-

naire is identical to that which you just completed,
except that this time we would like you to indicate
whether or not each event occurred. Beside each possible
outcome you will find the four following possibilities:
A. I believe that the event occurred and was publicized.

B. I believe that the event did not occur.
C. I believe that the event occurred and was not publicized.
D. I don't know.
For each possible outcome of the visit, please circle

that possibility which best suits the information at
your disposal. Circle only one possibility. This is
not a test of your political knowledge, and consultation
with your neighbor is only liable to distort the results.
Students who had not filled out Before, questionnaires, but happened to be
present at the administration of the After questionnaires, were asked to produce
reconstructed probabilities, giving "the probabilities which you would have
given had you been asked on the eve of President Nixon's visit to China (the USSR)."
Sixty-four subjects gave such postdictions of the China trip outcomes (Groups VI,
VII), twenty-seven for the USSR trip outcomes (Group VII).
Outcomes: Fifteen possible outcomes of each trip were presented. They
were chosen so as to: 1) cover most areas of potential activity (especially
those of interest to our subjects); and 2) elicit a wide range of probability

3
values. Representative events are:
China: 1) The U.S.A. will establish a permanent diplo-
matic mission in Peking, but not grant diplomatic
recognition;
2) President Nixon will meet Mao at least once;
3) President Nixon will announce that his trip was
successful;
J
- Jan i i i fchfcn i
Flschhoff-Beyth
USSR: 1) A group of Soviet Jews will be arrested
attempting to speak with President Nixon;
2) The U.S.A. and the USSR will agree to a joint
space program.
Results
Each subject produced two probabilities for each of fifteen possible
outcomes: one before the relevant trip, p , and one after, p; as well as
an answer for the knowledge of outcome question (A, B, C, or D). Thus, for
each outcome, it could be determined whether each subject's responses supported
the hypothesis about the relation between prediction memory and outcome knowledge
(+), contradicted the hypothesis (-), or were irrelevant to the hypothesis (0).
Response sets were evaluated:
+ if p < p and the subject reported A or C (event happened)
if p1 > p and the subject reported B (event did not happen)
- if p. < p_ and the subject reported B (event did not happen)
if p1 > p and the subject reported A or C (event happened)
0 if p, = p and the subject reported A, B, C or D
if the subject reported D or any p. and p
if p, = 100% and the subject reported A or C (the natural ceiling
of the probability measure make 't impossible for p_ to be higher
than p1 in accordance with the hypothesis).
if p. = 0 and the subject reported B (corresponding floor effect).
Inclusion of the relatively few instances in which subjects reported (A or C;
p.. = 100 p2 < 100) or (B; p.. = 0; p > 0) would not have appreciably altered the
J
J - i* I li
T?
Fischhoff-Beyth 9
results presented below and appears to be an unduly conservative policy.
Only 8.3% of all Information responses fell in Category C, (events which
happened but which had not been publicized). As the response patterns for
Category C were quite similar to those for Category A (events which had happened
and had been publicized), the two categories were combined to obtain more stable
estimates. Category A-C refers, then to all events judged to have happened,
whether publicized or not.
For each judge, the numbers of hypothesis-consistent (+) and hypothesis-
inconsistent (-) responses were counted within each Information category
(A-C & B). If the number of +'s was greater than the number of -'s in a
category, the subject was considered by be "hypothesis-supporting" for that

A
category. Each subject's total number of +'s and -'s were also combined
across all three categories. A, B and C, to produce an overall evaluation of
whether the subject was "hypothesis supporting." The number of hypothesis-
supporting and non-supporting subjects in Experimental Groups I-V appear in
Table 1.
INSERT TABLE 1 ABOUT HERE
The number of hypothesis-supporting and non-supporting subjects were compared
by a sign test and the result translated to a normal variate to facilitate
comparison (a negative sign indicates that the majority of subjects were
non-hypothesis-supporting).
Main Effect
In general, the results presented in Table 1 provide support for the
notion that receipt, of outcome knowledge may be associated with systematic
biases in prediction recollection and reconstruction. The combinrd totals
for Groups I-V show that, for about two-thirds of the subjects, mis-remembered
and mis-reconstructed probabilities generally erred in the anticipated direction
J
ma* *At. ik *am riMHdi
m*
- - ^J
Fischhoff-Beyth 10
(z = +3.12; sign test). About three-fourths of these subjects
remembered or reconstructed having assigned higher probabilities than they
actually had to events which they believed had happened (A-C) (z = +4.54; sign
test). However, only fifty-seven percent generally reported lower p_ for events
believed not to have happened (B) (z = +1,08; sign test). The difference in
the proportions of hypothesis-supporting subjects for events which were and
were not perceived to have occurred was significant (z = +2.52). Most of this
difference arose from Groups II and IV (composed of the same subjects responding
to different stimuli) where sixty percent of subjects tended to reconstruct-
remember higher probabilities for events perceived not to have happened.
Several determinants of effect-size may be ascertained. One is the period
of time which elapsed between the estimation and memory tasks. Regarding events
believed to have happened, in Groups II, IV, & V, where three to six months
separated the tasks, some 84% of subjects evidenced the predicted bias; compared
with 67% for Groups I and III, where but two weeks elapsed (z for difference
in proportions = +1.945). For events perceived not to have occurred, this
trend is reversed, owing largely to the negative result with Groups II and IV.
For Groups I and III (short time period) 64% of subjects supported the hypothesis;
for Groups II, IV and V, 51% (z for difference = +1.286).
Another determinant is the apriori likelihood (p..) of events, evidently
reflecting the floor and ceiling imposed by the natural upper and lower limits
of the probability measure. These limits might be expected to attenuate the
present effect in a fairly straightforward fashion. For unlikely events, which
generally did not happen, p1 may have been so low that there was little "room"
(given random fluctuations and the slight regresp jn effect noted below) for
p to be consistently lower when the event did not happen (and conversely for
J
-^ *' Kt^rMin. i mm^^^m^m^mt
^
Fischhoff-Beyth 11
likely events). Thus, the less extreme the Initial probability, the more "room"
there is for the anticipated change and the stronger the effect which may be
expected. Considering the number of hypothesis-supporting responses as a
measure of the size of the effect for individual events, we found a substantial
quadratic (inverted-U) relationship between effect size and median p.. of
individual events (F (2.60) = 9.304, p < .0005).
The size and nature of the effect may be further understood by comparing
typical (median) probabilities remembered and reconstructed for the various
possible p values. Figure 1 presents this information for A-C and B events
separately. For A, B and C events combined (not shown), the regression line
of median p values on p1 is y = 9.4 + .87X (r = .99; df = 19; p < .0005). The
fact that the slope is less than one may be interpreted as a mild regression
toward the mean effect to the extent that the p. are measures of the original
probability estimate, rather than response memories. A regression effect would
mitigate against the research hypothesis, resulting in higher p- for unlikely
events (which tended not to occur), and lower p for likely events (which tended
to occur). Quite a different picture is obtained by considering A-C and B
events separately. The two separate regression lines are highly distinct. For
events perceived to have occurred, p_ tended to be higher than p, for all but
the largest p. values (y = 54 + .37x; r = .80; df = 19; p < .0005). For events
perceived not to have happened, p_ tended to be lower than p1 for all but the
smallest p values (y = 7.0 +.63x; r = .85; df = 19; p < .0005). One summary
description is that remembered-reconstructed probability estimates for A-C
and B events "regressed" about highly distinct means. At the other extremes, few
A-C events were perceived to have been very unlikely (x = 0% intercept equal to
54%); few B events were perceived to have been very likely (x = 100% intercept
equal to 70%).
J
TJ
Fischhoff-Beyth 12
After Only Subjects
All of the subjects considered above (Groups I-V) explicitly stated their
predictions regarding the various trip outcomes (p..). It might be wondered
whether this act improved their memories for cue configurations and the inferences
drawn from the p. and consequently reduced the vulnerability of their reconstruc-
tions to systematic biases. A partial answer to this question may be derived
from the p responses of those subjects merely asked to reconstruct the predic-
tions which they would have provided had they been asked prior to the trips
(Groups VI-VIII). In the absence of p1 responses for these After Only subjects,
their recmstructed probabilities (p) were compared with the median apriori
probabilities (p..) given by the other (Before and After) subjects, on the
assumption that these probabilities were close to what they would have responded,
had they been asked earlier. This mode of analysis is, of course, somewhat
less sensitive than the strictly vithin-subject analysis reported above. Its
power is also reduced by smaller sample size. Nevertheless, It is instructive
that essentially the same results were derived (see Table 2). Over sixty percent
of these After Only subjects generally supported the hypothesis; about two-thirds
did so for events perceived to have occurred; and somewhat over half for events
perceived not to have happened. Thus, there is little evidence that expressly
stating predictions reduces the vulnerability of reconstructions to systematic
biases of the type under consideration here.
INSERT TABLE 2 ABOUT HERE
Surprisingness
If a "surprise" is defined as the occurrence of an unlikely event or the
non-occurrence of a likely event, one result of the bias considered here is to
'S:
i -.
ffi!
J
%
v
Fischhoff-Beyth 13
reduce the "surprisingnepo" of the past: the occurrence of an event increases
its reconstructed probability and makes it less surprising than it would have
been had the original probability been remembered. The surprisingness of a set
of events in the light of predictions may be ascertained by evaluating the per-
centage of events assigned various probabilities perceived to occur. For a per-
fectly calibrated set of judgments, X% of those events assigned X% probability
of occurrence would actually occur. The percentage of events assigned X%
probability which were perceived to have occurred was calculated separately for
the p and p responses of Before and After subjects, and for the p responses
of After Only subjects. These results appear in Figure 2. Due to subjects'
tendency to use round numbers and the very large quantities of data needed to
obtain stable occurrence rate estimates only 12 probability categories were
used: 0-4%, 5-9% 10-19%, 20-29%, 30-39%, 40-49%, 50-59%, 60-69%, 70-79%, 80-89%,
90-99%, and 100%. Roughly equal numbers of the 1921 Before, 1909 Before and
After and 832 After Alone predictions fall into each category.
INSERT FIGURE 2 ABOUT HERE
A considerably smaller proportion of unlikely events (p < 30%) and a
somewhat larger proportion of likely events were perceived to have occurred in
retrospect (p) than in the light of p.. That is to say, subjects reconstructed-
remembered having been less surprised by the events which did and did not occur
in the course of President Nixon's trip than they really should have been
(judging by their own predictions). The original predictions were, in general,
quite well calibrated, except with regard to unlikely events where they met
too many substantial surprises: ten percent of the events assigned 0% probabi-
lity of occurrence were perceived to have occurred, as well as 16% of those
J
BPMQt

Fischhoff-Beyth 1A
assigned 5% probability. lu contrast, in the light of p , there were too few
big surprises. Very few events with remembered-reconstructed probabilities
less than 30% were perceived to have occurred. All groups somewhat underestimated
very likely probabilities (90% lv - 100%), i.e., encountered too many unlikely
occurrences. Thus, although very few events which happened had low reconstruc-
ted probabilities, there were still some events which did not happen with high
reconstructed probabilities. This is consistent with the differential effect
obtained with events which did and did not happen.
Discussion
Why are remembered probabilities biased in the manner shown above? Two
explanations, each applicable to a different tactic which subjects might adopt
in retrieving p , seem particularly attractive. Both reflect the notion of
judgmental "anchoring and adjustment". As described in Slovic (1972), "In this
process, a natural starting point is used as a first approximation to the
judgment, an anchor so to speak. This anchor is then adjusted to accomodate
the implications of additional information. Typically, the adjustment is a
I crude and imprecise one which fails to do justice to the importance of additional
... /
information." (p 16) Given the original creeping determinism results (Fischhoff,
1974), it may be assumed that After judges have a mental set, a "state of mind,"
in which reported outcomes tend to appear more likely than they did before
their occurrence.
The prediction memory judge intent upon retrieving p1 may try to do so
by first retrieving his own previous (Before) state of mind, and then reestimating
P1. That is to say, he might ask himself, "considering what I knew then, how
likely did the event seem?" He may, however, find himself so "anchored" in his
present (After) state of mind that his previous state is beyond retrieval, i.e.,
his adjustment is inadequate. The probability value which he produces from
J
,.%'î,....,.. ^.k.- i -'-
ii
^
Fischhoff-Beyth 15
this underadjusted state of tfind (p ) will tend to lie between what he presently
believes (his postdicted probability) and what he originally believed (P,).
That is to say, p will tend to be higher than p.. for events reported to have
happened, lower for events reported not to have happened. If for example he
judges the likelihood of events by his ability to build scenarios leading to their
occurrence (Tversky & Kahneman, 1973b), he may find scenarios of occurrence
more available in reconstruction.
An alternative, although related, approach to retrieving p1 is to take
the nostdictive probability of the outcome as an anchor and to adjust upward
or downward from there, as seems appropriate. However valid the perceived reasons
for adjustment, the combination of creeping determinism and underadjustment
would lead to the effect studied here. The judge may, for example, find it
difficult to imagine how he could ever have imagined that things could work
out otherwise.
The differential effect with A-C and B events was an unexpected and
interesting result meriting further attention. One possible explanation is
that reports of non-occurrence tend to have a smaller and/or more readily
/^ ignored (eliminated) impact on the judge's "state of mind" than reports of
occurrence. If as E. H. Carr (1961) claims, "history is by and large a record
of what people did and not of what they failed to do " (p. 26), reports of non-
occurrence may tend not even to be noticed. Possibly, distinguishing between
"events reported not to have happened" and "events not reported which have not
happened", as we distinguished between A and C events, would sharpen the analysis.
i- A supplementary explanation relevant to these particular stimulus materials
arises from the fact that the Nixon trips were noted more for what did not happen
than what did. Whatever their symbolic and long-range significance, there were
fewer substantive results than many observers anticipated. Such non-occurrences
- ' -^ riMMl
^7
laiiiM^M
"^
Fischhoff-Beyth 16
as observers did note may have included many acknowledged "surprises." After
judges may have remembered the surprisingnejs of these non-occurrences and tended
to reconstruct p. higher than p.; actually, even if remembering surprises
merely erased the tendency for p to be lower than p1, random fluctuations,
along w.^th the slight regression effect, would have produced many instances
of p higher than p,. An additional situation-specific consideration is the
fact that none of the outcomes could have happened had the trips been cancelled,
a real possibility at the time of p estimation. In retrospect, however, the
doubt which surrounded the trip may be unavailable and the likelihood of
contingent outcomes enhanced.
The "surprisingness" of a set of events might be defined as the extent
to which unlikely events are perceived to occur and likely events not to occur.
For a judge evaluating his own performance, the surprisingness of a set of
events is an indicator of his degree of understanding of those events. For
the judge with perfect knowledge of a set of determinate events, there will be
no "surprises," as he assigns 100% and 0% to A-C and B events, respectively.
The more surprising a set of events is perceived to be, the greater the negative
4
feedback and impetus to learn from experience which it presumably provides.
In this light, the above results reflect a retrospective reduction in the
surprisingness of the events judged, a reduction which also constitutes a
tendency to convert negative feedback to positive. Although a causal link has
not been established it seems reasonable to speculate that once distorted in
memory, knowledge of unexpected outcomes may actually encourage ineffective
predicting instead of compelling the judge to improve his prediction-producing
mechanisms. The judge who is insufficiently aware of the surprises the past held
- - '*'-" .*. - - M>

"^
Fischhoff-Beyth 17
for him, and of the need to improve his performance, seems likely to continue
being surprised by what happens in the future. Figure 2 offers the contrast
of a relatively surprise-free past (p9) with a relatively surprise-full future
(p ) although, of course, here judgments of the future temporally preceded
those of the past. The "inertia effect" reported by Geller and Pitz (1968) is
one case in which judges' conversion of negative feedback to positive is
detrimental to learning.
Consider also a judge who has been raucht unprepared by some turn of
events. Looking backward, he "remembers" that what happened seemed to him to
have b(?en relatively likelier before its occurrence than it actually was. He
may conclude that he, more or less, "knew that it was iôing to happen," but wasn't
ready for it when it did, and that in the future he'll do better. If, for example,
P1 = 30% and p = 50%, he might decide that next time he'll be doubly ready
for any 50% likely eventwhich would leave him unprepared for the occurrence
of a similarly likely (p- = 30%) event. Had he remembered his own prediction,
he might have learned that the data at his disposal is quite indeterminate
and that he should be ready for a substantial number of surprises. As Wohl-
stetter (1962) noted in concluding her study of the surprise attack at Pearl
Harbor, "We have to accept the fact of uncertainty and learn to live with it.
No magic, in code or otherwise, will provide certainty. Our plans must work
without it" (p. 401).
.A-
TT
18
REFERENCES
Berlin, I. Historical Inevitability. Oxford University Press, London, 195A.
Carr, E. H. What is History? Macmillan, London, 1961.
Fischhoff, B. Some Aspects of PostJictive Judgment. Unpublished doctoral
dissertation, 1974, Hebrew University of Jerusalem, Jerusalem.
Geller, E. S., & Pitz, G. F. Confidence and decision speed in the revision of
opinion. Organizational Behavior and Human Performance, 1968, 3^, 190-201.
Slovic, P. From Shakespeare to Simon: Speculationsand some evidenceabout
man's ability to process information. Oregon Research Institute Research
Monograph, 1972, 12, No. 12.
Tversky, A., & Kahneman, D. Judgment under uncertainty: heuristics ana biases.
Oregon Research Institute Research Bulletin, 1973a, 13_, No. 1.
Tversky, A., & Kahneman, D. Availability: a heuristic for judging frequency
and probability. Cognitive Psychology, 1973b, 5^, 207-232.
Wohlstetter, R. Pearl Harbor: Warning and Decision. Stanford University
Press, Stanford, 1962.
J
_>ai.
tfutoi *^ mm
"'s T$
19
Footnotes
1. This research was supported by the Advanced Research Projects Agency

of the Department of Defense (ARPA Order No. 2A49), and was monitored by ONR
under Contract No. N00014-73-C-0438 (NR 197-026).
2. We are indebted to Professor D. Kahneman, Professor A. Tversky and

Professor P. Slovic for comments on a previous '/ersion of this article.
3. Details on the individual events used and subjects' responses to them may
be obtained from the at^.hors.
A. An implicit assumption throughout this discussion is that, to be effective,

learning from experience must be at least partially conscious. A case might
he made that what is important for learning in the present context is that
postdictive probabilities be in order, and not that predictive probabilities
be remembered and the reasons for the prediction-postdiction difference recog-
nized. Considering the evidence available, we believe both that this is usually
not the case and that postdictive probabilities are generally not in order,
for reasons mentioned in the text and in Fischhoff (1974).
^
TABLE 1
Number of generally hypthesis-supporting (+) and non-supporting (-) subjects,
for each experimental group and Information response category (A-C, B, overall)
(A-C,B, overall)
Group I (N = 29) Group II (N = 41)

China: shortly before, shortly after China; shortly before, long after
A-C B overall A-C B overall
+ 17 15 17 + 30 15 26
- 7 8 11 7 22 14
z + 1.84 + 1.25 + .99 z +3.62 -.99 + 1.74
%+ 70.8 65.2 60.7 %+ 81.1 40.5- 65.0
Group III (N = 26) Group IV (N = 41.)

USSR; shortly before, shortly after USSR: long b eforc. shortly after
+ 14 13 15 + 30 15 19
- 8 8 8 5 24 13
z + 1.06 + .87 + 1.25 z +4.04 -1.28 + .88
%+ 63.6 61.9 65.2 %+ 85.7 38.5 59.4
Group V (N = 23) Groups I,II,III,V combined (N = 116)

USSR: long before, long after Groups 1,111,IV,V combined (N = 116: in
A-C B overall parentheses)
A-C B overall
+ 17 15 18 + 74(74) 58(58) 76(69)
- 3 6 4 - 26(24) 44(46) 37(36)
z + 3.10 + 1.76 + 2.88 z +4.54(+4.72) +1.29C+1.08) +3.58(+3.12)

%+ 85.0 71.4 83.6 %+ 74.0(75.5) 56.9(55.8) 67.2(65.7)
As Groups II and IV crnsist of the same subjects responding to different

questions, they were not combined.
J
gjhi j. t ,. û. ,
a^m Mta
^
TABLE 2
Number of Hypothesis-Supporting and Non-Supporting Subjects for Groups
Who Only Responded After the Events.
Group VI (N 27) Group VII (N = 27)

USSR: no before, shortly after China: no before, long after
+ 20 10 16 + 16 13 14
- . 4 14 8 - 5 13 12
z +3.10 -.38 + 1.61 z + 2.56 .00 + .19
%+ 83.3 41.7 66.7 %+ 76.2 50.0 53.8
Group VIII (N = 37) Groups VI, VIII combined (N = 64)

China: no before, long after Groups VII, VIII combined (N = 64 ; in
,1
A-C B overall parentheses)'
A-C B- overall
+ 18 22 24 + 38(34) 32(35) 40(38)
13 12 13 17(18) 2^(25) 21(25)
z +.71 + 1.54 + 1.64 z +2.69(+1.90) +.92(+1.16) +2.56(+1.51)
%+ 58.1 64.7 64.9 %+ 69.1(65.4) 55.2(58.3) 65.6(60.3)
Note: Each S^ s V responses were compared with the median]* responses derived
from Groups I-V.
As Groups VI and VII consist of the same subjects responding to different

questions, they were not combined.
J
î ""i mi I ttmmmmm^tmU^mî ^
ÎP
^r
^
100 >?
()
(D)
80-
c (e) .. o
>
60
00
<
CO
o
Q. 0 o (O)
CC
UJ
\ 40 0 (0)
<
D
UJ
5
20- oo .o' (O)
- o (o)
3 ..'
I
20 40 60 80 100
BEFORE PROBABILITY (in x)
Regression lines:
Perceived to have happened y= 54*.37x r = .80 (df = l9)
O Perceived not to have happened y= 7 + .63x r = .85 (df = l9)
figure 1. Median After probabilities of events assigned Before probability of X%;

presented separately for events perceived to have happened (darkened circles) and
perceived not to have happened (open circles). Parentheses indicate median deter-
mined by five or fewer judgments.
i Mhi
^
mr^ ff.^C3 "; r-
^
100 -
a
Before, P,
.... After with Before, p2
a
a:
-

r
After a orê, P2
80
o
li)
>
<
-
-M
.y
X
i/ /
S60
>
"
/rf
iiD40 /
>
fe
at
j^J/
O,
z,
20 -
QC
LU
^^T *
r ,'S
.s /
Q. ."' /
!l^ ' / /
" ^-
5w , 1 1 1 1 1 i i I I
20 40 60 80 100
EVENT PROBABILITY (in %)
Figure 2. Percentage of events perceived to have occurred of those assigned X%

probability of occurrence.
â aa _ t_J hfc MMMMll

"I Knew It Would Happen" - Remembered Probabilities of Once-Future Things

Încărcat de

Informații document

Titlu original

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

"I Knew It Would Happen" - Remembered Probabilities of Once-Future Things

Încărcat de

Drepturi de autor:

Formate disponibile

^ ''/ I

"I KNEW IT WOULD HA PPEN" - - REME MBERE D

Oregon Research Institute

Office of Naval Research

Oregon Research Institute Unclassified

"I Knew It Would Happen'^..Remembered Probabilities of Once- Future Things

4 DESCRIPTIVE NOTES (Typ* ot report and Ineluilv dmli)

Baruch Fischhoff and Ruth Beyth

N0001A-73-C-0438 Oregon Research Institute Research Bulleti

10. DISTRIBUTION STATEMENT

II. SUPPLEMENTARY NOTES 12. SPONSORING MILITARY ACTIVITY

Office of Naval Research

nf\ fOMtn 4 M TfO 'LAC DO FOMM I4T. t J AN , WHICH I*

:i. Security lasslflcatlon

\tU Sscuritr CUttlflceUon

"T KNEW IT WOULD HAPPEN"

REMEMBERED PROBABILITIES OF OKCE-FUTURE THINGS

Baruch Fischhoff Ruth Beyth

Most judges engaged in predictive tasks are presumably interested in

improving their own performance. A logical first step in this direction is

has subsequently happened. In order to be evaluated, these predictions must,

of course, either be remembered per se or reconstructed on the basis of what

effectiveness of the evaluation process depends in part upon the veracity of

these remembered or reconstructed predictions. Little, if anything, however,

suggest that remembered predictions may be consistently biased in a manner

reported to have happened tend to be assigned higher postdictive than predictive

probabilities; i.e., reporting an event's occurrence increases Its perceived

inevitability. This tendency was named "creeping determinism" as it expresses

a tendency toward determinism which is nonetheless short of that advocated by

theories of historical inevitability (Berlin, 1954; Carr, 1961).

To summarize briefly the more detailed discussion appearing in Fischhoff

(1974), creeping determinism seems most readily understood through consideration

of the demand characteristics of the retrospective judge's task. Typically,

past. Attempting to understand why a particular outcome occurred seems, among

integrated into coherent explanatory patterns. Unintegratable data tend to

be forgotten, deemphasized, or reinterpreted to fit the dominant explanation.

Postdicted probabilities are estimated on the basis of such "updated" sets

of event-descriptive data. Given this mode of outcome knowledge processing,

the judgmental heuristics for probability estimation described by (Tversky &

corresponding predictive estimates.

Although the name creeping determinism has clear pejorative connotations,

in many cases the postdictive probability of events which have happened is

justifiably higher than the corresponding predictive probability. Consider

/ sampling with replacement from an urn containing an unspecified proportion of

the predicted probability. It is, however, our conviction that in real-life

such retrospective increases frequently constitute little more than facile

reductions in the "surprisingness" of what has happened. Rather than reflecting

some "wisdom of hindsight", they seem to reflect what might be called a

"knew It all along" attitude.

VeriTication of such suspicions is only possible in the relatively rare

(for real-life judges) instances in which a well-defined model of the data-

generating process is available. A model allows the calculation of predictive

and postdictive probabilities, as well as actual data-diagnosticity. As our

primary interest is the judgment of unique events, we have been studying an

interesting side effect of creeping determinism whose non-normative status is

readily established and whost; consequences are of considerable interest in

incapable of assessing the changes in their judgments induced by possession of

to events reported to have happened than did other, truly outcome-ignorant

Extrapolating these results of between-subject comparisons, we hypothesize

or reconstructed probability" of an event will tend to be larger than the

probability originally assigned to it if the event is believed to have occurred,

Design: The effect of outcome knowledge on prediction recall-reconstruction

was tested in the following fashion: subjects estimated the probability of a