Documente Academic
Documente Profesional
Documente Cultură
AD-780 971
Baruch Fischhoff, et al
Prepared for:
May 1974
DISTRIBUTED BY:
LKJU
National Technical Information Service
U. S. DEPARTMENT OF COMMERCE
5285 Port Royal Road, Springfield Va. 22151
J
ii nl
'*
"P" ^^
Unclassified
Security Claifi_caUon_ /?/) vro 9 7/
DOCUMENT CONTROL DATA -R&D
fSgcuflty clmtiltlctllon ot llll; body ol tOtltmcl mnd Indtnlnj onnolmllon muH bt nfnd mhn Ihm ovimtt rmporl IM clmstUfdj
I ORIGINATING C T I VI T Y fCoipor! Oul/lOf) 2. RCPOMT tCCURITY CLASSIFICATION
3 REPORT TITL E
< REPORT DATE 7a. TOTAL NO. Of PACES 7b. NO. OF RKFS
May, 1974 8
ia. CONTRACT OR GRANT NO. a. ORIGINATOR'S REPORT NUMVCRdt
This document has been approved for public release and sale, Its distribution
is unlimited.
Judges who had estimated the likelihood of various possible outcomes of President
Nixon's trips to Peking and Moscow were unexpectedly asked to remember, or reconstru t
in the event that they had forgotten, their own predictions some time after the
visits were completed. In addition, they indicated whether or not the., thought that
each event had ii fact occurred, lemembered-reconstructed probabilities were
generally higher than the originally assigned prcbabilities for events believed to
have occurred and lower for those which had not (although the latter effect was
less pronounced). In their original predictions, subjects overestimated low pro-
babilities and underestimated high probabilities, although they were generally
accurate. Judging by their reconstructed-remembered probabilities, however, subject;
seldom perceived having been very surprised by what had or had not happened. These
results are discussed in terms of cognitive "anchoring" and possible detrimental
etfects of outcome-feedback.
'"NATIONALTFCHNICAL
INFORMATION SERVICE
U S DopaMmont of Commerce
SpriiiKfinld VA 22151
i J
J
-stai ' -* ^L. mm
T ^
BS'J^J _..,,.-
uncxassitled
Security Cl>>inctloti
LSNK A LINK C
KBV WONOt
NOLK WT MOLK HOL
Judgment
Prediction
Decision Making
Uncertainty
Information Processing
-i_^Mai^
T$
-1.
ro evaluate the accuracy of their own past predictions in the light of what
judges remember having known about the event at the time of the original pre-
diction or estimated on the basis of the event's post facto likelihood. The
is known about the extent of systematic or random error in remembere-1. and re-
constructed predictions
For some time, we have been studying the differences between predictive
and postdictive (post facto) judgment (Fischhoff, 1974). Some of our results
r-plored in the study reported here. In particular, we have found that events
at I M> ArfBrm
Fischhoff-Beyth 2
judges are called upon to predict the future and to "make sense" out of the
other things, to increase the salience of data and reasons which can be
Kahneman fl973a), imoly that oostdictive probabilities will be higher than the
blue. The fifth ball drawn is blue. Prior to the fifth drawing, the probability
of a blue ball was 50%, following the drawing, that probability is properly
evaluated as greater than 50%, i.e., the postdicted probability is higher than
J
_.i i i>\fc nag ^.i-
I III I
Fischhoff-Beyth 3
their own right. In particular, we have found that judges appear to be generally
outcome knowledge. A further experiment showed that subjects who were provided
with outcome knowledge regarding various events and asked to respond as they
would have "had they not known what happened" responded more like subjects
who knew what had happened than those who did not; i.e., they believed that with-
out outcome knowledge they would have assigned significantly higher probabilities
subjects.
that judges may also tend to remember having assigned higher probabilities
than they actually did to events which they subsequently found to have happened
(and vice versa for events which did not). That is to say, the "remembered
and smaller if it is believed not to have occurred. The present study tests
this hypothesis.
J
*sm JhL
***^mm*i*me
'i ; IIPMWIIIIMI mi
"^
Fischhoff-Beyth
Method
number of events whose outcome would be known within a fixed period of time
(Prediction). Sometime after the time period had elapsed, these same subjects
task at the time of the original prediction. Finally, subjects indicated whether
they thought that each event had or had not occurred on an Information question-
naire which was listributed immediately after the collection of the Prediction
tain what each subject believed had happened. It was a fortuitous inclusion,
as subjects frequently disagreed with one another and with "usually reliable"
press reports. The order of the Prediction Memory and Information questionnaires
was such as to obscure the purpose of the experiment. Reversing their order
of what had and had not occurred. Events used were possible outcomes of
President Nixon's visits to China and the USSR in the first half of 1972.
of the Nep,ev, Beer Sheba, Israel. All responses were collected on questionnaires
HA I A aJU
~
^
Fischhoff-Beyth 5
diction Memory and Information questionnaires were mailed to all subjects who
had completed the Prediction questionnaires but had not been present for the
speaking, English versions of all questionnaires were available for those who
requested them.
II. Predictions relating to the China trip made shortly before the
III. Predictions relating to the USSR trip made shortly before the visit
IV. Predictions relating to the USSR trip long before the visit; recollec-
After questionnaires.
/
The answer sheets of the Prediction and Prediction Memory questionnaires
differed only in the order of the possible outcomes. This was done to prevent
the intrusion of possible incidental memory (e.g., a subject might just happen
to recall what he predicted for the first item of the Prediction questionnaire,
J
W "^
w
Fischhoff-Beyth
Students who had not filled out Before, questionnaires, but happened to be
given had you been asked on the eve of President Nixon's visit to China (the USSR)."
Sixty-four subjects gave such postdictions of the China trip outcomes (Groups VI,
recognition;
successful;
J
- Jan i i i fchfcn i
Flschhoff-Beyth
space program.
Results
outcomes: one before the relevant trip, p , and one after, p; as well as
an answer for the knowledge of outcome question (A, B, C, or D). Thus, for
the hypothesis about the relation between prediction memory and outcome knowledge
(+), contradicted the hypothesis (-), or were irrelevant to the hypothesis (0).
p.. = 100 p2 < 100) or (B; p.. = 0; p > 0) would not have appreciably altered the
J
J - i* I li
T?
Fischhoff-Beyth 9
happened but which had not been publicized). As the response patterns for
Category C were quite similar to those for Category A (events which had happened
and had been publicized), the two categories were combined to obtain more stable
estimates. Category A-C refers, then to all events judged to have happened,
(A-C & B). If the number of +'s was greater than the number of -'s in a
category. Each subject's total number of +'s and -'s were also combined
Table 1.
non-hypothesis-supporting).
Main Effect
for Groups I-V show that, for about two-thirds of the subjects, mis-remembered
J
ma* *At. ik *am riMHdi
m*
- - ^J
Fischhoff-Beyth 10
actually had to events which they believed had happened (A-C) (z = +4.54; sign
test). However, only fifty-seven percent generally reported lower p_ for events
believed not to have happened (B) (z = +1,08; sign test). The difference in
were not perceived to have occurred was significant (z = +2.52). Most of this
difference arose from Groups II and IV (composed of the same subjects responding
of time which elapsed between the estimation and memory tasks. Regarding events
believed to have happened, in Groups II, IV, & V, where three to six months
separated the tasks, some 84% of subjects evidenced the predicted bias; compared
with 67% for Groups I and III, where but two weeks elapsed (z for difference
trend is reversed, owing largely to the negative result with Groups II and IV.
For Groups I and III (short time period) 64% of subjects supported the hypothesis;
reflecting the floor and ceiling imposed by the natural upper and lower limits
generally did not happen, p1 may have been so low that there was little "room"
(given random fluctuations and the slight regresp jn effect noted below) for
p to be consistently lower when the event did not happen (and conversely for
J
-^ *' Kt^rMin. i mm^^^m^m^mt
^
Fischhoff-Beyth 11
likely events). Thus, the less extreme the Initial probability, the more "room"
there is for the anticipated change and the stronger the effect which may be
measure of the size of the effect for individual events, we found a substantial
The size and nature of the effect may be further understood by comparing
possible p values. Figure 1 presents this information for A-C and B events
separately. For A, B and C events combined (not shown), the regression line
fact that the slope is less than one may be interpreted as a mild regression
toward the mean effect to the extent that the p. are measures of the original
events (which tended not to occur), and lower p for likely events (which tended
events separately. The two separate regression lines are highly distinct. For
events perceived to have occurred, p_ tended to be higher than p, for all but
the largest p. values (y = 54 + .37x; r = .80; df = 19; p < .0005). For events
perceived not to have happened, p_ tended to be lower than p1 for all but the
smallest p values (y = 7.0 +.63x; r = .85; df = 19; p < .0005). One summary
and B events "regressed" about highly distinct means. At the other extremes, few
A-C events were perceived to have been very unlikely (x = 0% intercept equal to
54%); few B events were perceived to have been very likely (x = 100% intercept
equal to 70%).
J
TJ
Fischhoff-Beyth 12
All of the subjects considered above (Groups I-V) explicitly stated their
whether this act improved their memories for cue configurations and the inferences
drawn from the p. and consequently reduced the vulnerability of their reconstruc-
from the p responses of those subjects merely asked to reconstruct the predic-
tions which they would have provided had they been asked prior to the trips
(Groups VI-VIII). In the absence of p1 responses for these After Only subjects,
their recmstructed probabilities (p) were compared with the median apriori
probabilities (p..) given by the other (Before and After) subjects, on the
assumption that these probabilities were close to what they would have responded,
had they been asked earlier. This mode of analysis is, of course, somewhat
less sensitive than the strictly vithin-subject analysis reported above. Its
that essentially the same results were derived (see Table 2). Over sixty percent
of these After Only subjects generally supported the hypothesis; about two-thirds
did so for events perceived to have occurred; and somewhat over half for events
perceived not to have happened. Thus, there is little evidence that expressly
Surprisingness
'S:
i -.
ffi!
J
%
v
Fischhoff-Beyth 13
its reconstructed probability and makes it less surprising than it would have
been had the original probability been remembered. The surprisingness of a set
probability which were perceived to have occurred was calculated separately for
the p and p responses of Before and After subjects, and for the p responses
tendency to use round numbers and the very large quantities of data needed to
used: 0-4%, 5-9% 10-19%, 20-29%, 30-39%, 40-49%, 50-59%, 60-69%, 70-79%, 80-89%,
90-99%, and 100%. Roughly equal numbers of the 1921 Before, 1909 Before and
After and 832 After Alone predictions fall into each category.
retrospect (p) than in the light of p.. That is to say, subjects reconstructed-
remembered having been less surprised by the events which did and did not occur
in the course of President Nixon's trip than they really should have been
quite well calibrated, except with regard to unlikely events where they met
too many substantial surprises: ten percent of the events assigned 0% probabi-
J
BPMQt
Fischhoff-Beyth 1A
less than 30% were perceived to have occurred. All groups somewhat underestimated
very likely probabilities (90% lv - 100%), i.e., encountered too many unlikely
occurrences. Thus, although very few events which happened had low reconstruc-
ted probabilities, there were still some events which did not happen with high
Discussion
Why are remembered probabilities biased in the manner shown above? Two
I crude and imprecise one which fails to do justice to the importance of additional
... /
information." (p 16) Given the original creeping determinism results (Fischhoff,
1974), it may be assumed that After judges have a mental set, a "state of mind,"
in which reported outcomes tend to appear more likely than they did before
their occurrence.
by first retrieving his own previous (Before) state of mind, and then reestimating
P1. That is to say, he might ask himself, "considering what I knew then, how
likely did the event seem?" He may, however, find himself so "anchored" in his
present (After) state of mind that his previous state is beyond retrieval, i.e.,
J
,.%'^i,....,.. ^.k.- i -'-
ii
^
Fischhoff-Beyth 15
this underadjusted state of tfind (p ) will tend to lie between what he presently
That is to say, p will tend to be higher than p.. for events reported to have
happened, lower for events reported not to have happened. If for example he
judges the likelihood of events by his ability to build scenarios leading to their
or downward from there, as seems appropriate. However valid the perceived reasons
would lead to the effect studied here. The judge may, for example, find it
difficult to imagine how he could ever have imagined that things could work
out otherwise.
The differential effect with A-C and B events was an unexpected and
of what people did and not of what they failed to do " (p. 26), reports of non-
"events reported not to have happened" and "events not reported which have not
arises from the fact that the Nixon trips were noted more for what did not happen
than what did. Whatever their symbolic and long-range significance, there were
- ' -^ riMMl
^7
laiiiM^M
"^
Fischhoff-Beyth 16
as observers did note may have included many acknowledged "surprises." After
judges may have remembered the surprisingnejs of these non-occurrences and tended
merely erased the tendency for p to be lower than p1, random fluctuations,
along w.^th the slight regression effect, would have produced many instances
fact that none of the outcomes could have happened had the trips been cancelled,
doubt which surrounded the trip may be unavailable and the likelihood of
to which unlikely events are perceived to occur and likely events not to occur.
the judge with perfect knowledge of a set of determinate events, there will be
The more surprising a set of events is perceived to be, the greater the negative
4
feedback and impetus to learn from experience which it presumably provides.
mechanisms. The judge who is insufficiently aware of the surprises the past held
Fischhoff-Beyth 17
for him, and of the need to improve his performance, seems likely to continue
being surprised by what happens in the future. Figure 2 offers the contrast
those of the past. The "inertia effect" reported by Geller and Pitz (1968) is
detrimental to learning.
Consider also a judge who has been raucht unprepared by some turn of
have b(?en relatively likelier before its occurrence than it actually was. He
may conclude that he, more or less, "knew that it was i^oing to happen," but wasn't
ready for it when it did, and that in the future he'll do better. If, for example,
P1 = 30% and p = 50%, he might decide that next time he'll be doubly ready
for any 50% likely eventwhich would leave him unprepared for the occurrence
of a similarly likely (p- = 30%) event. Had he remembered his own prediction,
he might have learned that the data at his disposal is quite indeterminate
stetter (1962) noted in concluding her study of the surprise attack at Pearl
Harbor, "We have to accept the fact of uncertainty and learn to live with it.
No magic, in code or otherwise, will provide certainty. Our plans must work
.A-
TT
18
REFERENCES
Geller, E. S., & Pitz, G. F. Confidence and decision speed in the revision of
Tversky, A., & Kahneman, D. Judgment under uncertainty: heuristics ana biases.
J
_>ai.
tfutoi *^ mm
"'s T$
19
Footnotes
3. Details on the individual events used and subjects' responses to them may
be obtained from the at^.hors.
TABLE 1
for each experimental group and Information response category (A-C, B, overall)
(A-C,B, overall)
J
gjhi j. t ,. ^u. ,
a^m Mta
^
TABLE 2
Note: Each S^ s V responses were compared with the median]* responses derived
from Groups I-V.
J
^i ""i mi I ttmmmmm^tmU^m^i ^
^IP
^r
^
100 >?
()
(D)
80-
c (e) .. o
>
60
00
<
CO
o
Q. 0 o (O)
CC
UJ
\ 40 0 (0)
<
D
UJ
5
- o (o)
3 ..'
I
20 40 60 80 100
BEFORE PROBABILITY (in x)
Regression lines:
Perceived to have happened y= 54*.37x r = .80 (df = l9)
O Perceived not to have happened y= 7 + .63x r = .85 (df = l9)
i Mhi
^
mr^ ff.^C3 "; r-
^
100 -
a
Before, P,
.... After with Before, p2
a
a:
-
r
After a or^e, P2
80
o
li)
>
<
-
-M
.y
X
i/ /
S60
>
"
/rf
iiD40 /
>
fe
at
j^J/
O,
z,
20 -
QC
LU
^^T *
r ,'S
.s /
Q. ."' /
!l^ ' / /
" ^-
5w , 1 1 1 1 1 i i I I
20 40 60 80 100
EVENT PROBABILITY (in %)