Documente Academic
Documente Profesional
Documente Cultură
x
.....................................................................................................................................................................................................................
Summary
We previously defined surgical list ‘efficiency’ as: maximising theatre utilisation, minimising over-
running, and minimising cancellations. ‘Efficiency’ maximises output for input; ‘productivity’
emphasises total output. We define six criteria that any measure of productivity (better termed
‘quantitative performance’) needs to satisfy. We then present a theoretical analysis that fulfils these
by incorporating: ‘speed’ of surgery (with reference to average speeds), ‘patient contact’ (synon-
ymous with minimising gaps between cases), and ‘efficiency’ (as previously defined). ‘Speed’ and
‘patient contact’ together constitute a ‘productive potential’. Our formula satisfies the pre-set
criteria and yields plausible results in both hypothetical and real data sets, To be productive in these
quantitative terms, teams in any specialty need to achieve minimum quality standards defined by
their sub-specialty; to plan their lists to utilise the time available with no cancellations or over-runs
and to work at least as fast as average with minimal gaps between cases. ‘Productive potential’
combined with ‘efficiency’ yielding ‘actual productivity’ in our theoretical analysis more compl-
etely describes quantitative surgical list performance than any other single measure.
. ......................................................................................................
Correspondence to: Dr Jaideep J. Pandit
E-mail: jaideep.pandit@dpag.ox.ac.uk
Accepted: 26 November 2008
With nearly seven million hospital operations performed of performance [5]. However it would seem challeng-
each year in England and Wales and an annual NHS ing to develop a measure encompassing all surgical
budget of > £1 billion, operating theatres are a significant teams, undertaking as they do different combinations of
expense [1]. As all stakeholders – patients, politicians, procedures.
staff, managers – seek to maximise benefits from them, The word ‘productivity’ has various connotations in
proper measurement of processes and outcomes becomes economics, engineering, or administration. The Orga-
necessary. nisation for Economic Co-operation and Development
Previously, we concluded that surgical operating list (OECD) offered more than 20 meanings, recognising
‘efficiency’ is the completion of all the scheduled that ‘productivity’ and ‘efficiency’ are often confused
operations (that is, with no cancellations) whilst utilising and emphasising that each industry needs to develop its
properly the time available (that is, no under- or over- own measures [6, 7]. For surgical operating lists, we
runs) [2, 3]. In response, Sanders et al. observed that this therefore use the engineering sense: ‘efficiency’ mea-
notion of ‘efficiency’ was possibly incomplete [4]. It is sures how well the service functions (that is, work
theoretically possible for two teams to be equally efficient, output for costs or effort); ‘productivity’ measures the
fully utilising their time without over-runs or cancella- output. It is theoretically possible for a service to be
tions, but one team consistently completes more work highly efficient but have low total work output, and
than the other. Thus, there is a need to define ‘produc- conversely to work at high output, but extremely
tivity’ in addition to ‘efficiency’ to complete the picture inefficiently [4, 5].
These preliminary considerations led us to define – a booking and completing just two nephrectomies within
priori – the following six core criteria for an ideal the scheduled time. But – as a matter of policy – it is the
quantitative performance measure: latter that is properly better performing.
1 the measure should not be influenced by casemix: We emphasise another caveat already alluded to:
the type of operations conducted – or co-morbidities measuring quantitative performance is only meaningful
prolonging surgery commonly present in associated where predefined quality or safety standards are met. An
patient groups – should not influence whether a team is increasing number of specialties, most notably cardiac
regarded as ‘productive’. Thus, intrinsic procedure surgery, now define minimum standards for risk-adjusted
duration should not be influential (that is, short and mortality rates [14]. It is a deficiency that few sub-
long procedures are equally potentially productive) [12]; specialties offer such clear standards, but applying pro-
2 adoption of new techniques to achieve the same ductivity measures is inappropriate for teams that fail to
surgical aim should not be influential. For example, meet existing criteria. By analogy, a factory making more
even after any ‘learning curves’, laparoscopic tech- televisions than any other is unproductive if its televisions
niques (argued to improve safety, pain scores or do not work.
postoperative stay) often take longer to perform [13].
A team which previously completed, say, three open A formula describing quantitative list performance
nephrectomies per list and which now completes two If each of the above criteria is accepted then, logically, it is
laparoscopically does not automatically make it less possible to fulfil them all by the following empirical
productive; formula:
3 notwithstanding point (2), for any given surgical
quantitative performance (actual productivity) index
procedure, productive potential should partly be
related to the speed with which the operation is ¼ 111 ½efficiency 10ð0 productive potentialÞ ð1Þ
completed: the faster the team operates, the more
productive it is likely to be. Thus, a team consistently in which ‘productive potential’ (as used by Schmenner
completing a hernia repair in 30 min should correctly [12, 15] to refer to ‘flow’ of activity within a service
be viewed as having a higher productive potential than industry) is:
a team that always takes 1 h undertaking exactly the productive potential ¼ ðspeedÞ ðpatient contactÞ ð2Þ
same procedure. A team wishing to be more produc-
tive should rationally try to operate more quickly (so In these formulae, ‘patient contact’, ‘efficiency’ and
long as this preserves quality of care); ‘speed’ are each expressed as decimal fractions, the first
4 the total anaesthetist-surgeon contact time with the from 0–1, the last two can be > 1. Multiplying the result
patient should be reflected in the measure of quanti- in large square brackets by the constant 111, as shown,
tative performance. Although other aspects of the yields an index of performance (actual productivity)
surgical process (such as cleaning theatre, stocking expressed in %, in which an ‘ideal’ team (one with
sutures, preparing equipment) are undeniably impor- efficiency, speed and patient contact all equal to 1.0) has
tant, maximising anaesthesia-surgical contact minimises an actual productivity of 100%. We now consider each of
any idle gaps; the terms of the equation in detail.
5 where time savings might be made by improved
practices, reducing idle gaps or greater speed, these Efficiency
should only contribute to performance if extra cases are This is as detailed in our previous paper [2], where:
accommodated into the saved time. Earlier finishes in efficiency
themselves do not yield extra income (which accrues
from cases) nor generally save staffing costs (if staff are fraction of scheduled fraction of scheduled
¼
employed for fixed base units of time); time utilised time over-running
6 quantitative performance is only meaningful in the fraction of scheduled
context of efficiency (the latter being the notion that all : ð3Þ
operations completed
scheduled list time is utilised, with no over-runs or
cancellations). The reason is that high levels of apparent The ‘fraction of scheduled time utilised’ means that for
‘productivity’ are easily achieved by overwork and a list scheduled for 8 h which finishes in 6 h, this
over-expense. Overbooking a 4 h list with, say, five quantity = 0.75 and the ‘fraction of scheduled time
nephrectomies (each taking 2 h), over-running by over-running’ = 0. The ‘fraction of scheduled time
> 4 h to complete four cases (but cancelling the over-running’ means that for a list scheduled for 8 h
remaining patient) certainly achieves more work than which over-runs by 2 h, this quantity = 0.25, and the
fraction of scheduled time utilised = 1. Thus, the first times, it is also possible to assume that the team operates at
two terms operate mutually exclusively; a single list ‘average’ speed, with a value of 1.0.
cannot be at once both under- or over-utilised. The
finish time of the list is the time of arrival of the last Patient contact
patient in recovery; the start time of the list is the This is the time actually spent conducting anaesthesia and
scheduled start of anaesthesia for the first patient on the surgery, expressed as a proportion of the total actual list
list. The ‘fraction of scheduled operations completed’ time. If a list started promptly at 09.00 h and the last
means that if four out of five of the patients booked patient arrives in recovery at 17.00 h the actual list
onto the list have their operations (that is, one patient is duration has been 8 h. If six operations each of which
cancelled), this quantity = 0.80. lasted 1 h were completed, the patient contact is thus
6 ⁄ 8 = 0.75 (that is, 75% of the list time was spent with
Productive potential anaesthetist and ⁄ or surgeon in contact with the patient).
It is the product of ‘speed’ and ‘patient contact’ (Equation The formula is:
(2)), implying that any team consistently working fast
with few or no gaps is potentially more productive than a patient contact
team that is always slower and ⁄ or has long gaps between P
ð each individual operation durationsÞ
cases. The emphasis is on potential productivity because ¼ :
time from scheduled start time to actual end of list
the factors detailed in the equations will determine if this ð6Þ
is translated into actual productivity.
The converse of patient contact index is the ‘gap time’; in
Speed the example above, 25% of the list involved no patient
If an operation normally takes 1 h to complete and a contact.
team’s average for this is 30 min, the team is twice as fast Equation (6) correctly recognises late starts as gaps. In
as the norm and its relative speed is 2.0 (200%). If the the example above, if the scheduled list start time was
team takes 2 h to complete the procedure, its speed is just 08.00 h, then the denominator in Equation (6) is 9 h (the
0.5 (50%). For each operation, the relevant formula for first hour was a ‘gap’), then patient contact is 6 ⁄ 9 or 0.67
speed is thus: (67%), with a corresponding gap time of 33%. In addition
to reducing gaps, patient contact could be increased by
relative speed for an operation (a) maximising cases on a list (which should also increase
reference duration of operation utilisation and so increase efficiency); (b) parallel process-
¼ : ð4Þ
team’s duration of operation ing of anaesthesia (as we describe further below), or
(c) operating more slowly (a strategy which is perverse,
To obtain the speed for the team, the above relative and which would adversely influence the ‘speed’ term of
speeds for individual operations are averaged across all the formula).
operations undertaken. Thus if a team undertakes n1
operations with a relative speed X and n2 operations at Overview of the quantitative performance index
relative speed Y in the time period under consideration, Equation (1) reflects that the potential to be productive
then: (embedded in a team’s operating speed and degree of
contact with the patient) is translatable into actual
ðn1 :XÞ þ ðn2 :Y Þ þ þ ðnx ZÞ productivity by an efficient system. We can draw analogy
speed ¼ : ð5Þ
n1 þ n2 þ þ nx with oxygen partial pressure, which can only be translated
into actual oxygen content of blood if there is sufficient
The source of the ‘reference duration of operation’ in haemoglobin.
Equation (4) is undefined, and could be variously from The six criteria defined and Equation (1) have some
national databases, published literature [16–18], or from analogy with measuring performance quantitatively in
local data (such as the team’s own past operating times). companies that undertake skilled, complex tasks (for
Calculating average speed means that prolonged surgery example, antique clock restoration [19, 20]), as opposed
due to occasional patient co-morbidity has little influence to unskilled, repetitive tasks. High productivity results
on ‘team speed’. Co-morbidity inherent to the type of when the company estimates its workload accurately, for
surgery (such as cardiac disease needing invasive moni- example by accepting enough clocks for repair to fill its
toring in cardiac surgery) is automatically taken into time, but not too many that it needs to cancel or postpone
account by the longer reference times for those opera- orders or pay staff overtime. This is akin to sensible
tions. However, if data is unobtainable for reference booking of patients onto operating lists. Staff should
in the bottom right. An inefficient and potentially operating per se only yields ‘credit’ by opportunity for
unproductive team lies in the bottom left. Improvements more patient contact.
in efficiency drive the points and curves upwards while In summary, consideration of hypothetical scenarios
increases in productive potential drive the point to the yields very plausible and reasonable results which fulfil the
right, along a given efficiency curve. criteria defined. These hypothetical scenarios are not
dependent upon (or require any knowledge of) the
Hypothetical scenarios surgical subspecialty of the teams or the details of the
A hypothetical team operates slowly (50% expected operations they perform.
speed) and suffers long gaps (patient contact 80%). It is
also inefficient (75%) due to overbooking, over-runs and Analysis of real data sets
cancellations. The team’s quantitative performance is low Table 1 shows one example list and how data were
(X1, Fig. 2, index = 44%). Simply accepting its own extracted to construct Table 2. If we crudely plot simply
slowness and more realistically scheduling fewer patients the ‘number of cases scheduled per list’ (as do some
onto lists, this team would finish on time, reduce hospitals, [21]) then Team C appears to perform well,
cancellations and move higher to point X2 (Fig. 2). while Teams A, B and E less so (Fig. 3a). Since this
Further gains could then be made by increasing speed measure might be affected by 8 h vs 4 h list allocations or
and ⁄ or reducing gaps (to point X3, Fig. 2). However, casemix (for example, Team E is cardiac), a perhaps more
from its original situation X1, increasing speed or reducing relevant plot is ‘cases ⁄ hour completed’ (Fig. 3b). This
gaps alone achieves relatively little as movement is along increases the apparent performance of all teams except
the efficiency curve (point X4 index << X2). Teams B and E whose performance remains ‘poor’.
If by adopting a new, inherently slower surgical Table 2, however, shows more detailed data in which
technique a team continues to complete all its cases and Team B (although weakest on start times) appears to
accomplish similar list utilisation as it did before the new perform considerably better across a range of measures,
technique, then Equation (1) will yield it the same especially cancellation rate. Team E (though poor on
performance, regardless of the number of patients or cases cancellation rate in percentage terms) shows utilisation
completed on the list (data not plotted). If a team has high closest to 100% and acceptable efficiency. It also has the
operating speed and ⁄ or minimal gaps between cases, it highest patient contact index due to ‘parallel processing’
will have spare time. Accommodating extra cases into of anaesthesia, in which anaesthetic induction and
this time will increase patient contact and so increase invasive monitoring is started while the previous patient’s
performance. If, however, the team chooses instead to surgery is being completed [22, 23]. These disparate
finish early regularly then Equation (3) will reflect this as measures can now be combined meaningfully into a
consistent ‘under-utilisation’ and low performance. Faster single measure using our formulae.
First, the plots of efficiency vs utilisation (Fig. 4)
indicate that Teams B and E (the worst-performing in
Fig. 3) generally complete all booked cases on time, with
no over-run yet near-full utilisation. Team A (which also
performed rather poorly in Fig. 3) is also quite efficient,
perhaps with a slight tendency to under-run. The teams
performing apparently well in Fig. 3 (Teams C and D)
are, in fact, inefficient. Team C suffers cancellations on
every single list and also consistently under-runs. It is
most probably over-booking its lists and then managing
this by prompt cancellation of cases to avoid over-run.
This interpretation is supported by its high rates of
booking (Table 2). The main problem with Team D is
prolonged and consistent over-runs. Like Team C, it
likely over-books its lists but manages them with gross
over-running to complete the list rather than cancellation.
Figure 5 offers further detail. Teams A, B and E show
generally acceptable performance (Team A efficiency
is very slightly < 85% threshold; Team E productive
Figure 2 Plots of Performance Index vs Productive Potential for potential is slightly < 0.9). For Teams A and B, significant
one hypothetical team as discussed in text. improvements in performance will not be achieved by
Summary data
their working any faster or reducing gaps as they lie on over-runs and help both improve efficiency and produc-
the flat part of the relationship (that is, their speed is tive potential.
already better than average for the operations they In summary, neither the hypothetical nor the real data
perform and the gaps are broadly acceptable). Team A sets revealed any internal contradictions when subjected
could do better instead by improving its cancellation rate to our formulae. Instead, the interpretations seem
(Table 2). Team E could, by contrast, improve by reasonable and could be applied to compare teams within
working slightly faster (though its patient contact time is and across specialties.
already high). As a cardiac team it has a low absolute
number of cases, so the percentage impact of even the
Discussion
occasional cancellation is magnified in Table 2: nonethe-
less it is desirable to prevent even these. The six core criteria properly reflect what is required of a
Team C’s performance is quantitatively very poor, quantitative measure of performance, and they apply to
despite its misleadingly impressive plot in Fig. 3. Like any surgical list. We present formulae that fulfil these
Teams A and B, it also cannot improve by simply criteria. Finally we demonstrate that this analysis can apply
working faster or eliminating gaps as it also lies on the flat to a range of hypothetical and actual scenarios.
part of its relationship (that is, its speed is faster than the
average and it has few gaps). Since Team C books the Comments on the mathematical aspects
most cases onto its lists and has the highest cancellation To its credit, our measure avoids complex variables or
rate (> 21%), scheduling fewer cases is rational improve- misleading surrogates such as ‘numbers of operations
ment. done’. It is, however, possible that we failed to identify
The main problem for Team D may be its low average additional criteria beyond the six we defined, which
speed (Table 2), which might be due to teaching or more should be met by an ideal measure of quantitative
complex surgical methods needing further investigation. performance. The Audit Commission and related bodies
Increasing speed would also reduce the magnitude of suggested 100 individual measures of ‘performance’,
Teams A and D are half-day (4 h) lists; Teams B and C are full-day (8 h) lists; Team E is a full-day (10 h) list. Values are medians (interquartile range) [range], except where indicated. Start time is
given with reference to the scheduled start time (+ indicates late start, ) early start). Efficiency is calculated using Equation (3) in text; Speed using Equation (4); Patient contact index using
(47–108) [32–111]
(73–107) [56–128]
(22–84) [14–96]
(23–79) [14–89]
(62–88) [6–95]
Peformance
index (%)
84
88
68
63
79
(cancellation rate, %)
Cases completed ⁄
Table 2 Data from five actual teams (Teams A, C, D in gynaecology; Team B in urology; Team E cardiac: ten consecutive lists for each team).
cases booked
(21.4)
(7.6)
(2.4)
(6.5)
(9.5)
⁄ 39
⁄ 42
⁄ 84
⁄ 46
⁄ 21
36
41
66
43
19
Equation (5) and Performance index using Equation (1). Cases completed are shown as a fraction of those booked (with cancellation rate %).
(89–100) [80–100]
(88–97) [67–100]
(97–99) [72–100]
(89–93) [84–99]
(85–96) [64–98]
Patient contact
index (%)
96
92
93
94
98
(108–153) [84–187]
(91–148) [58–247]
(99–116) [88–122]
(69–97) [39–103]
(83–95) [65–124]
Speed term (%)
117
111
120
78
88
the interquartile ranges, the error bars (where different from the
(60–93)
(88–93)
(39–82)
(58–83)
(73–92)
latter) the 10th and 90th centiles, and the symbols any outlier
data. On this measure, the rank order of performance is: Team
C > D > B > A > E. (b) Boxplots of the number of cases
84
89
69
75
85
(83–95) [55–130]
(65–93) [58–98]
Utilisation (%)
()18–+3) [)30–+5]
(+4–+15) [)5–+23]
)8
)8
0
+9
D
B
C
propose is no more complex than, say, equations involved exists on how long operations take to perform is shared by
in oxygen delivery calculations (including the oxyhaemo- Macario who argued that lack of data was a key factor in
globin dissociation curve). poor scheduling of cases to fit the available time [9]. A
We do not know if an alternative, simpler formula hospital could use its own local data sets, but statistical bias
exists. Strictly, our primary aim was simply to demon- will favour the team as its own data generate the reference
strate that the six criteria could be described mathemat- times. Also new hospitals may be disadvantaged as they
ically to satisfaction. Even if only a close approximation of have no historical data. Thus how ‘reference times’ are
what ‘quantitative performance’ means, our formula is a acquired might critically influence the results of the
significant advance on the use of colloquial terminology formula, and ideally each team’s ‘reference times’ must be
in which efficiency, productivity, utilisation, etc are collected in the same manner or from equivalent sources
synonymous. to make them comparable. For example, they should
reflect subtle variants in procedure and significant co-
Limitations and utility of our analysis morbidity in patient groups (even very closely-coded
It is in the application of our measure, rather than in operations could differ in duration by as much as 1 h [9]).
its mathematical construct, that we encounter certain If the ‘aim’ of the productivity calculation is to compare
practical limitations. Perhaps the weakest part of our teams (perhaps within a hospital to generate internal
measure is accuracy with which ‘speed’ of surgery can be competition or as ‘league tables’ across hospitals) it is
measured. Our regret that so little publicly-accessible data essential that the ‘reference times’ are robust. This will be
Interpreting the quantitative performance index last conclusion surprising, but it underlines the difficulty
Whether the graphical presentation of our formulae adds in showing objectively that late starts have a detrimental
further insight than simple perusal of the raw data as impact in the face of other, more influential adverse
spreadsheets (for example, Table 2) probably depends on factors [40–42].
the experience of the analyst. An experienced physiol- Similarly, by our formulae gaps between cases have
ogist need only glance at individual values for haemo- only a modest effect on performance. Gaps are subjec-
globin, saturation, partial pressure, cardiac output, etc, to tively felt to be important [37] but objective analysis
understand an aberration in oxygen delivery. But this suggests that gap (turnover) times are in practice
experience itself derives from knowledge of the inter- modest, so that even eliminating these has little overall
relationships. A novice on the other hand often finds effect [38, 42–45]. Some authors have gone as far as
it helpful to perform the calculations consciously step- describing analysis of turnover times as ‘meaningless’,
by-step (for example, starting with a plot of oxygen implying that the effort is rarely worth the expense [46].
saturation vs partial pressure, etc). Similarly, our process We found that even the poorest performing teams (C
formally describes a method of how to analyse the data in and D in our data) had gap times < 10% of scheduled
Table 2. list time (Table 2) which is similar to that reported by
Even experienced analysts may be misled by cursory Cook [38] and the ideal objective suggested by Macario
inspection of spreadsheets. From Table 2 it does not seem [9]. This is close to the Audit Commission’s suggested
obvious that the late starts of Team B are in fact trivial; or target for gaps of < 8% of list time, and represents a
that Team A cancellation rate (though needing improve- maximum of 40 min [1, 24–26]. Even if all this time
ment) little impairs overall performance; or that slow were ‘saved’ it is difficult to see which procedures could
operating speeds of Team D (but not Team E) are be comfortably accommodated. Thus, the way we
significant; or that the cancellation rate of Team E is modelled gap ⁄ patient contact time appropriately reflects
simply a function of a small absolute number of cases, as the limited impact of reducing modest gap times, while
expected in a cardiac team. at the same time reflecting the utility of reducing very
Targets or objectives can result in perverse outcomes. A large gap times.
well-intentioned policy of never over-running theatres Previous work has indicated that faster operating can
may lead to an undesirable increase in cancellations [36]. improve throughput of cases, but only relatively modestly
A focus on operating speed may raise complication rates. [47, 48]: consistent with the notion that the lists in these
Maximising theatre utilisation alone may lead to over- studies lay near the top of the asymptotic curve (Fig. 1).
run and ⁄ or cancellation. In contrast to single aims, our
formula balances relevant variables to provide a summary Comparison with economic measures of
statistic, without creating perverse incentives. productivity
Our analysis can only be meaningfully applied in It is beyond the scope of this article to consider our
healthcare systems that are fundamentally rational and notion of productivity in the context of formal economic
ascribe to the six core criteria we defined. Hospitals are measures. Nonetheless, there are some parallels. Sumanth
free instead to choose alternative priorities. If the only distinguished ‘efficiency’ (that is, how well the business
outcome judged important were, say, completing as many works) from ‘production’ (that is, how many goods are
cases as possible (regardless of case type), or to start on produced) [49]. Scott argued that ‘productivity’ and
time (regardless of how subsequent time is utilised), then ‘profitability’ should be distinct, as the latter can be
our analysis is irrelevant to achieving such isolated influenced by demand and complex pricing issues (for
objectives. example, price-fixing) [50]. By avoiding the latter, our
approach seems consistent with Scott’s view. Our graph-
Consistency of our formula with other data ical plot of Equation (1) closely resembles an asymptotic
Our approach is consistent with – and extends the results function used by Wen in plotting ‘business productivity’
of – a number of other studies. In our model, late starts against ‘input’. Wen demonstrated a family of curves
have only modest influence on performance. We were (similar to our efficiency curves), each representing a
surprised to find that so many lists started on or before discrete pattern of economic growth or institutional
time (Table 2). Although often blamed for inefficiencies, organisation [51]. Increasing ‘input’ (for example through
objective studies indicate that factors such as overbooking faster or greater work) could increase business produc-
dwarf any measurable impact of late starts [4, 5, 37, 38]. tivity only up to curve’s asymptote. Further increases in
Starts as late as by 15 min have no discernible impact productivity could then only be achieved by major
[39–41] and some authorities have suggested that even structural change in the business (or as we describe it,
delays of < 45 min are classed acceptable [9]. We find this ‘efficiency’).
Summary: how teams can perform well subjected to our measure will assess any unforeseen
To perform well in quantitative terms by our measure, weaknesses or features that require modification. Second,
teams need to achieve the following: first, to satisfy any wider publication of operating times will assist in the
minimum quality or safety criteria defined by their own development of ‘reference times’. Finally, more accurate
sub-specialty; next, to plan lists to utilise the time available, information on costs involved in each team’s operating
with no cancellations or over-runs; then – and best only lists will enable the hypothesis to be tested that our index
then – to focus on working at least at average speeds with of performance is related to financial success. Any
minimal gaps between cases, using any time saved to disparity between these two might indicate perverse
accommodate extra cases. These appear very reasonable incentives within the system, which would be important
aspirations, independent of type of surgery, institution or to rectify.
healthcare system in which the work takes place. Science works by future work adapting past hypoth-
Fitting in an extra case may be (inevitably) somewhat eses. We hope to stimulate readers to offer alternatives as
easier for surgical subspecialties where short procedures better descriptors of productivity. Perhaps crude in some
are common than it is for those with long cases. minor details, our formula seems to work rather well for a
However, this does not bias against the latter, for the range of hypothetical and real lists. We suspect any
following reason. Our measure is able to distinguish a improvement in accuracy will require very complex
well- from a poorly-performing cardiac team. We can mathematics, but we think the only way of finding out if
also distinguish a well- from a poorly-performing day case this is so is for others, on viewing our analysis, to set about
hernia team. Furthermore, we can make direct compar- trying to improve it.
isons of any of these four teams. However, it is generally
more difficult for the already well-performing cardiac
Acknowledgements
team to perform even better, than it is for an already well-
performing hernia team. This is because (as implicit in our We thank Professor Alex Macario, Professor of Anesthesia
measure) cardiac teams generally need to save > 2 h of & Health Research Policy, Stanford University, CA,
time to accommodate an extra case, while hernia teams USA and Mr Andrew Vincent, Director, Medicology
need to save only 30 min. The bias is not large because Ltd, Specialists in Organisational Performance Through
both teams, already performing well, lie on the flat part of People, for their helpful comments on the manuscript.
the performance curve (Fig. 5). So any real gains for the
hernia team over the cardiac team are necessarily small.
References
Furthermore, it is possible for cardiac (or any other team)
to adjust the duration of their list (for example, from 1 Audit Commission. Operating Theatres. Review of National
8 h to 10 h etc) to accommodate an optimal number of Findings. London: HMSO, 2003.
cases [3]. 2 Pandit JJ, Westbury S, Pandit M. The concept of surgical
It is, however, similarly easy for currently poorly- operating list ‘efficiency’: a formula to describe the term.
Anaesthesia 2007; 62: 895–903.
performing cardiac and hernia teams to improve as both
3 McIntosh C, Dexter F, Epstein RH. The impact of service-
these teams lie on the steep part of their curves (Fig. 1);
specific staffing, case scheduling, turnovers, and first-case
that is, the ‘poor performance’ of the cardiac team implies starts on anaesthesia group and operating room productivity:
that a time saving of 2 h is realistic. Thus, the real value a tutorial using data from an Australian hospital. Anesthesia
of the asymptotic function is to emphasise that improve- and Analgesia 2006; 103: 1499–516.
ments in performance are reasonably confined to those 4 Sanders D, Grayling M, Lillie H. Defining efficiency
teams lying on the steep part of the performance curve requires more fidelity. Anaesthesia 2008; 63: 205–6.
(Fig. 1). 5 Pandit JJ, Westbury S, Pandit MJ. A reply. Anaesthesia 2008;
63: 205–6.
Conclusion and areas for future work 6 Organisation for Economic Co-operation and Develop-
If it is desirable to develop a measure of quantitative ment. Measuring Productivity. OECD Manual: Measurement
of Aggregate and Industry-Level Productivity Growth. Paris:
performance of a surgical list, which applies to all surgical
OECD Publications, 2001.
subspecialties rather than just specific for some, then our
7 Schreyer P. Measures of productivity. Proceedings of the Con-
six criteria define what that measure needs to do. ference on Next Steps for the Japanese System of National Accounts,
Equation (1) fulfils these defined criteria and we have March 2005. http://www.esri.go.jp/en/workshop/050325/
shown it is useful when applied to actual and hypothet- 0503325main-e.html (accessed 18 December 2008).
ical surgical lists, with no internal contradictions in the 8 Zbinden AM. Introducing a balanced scorecard manage-
results. There are clear areas for further work. First, data ment system in a university anesthesiology department.
from more operating lists from a variety of centres Anesthesia and Analgesia 2002; 95: 1731–8.
9 Macario A. Are your operating rooms ‘efficient?’: a scoring ProductivityHealth(27_2_06).pdf (accessed 18 December
system with eight performance indicators. Anesthesiology 2008).
2006; 105: 237–40. 26 Scottish Executive. National Theatres Project: Final Report.
10 Dexter F, O’Neill L. Data envelopment analysis to deter- Edinburgh: HMSO, 2006.
mine by how much hospitals can increase elective inpatient 27 Nath B, Silverman BG, Mukherjee A, Stout RG,
surgical workload for each specialty. Anesthesia and Analagesia Holt N. Relationship of system-specific disorders in the
2004; 99: 1492–500. pre-anesthesia evaluation to hospital length of stay.
11 Basson MD, Butler T. Evaluation of operating suite effi- Anesthesia and Analgesia 2008; 106: S70.
ciency in the Veterans Health Administration system by 28 The Audit Commission. The Right Result? Payment by Results
using data-envelopment analysis. American Journal of Surgery 2003–2007. London: Audit Commission, 2008.
2006; 192: 649–56. 29 Goldstein LB, Moore WS, Robertson ST, Chaturvedi S.
12 Lehtonen J-M, Kukala J, Kouri J, Hippeläinen M. Cardiac Complication rates for carotid endarterectomy: a call to
surgery productivity and throughput improvements. arms. Stroke 1997; 28: 889–90.
International Journal of Health Care Quality Assurance 2007; 20: 30 Pandit JJ, Satya-Krishna R, Gration P. Superficial or deep
30–52. cervical plexus block for carotid endarterectomy: a system-
13 Maione G, Sansalone CV, Aseni P. Laprascopic hand- atic review of complications. British Journal of Anaesthesia
assisted living donor nephrectomy: the Niguarda experience. 2007; 99: 159–69.
Transplantation Proceedings 2005; 37: 2445. 31 American Association of Anesthesia Clinical Directors.
14 Bridgewater B. Mortality data in adult cardiac surgery for Glossary of times used for scheduling and monitoring of
named surgeons: retrospective examination of prospectively- diagnostic and therapeutic procedures. AORN Journal 1997;
collected data on coronary artery surgery and aortic valve 66: 601–6.
replacement. British Medical Journal 2005; 330: 506–10. 32 Norton A. The (anaesthetic) times, they are a-changin’.
15 Schmenner RW. Service business and productivity. Bulletin of the Royal College of Anaesthetists 2008; 48:
Decision Sciences 2004; 35: 333–47. 2462–4.
16 Pandit JJ, Carey A. Estimating the duration of common 33 Durani P, Seagrave M, Neumann L. The use of theatre time
elective operations: implications for operating list manage- in elective orthopaedic surgery. Annals of the Royal College of
ment. Anaesthesia 2006; 61: 768–76. Surgeons 2005; 87 (Suppl.): 170–2.
17 Silber JH, Rosenbaum PR, Zhang X, Even-Shoshan O. 34 Pitches D, Burls A, Fry-Smith A. How to make a silk purse
Influence of patient and hospital characteristics on anesthesia from a sow’s ear – a comprehensive review of strategies to
time in medicare patients undergoing general and ortho- optimise data for corrupt managers and incompetent clini-
pedic surgery. Anesthesiology 2007; 106: 356–64. cians. British Medical Journal 2003; 327: 1436–9.
18 Pandit JJ. Similarity of operation times for common general 35 Royal Statistical Society Working Party on Performance
surgical procedures in the United Kingdom and the United Monitoring in the Public Service (2004). Performance
States. Anesthesiology 2007; 107: 512–3. indicators: good, bad and ugly. Journal of the Royal Statistical
19 Harvey J. Measuring productivity in professional services. Society A 2005; 168: 1–27.
Public Productivity Review 1987; 11: 29–38. 36 Tessler MJ, Kleiman SJ, Huberman MM. A ‘zero tolerance
20 Starosel’skii VA, Khanin MN. Economic efficiency of spe- for overtime’ increases surgical per case costs. Canadian
cialized instrument maintenance. Measurement Techniques Journal of Anaesthesia 1997; 44: 1036–41.
1969; 12: 1762–6. 37 Cook TM. The concept of surgical operating list ‘efficiency’:
21 East Kent Hospitals University NHS Trust. Trust Board a formula to describe the term. Anaesthesia 2007; 63: 98.
Papers, Sept 2007. http://www.ekht.nhs.uk/home-page/ 38 Pandit JJ, Westbury S, Pandit MJ. A reply. Anaesthesia 2008;
about-the-trust/the-trust-board/trust-board-meetings- 63: 98–9.
minutes-and-agendas/trust-board-papers-and-minutes-7- 39 Lebowitz P. Why can’t my procedures start on time? AORN
september-2007/ (accessed 5 September 2008). Journal 2003; 77: 594–7.
22 Abouleish A. Increasing operating room throughput: 40 Ray N, Tung A, Glick DB. Who cares if I’m on time?
just buzzwords for this decade? Anesthesiology 2008; 109: Anesthesia and Analgesia 2008; 106: S–71.
3–4. 41 Cutter RW, Carli N, O’Connor M. The modified staggered
23 Smith MP, Sandberg WS, Foss J, et al. High-throughput start. Anesthesia and Analgesia 2008; 106: S–72.
operating room system for joint arthroplasties durably 42 Ray N, Tung A, Glick DB. Why rush? You won’t get out
outperforms routine processes. Anesthesiology 2008; 109: any earlier!! Anesthesia and Analgesia 2008; 106: S–73.
25–35. 43 Abouleish AE, Hensley SL, Zornow MH, Prough DS.
24 Northern Ireland Audit Office. The Use of Operating Theatres Inclusion of turnover time does not influence identification
in the Northern Ireland Health and Personal Social Services. of surgical services that over- and underutilize allocated
Belfast: HMSO, 2003. block time. Anesthesia and Analgesia 2003; 96: 813–8.
25 Office for National Statistics. Public Service Productivity: 44 Dexter F, Abouleish AE, Epstein RH, Whitten CW,
Health. London: Office for National Statistics, 2006. http:// Lubarsky DA. Use of operating room information system
www.statistics.gov.uk/articles/nojournal/PublicService data to predict the impact of reducing turnover times
on staffing costs. Anesthesia and Analgesia 2003; 97: 48 Seim AR, Dahl DM, Sandberg WS. Small changes in
1119–26. operative time can yield discrete increases in operating
45 Mathias JM. Operating room efficiency: benchmarking room throughput. Journal of Endourology 2007; 7: 703–
operating room turnover times. Operating Room Manager 8.
2000; 16: 1–4. 49 Sumanth DJ. Productivity Engineering and Management. New
46 Brenn BR, Reilly JS, Deutsch ES, Hettrick MH, Cook SC. York: McGraw-Hill, 1984.
Analysis of efficiency of common otolaryngology operations: 50 Scott S. Productivity Management: Planning, Measurement and
comparison of operating room vs. short procedure room in a Evaluation, Control and Improvement. New York: John Wiley
pediatric tertiary hospital. Archives of Otoloaryngology, Head and Sons, 1987.
and Neck Surgery 2003; 129: 435–7. 51 Wen GJ. Total factor productivity change in China’s farm-
47 Strum DP, Sampson AR, May JH, Vargas LG. Surgeon and ing sector: 1952–1989. Economic Development and Cultural
type of anesthesia predict variability in surgical procedure Change 1993; 42: 1–41.
times. Anesthesiology 2000; 92: 1454–66.