Documente Academic
Documente Profesional
Documente Cultură
REVIEW ARTICLE
a r t i c l e i n f o a b s t r a c t
Article history: Background: Cervical cancer screening is offered to women to identify and treat cervical intraepithelial neoplasia
Received 22 January 2015 (CIN). Objectives: To support WHO guidelines, a systematic review was performed to compare test accuracy of
Received in revised form 25 June 2015 the HPV test, cytology (cervical smear), and unaided visual inspection with acetic acid (VIA); and to determine
Accepted 2 November 2015 test accuracy of HPV and colposcopy impression. Search strategy: Medline and Embase were searched up to
September 2012, and experts were contacted for references. Selection criteria: Studies of at least 100 nonpregnant
Keywords:
women (aged ≥18 years) not previously diagnosed with CIN were included. Data collection and analysis: Two in-
Cervical intraepithelial neoplasia
Cervical smear
vestigators independently screened and collected data. Pooled sensitivity and specificity, and absolute differences
Colposcopy were calculated, and the quality of evidence assessed using GRADE (Grading of Recommendations Assessment,
Cytology Development and Evaluation). Main results: High to moderate quality evidence was found. The greatest difference
HPV in overtreatment occurred with VIA instead of the cervical smear (58 more per 1000 women). Differences in
Meta-analysis missed treatment ranged from 2–5 per 1000 women. For 1000 women screened positive and then sent to colpos-
Review copy, 464 would be falsely diagnosed with CIN grade 2–3 and treated. Conclusions: Although differences in sensi-
Visual inspection with acetic acid tivity between tests could be interpreted as large, absolute differences in missed diagnoses were small. By contrast,
small differences in specificity resulted in fairly large absolute differences in overtreatment.
© 2015 International Federation of Gynecology and Obstetrics. Published by Elsevier Ireland Ltd. This is an open
access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
1. Introduction There has been much debate about which screening tests should be
recommended. Ideally, reviews of studies comparing complete screen-
Cervical cancer is the fourth most common cancer in women world- ing, diagnosis, and treatment strategies would provide high-quality ev-
wide and the most common cancer in many low- and middle-income idence to support clinical decision making [2]. However, this type of
countries [1]. For this reason, screening is typically offered to women direct evidence assessing the effect of providing a test followed by treat-
to identify precursors that can be treated to avoid progression to cervi- ment is frequently lacking and, when available, it usually does not pro-
cal cancer. Common screening methods widely used today include tests vide information about important patient outcomes for decision
for HPV, and tests to detect cervical lesions by cytology (cervical smear) making. Instead, different decision makers—clinicians, program man-
or unaided visual inspection with acetic acid (VIA). These tests can be agers, policy makers, and patients—have to rely on test accuracy data
used alone or in a sequence (e.g. HPV test followed by a cervical (e.g. sensitivity and specificity of tests) together with data about the
smear when HPV positive). consequences of treatment to decide which screening tests to provide.
In 2012, WHO committed to developing recommendations for
⁎ Corresponding author at: Department of Clinical Epidemiology and Biostatistics,
screen-and-treat strategies to prevent cervical cancer and commis-
Room 2C16, Health Sciences Centre, McMaster University, 1280 Main Street West,
Hamilton, ON L8S 4L8, Canada. Tel.: +1 905 525 9140x24931; fax: +1 905 522 9507. sioned a series of reviews to inform those recommendations [3]. Given
E-mail address: schuneh@mcmaster.ca (H.J. Schünemann). the lack of studies directly testing complete screen-and-treat strategies,
http://dx.doi.org/10.1016/j.ijgo.2015.07.024
0020-7292/© 2015 International Federation of Gynecology and Obstetrics. Published by Elsevier Ireland Ltd. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/).
260 R.A. Mustafa et al. / International Journal of Gynecology and Obstetrics 132 (2016) 259–265
the literature was reviewed for the accuracy of screening tests to pro- of the evidence/confidence in the effect estimates of each outcome
vide essential information for decision making. The present paper pre- were assessed as high, moderate, low, or very low. To better illustrate
sents a series of systematic reviews of test accuracy comparing the the impact of the sensitivity and specificity, absolute differences in ef-
three test options for screening to prevent cervical cancer: the HPV fects were calculated for each comparison as TPs, TNs, FPs, and FNs.
test, the cervical smear, and VIA. Because screening can also be provided These effects were calculated for a population with a 2% prevalence of
as a sequence of tests (e.g. HPV followed by colposcopy), reviews of the CIN, and with a 20% prevalence for colposcopy impression because
accuracy of HPV and of colposcopy impression were also performed. women would receive colposcopy impression after a positive screening
test. These effects and quality of evidence are presented in GRADE
2. Materials and methods Evidence profiles.
findings for this comparison. Sensitivity estimates for HPV ranged from findings for the diagnostic test accuracy of the HPV test. Sensitivity
0.69 to 1.00, and the sensitivity estimates for cervical smears ranged estimates for HPV ranged from 0.64 to 1.00 and the specificity
from 0.43 to 0.94. Specificity estimates for HPV ranged from 0.79 to ranged from 0.56 to 0.97. The pooled estimates for HPV sensitivity
0.97, whereas specificity estimates for cervical smears ranged from and specificity were 0.94 (95% CI 0.89–0.97) and 0.88 (95% CI
0.78 to 0.98. The pooled estimates for HPV sensitivity and specificity 0.84–0.92), respectively. The quality of the evidence for TP and FN
were 0.94 (95% CI 0.89–0.97) and 0.90 (95% CI 0.86–0.93), respectively. was high, but moderate for TN and FP due to inconsistency
The pooled estimates for cervical smear sensitivity and specificity were across the studies and imprecision resulting in wide confidence
0.70 (95% CI 0.57–0.80) and 0.95 (95% CI 0.92–0.97), respectively. The intervals.
quality of the evidence for TP and FN was high, but moderate for TN
and FP when considering inconsistency and some imprecision. 3.6. Colposcopy impression
3.5. HPV test (from comparisons) There are 12 reports of 11 studies [8,30–39] that provided test
accuracy data for colposcopy impression among 6370 participants.
HPV test accuracy data were pooled from 15 studies, with a total Fig. 6 provides a summary of the findings for the diagnostic test
of 45 783 participants [8–12,20–29]. Fig. 5 shows a summary of the accuracy of colposcopy impression. Sensitivity estimates for colposcopy
262 R.A. Mustafa et al. / International Journal of Gynecology and Obstetrics 132 (2016) 259–265
Fig. 2. Summary of findings for diagnostic test accuracy of HPV test versus VIA. Abbreviations: VIA, visual inspection with acetic acid; CI, confidence interval; GRADE, Grading of Recom-
mendations Assessment, Development, and Evaluation; CIN, cervical intraepithelial neoplasia.
impression ranged from 0.29 to 1.00, and the specificity ranged from 4. Discussion
0.12 to 0.88. The pooled estimates for HPV sensitivity and specificity
were 0.95 (95% CI 0.86–0.98) and 0.42 (95% CI 0.26–0.61), respectively. The present review gives the results of a series of systematic reviews
The quality of the evidence for TP and FN was high, but moderate for TN and meta-analyses of test accuracy data for screening tests in the pre-
and FP because of inconsistency across the studies and imprecision vention of cervical cancer. High to moderate quality evidence was
resulting in wide confidence intervals. found of test accuracy for direct comparisons between the HPV test
Fig. 3. Summary of findings for diagnostic test accuracy of VIA versus cervical smears. Abbreviations: VIA, visual inspection with acetic acid; CI, confidence interval; GRADE, Grading of
Recommendations Assessment, Development, and Evaluation; CIN, cervical intraepithelial neoplasia.
R.A. Mustafa et al. / International Journal of Gynecology and Obstetrics 132 (2016) 259–265 263
Fig. 4. Summary of findings for diagnostic test accuracy of HPV tests versus cervical smears. Abbreviations: CI, confidence interval; GRADE, Grading of Recommendations Assessment, De-
velopment, and Evaluation; CIN, cervical intraepithelial neoplasia.
and VIA, VIA and cervical smears, and the HPV test and cervical smears, By contrast, small differences in the specificity for all comparisons re-
as well as for pooled estimates for the HPV test and for colposcopy im- sulted in fairly large absolute differences between tests for TN and FP for a
pression. For the direct comparisons, the absolute differences between given disease prevalence. In particular, the difference between VIA and
the outcomes of TP, FN, TN, and FP were calculated to provide a clearer cervical smears was 58 more FPs; this means that 58 more women
representation of the impact of sensitivity and specificity on diagnosis would receive treatment unnecessarily if VIA alone (instead of the cervi-
and subsequent treatment [40]. For example, the differences in sensitiv- cal smear) was used to screen women for CIN grade 2–3 lesions. Again,
ity of HPV when compared with VIA (95% compared to 69%) could be decision makers, including those making recommendations, would
interpreted as large. However, for a given prevalence, the absolute dif- need to consider the downstream consequences of these differences
ferences in the number of women correctly identified with CIN grade when choosing to provide these tests in a screening program followed
2–3 was five more per 1000 tested with the HPV test, and five fewer by treatment. The small variation in the absolute differences between
women per 1000 tested falsely identified as negative for CIN grade FN values and the large variation between the FP values among the differ-
2–3. Decision makers would need to consider whether missing five in ent tests is primarily due to the fairly low prevalence of CIN 2–3 of 2%. Col-
every 1000 women when using VIA is acceptable for CIN grade 2–3 le- poscopy impression has also been used in screening programs after a
sions, of which 70% could progress with time. screen positive test. However, the present results show that its use
Fig. 5. Summary of findings for diagnostic test accuracy of HPV tests. Abbreviations: CI, confidence interval; GRADE, Grading of Recommendations Assessment, Development, and
Evaluation; CIN, cervical intraepithelial neoplasia.
264 R.A. Mustafa et al. / International Journal of Gynecology and Obstetrics 132 (2016) 259–265
Fig. 6. Summary of findings for diagnostic test accuracy of colposcopy impressions. Abbreviations: CI, confidence interval; GRADE, Grading of Recommendations Assessment, Develop-
ment, and Evaluation; CIN, cervical intraepithelial neoplasia.
resulted in 464 false positives in every 1000. In other words, for every However, these data do not address the consequences of screening—
1000 women screened positive and then sent to colposcopy, 464 would treatment and the subsequent outcomes. When deciding on a screen-
be falsely diagnosed as CIN grade 2–3 positive and subsequently treated. and-treat strategy, it is critical to consider the downstream consequences
Confidence in the present estimates was high to moderate. Given the after treatment (for a positive test) or no treatment (for a negative test),
unknown degree of influence of risk of bias on test accuracy estimates, such as cervical cancer and related mortality, recurrence of CIN grade
only studies at low risk of bias were included and paired analyses were 2–3, adverse effects of treatment (and overtreatment), and use of re-
conducted for direct comparisons. Many studies were excluded that sources. Therefore, the differences in sensitivity and specificity of the
have been included in other reviews of test accuracy data, because the test comparisons found in the present review should not be considered
women who screened negative in these studies did not receive the refer- as standalone figures, but should be considered in the context of treat-
ence standard. This means that there would be little to no data to deter- ment and patient outcomes. In fact, the sensitivity and specificity results
mine FNs and TNs. Also excluded were studies that did not provide the were used in a model to determine the effects of different strategies to
two screening tests to all women. By including studies providing at least screen and treat women. Using the model, the outcomes due to treatment
two tests, it was possible to make a direct comparison of the results in or missed treatment were calculated and then used to make recommen-
the same women within studies. In addition, a paired analysis of studies dations about the use of the strategies. Ideally, rigorous studies comparing
was conducted that had a direct comparison of the tests within the highly relevant screen-and-treat strategies and following up all women to
same group of women. Not only does this provide more rigorous esti- assess effect on important outcomes need to be conducted to inform
mates of the effects [6], but it also provides relevant data for decision decision makers.
makers who will almost always be faced with a decision to provide one
test over another. However, two analyses without comparisons were in- Supplementary data to this article can be found online at http://dx.
cluded: HPV and colposcopy impression. Because these tests are often doi.org/10.1016/j.ijgo.2015.07.024.
used as part of a sequence of tests (e.g. HPV test followed by colposcopy
impression), the overall test accuracy of each test was calculated sepa- Acknowledgments
rately. For these analyses, only studies that had a low risk of bias were
pooled. In particular, data were used only from studies that included The present review was commissioned and funded by WHO as an in-
women who were screened positive and screened negative, thus ensuring dependent review.
appropriate calculations of the TN and FN, and reducing verification bias.
There are few limitations of the present review. One could argue that Conflict of interest
other important information has been omitted that has been included in
other published reviews of these tests, because the analyses were limit- The authors have no conflicts of interest.
ed to studies of low risk of bias. However, adding in studies with high
risk of bias would substantially reduce the confidence—i.e. the quality
References
of the evidence—in the effects we found. In addition, although well de-
veloped, the methods for pooling test accuracy data and exploring het- [1] Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, et al. GLOBOCAN
erogeneity continue to improve. It was not possible to explore or 2012 v1.0, Cancer Incidence and Mortality Worldwide: IARC CancerBase No. 11.
Lyon, France: International Agency for Research on Cancer; 2013.
explain the high heterogeneity in many of the analyses. Instead, the
[2] Schünemann HJ, Mustafa R, Brozek J. Diagnostic accuracy and linked evidence—testing
range of sensitivity and specificity from the included studies has been the chain [in German]. Z Evid Fortbild Qual Gesundhwes 2012;106(3):153–60.
presented in addition to the confidence intervals. [3] Santesso N, Mustafa RA, Schünemann HJ, Arbyn M, Blumenthal PD, Cain J, et al.
World Health Organization Guidelines: Treatment of cervical intraepithelial
In conclusion, the ideal studies to inform decision making about a
neoplasia 2–3 and Screen and Treat strategies to prevent cervical cancer. Int J
screen-and-treat strategy for screening for precancerous cervical lesions Gynecol Obstet 2016;132(3):252–8.
would be direct comparisons between use of one screening test and an- [4] Koliopoulos G, Arbyn M, Martin-Hirsch PPL, Kyrgiou M, Prendiville WJP,
other, and measuring the effect on important patient outcomes. Instead, Paraskevaidis E. Cytology versus HPV testing for cervical cancer screening in the
general population. Cochrane Libr 2010;7, CD008587.
recommendations are typically based on the accuracy of a test, which is [5] Irwig L, Bossuyt P, Glasziou P, Gatsonis C, Lijmer J. Designing studies to ensure that
determined by calculating the sensitivity and specificity of the test. estimates of test accuracy are transferable. BMJ 2002;324(7338):669–71.
R.A. Mustafa et al. / International Journal of Gynecology and Obstetrics 132 (2016) 259–265 265
[6] Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J. The development of DNA testing in the screening and diagnostic settings. Cancer Epidemiol Biomarkers
QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included Prev 2008;17(10):2865–71.
in systematic reviews. BMC Med Res Methodol 2003;3:25. [23] de Cremoux P, Coste J, Sastre-Garau X, Thioux M, Bouillac C, Labbé S, et al. Efficiency
[7] Schünemann HJ, Oxman AD, Brozek J, Glasziou P, Jaeschke R, Vist GE, et al. Grading of the hybrid capture 2 HPV DNA test in cervical cancer screening. A study by the
quality of evidence and strength of recommendations for diagnostic tests and French Society of Clinical Cytology. Am J Clin Pathol 2003;120(4):492–9.
strategies. BMJ 2008;336(7653):1106–10. [24] Depuydt CE, Makar AP, Ruymbeke MJ, Benoy IH, Vereecken AJ, Bogers JJ. BD-ProExC
[8] Belinson J, Qiao YL, Pretorius R, Zhang WH, Elson P, Li L, et al. Shanxi Province as adjunct molecular marker for improved detection of CIN2+ after HPV primary
Cervical Cancer Screening Study: a cross-sectional comparative trial of multiple screening. Cancer Epidemiol Biomarkers Prev 2011;20(4):628–37.
techniques to detect cervical neoplasia. Gynecol Oncol 2001;83(2):439–44. [25] Hovland S, Arbyn M, Lie AK, Ryd W, Borge B, Berle EJ, et al. A comprehensive evalu-
[9] De Vuyst H, Claeys P, Njiru S, Muchiri L, Steyaert S, De Sutter P, et al. Comparison of ation of the accuracy of cervical pre-cancer detection methods in a high-risk area in
pap smear, visual inspection with acetic acid, human papillomavirus DNA-PCR East Congo. Br J Cancer 2010;102(6):957–65.
testing and cervicography. Int J Gynecol Obstet 2005;89(2):120–6. [26] Mahmud SM, Sangwa-Lugoma G, Nasr SH, Kayembe PK, Tozin RR, Drouin P, et al.
[10] Qiao YL, Sellors JW, Eder PS, Bao YP, Lim JM, Zhao FH, et al. A new HPV-DNA test for Comparison of human papillomavirus testing and cytology for cervical cancer
cervical-cancer screening in developing regions: a cross-sectional study of clinical screening in a primary health care setting in the Democratic Republic of the
accuracy in rural China. Lancet Oncol 2008;9(10):929–36. Congo. Gynecol Oncol 2012;124(2):286–91.
[11] Shastri SS, Dinshaw K, Amin G, Goswami S, Patil S, Chinoy R, et al. Concurrent [27] Monsonego J, Hudgens MG, Zerat L, Zerat JC, Syrjänen K, Halfon P, et al. Evaluation of
evaluation of visual, cytological and HPV testing as screening methods for the oncogenic human papillomavirus RNA and DNA tests with liquid-based cytology in
early detection of cervical neoplasia in Mumbai, India. Bull World Health Organ primary cervical cancer screening: the FASE study. Int J Cancer 2011;129(3):
2005;83(3):186–94. 691–701.
[12] Sodhani P, Gupta S, Sharma JK, Parashari A, Halder K, Singh V, et al. Test [28] Petry KU, Menton S, Menton M, van Loenen-Frosch F, de Carvalho Gomes H, Holz B,
characteristics of various screening modalities for cervical cancer: a feasibility et al. Inclusion of HPV testing in routine cervical cancer screening for women above
study to develop an alternative strategy for resource-limited settings. Cytopathology 29 years in Germany: results for 8466 patients. Br J Cancer 2003;88(10):1570–7.
2006;17(6):348–52. [29] Blumenthal PD, Gaffikin L, Chirenje ZM, McGrath J, Womack S, Shah K. Adjunctive
[13] Cremer M, Conlisk E, Maza M, Bullard K, Peralta E, Siedhoff M, et al. Adequacy of testing for cervical cancer in low resource settings with visual inspection, HPV,
visual inspection with acetic acid in women of advancing age. Int J Gynecol Obstet and the Pap smear. Int J Gynecol Obstet 2001;72(1):47–53.
2011;113(1):68–71. [30] Cantor SB, Cárdenas-Turanzas M, Cox DD, Atkinson EN, Nogueras-Gonzalez GM,
[14] Elit L, Baigal G, Tan J, Munkhtaivan A. Assessment of 2 cervical screening methods in Beck JR, et al. Accuracy of colposcopy in the diagnostic setting compared with the
Mongolia: cervical cytology and visual inspection with acetic acid. J Low Genit Tract screening setting. Obstet Gynecol 2008;111(1):7–14.
Dis 2006;10(2):83–8. [31] Cremer ML, Peralta EI, Dheming SG, Jimenez ME, Davis-Dao CA, Alonzo TA, et al.
[15] Ghaemmaghami F, Behtash N, Gilani MM, Mousavi A, Marjani M, Moghimi R. Visual Digital assessment of the reproductive tract versus colposcopy for directing biopsies
inspection with acetic acid as a feasible screening test for cervical neoplasia in Iran. in women with abnormal Pap smears. J Low Genit Tract Dis 2010;14(1):5–10.
Int J Gynecol Cancer 2004;14(3):465–9. [32] Cristoforoni PM, Gerbaldo D, Perino A, Piccoli R, Montz FJ, Capitanio GL. Computerized
[16] Goel A, Gandhi G, Batra S, Bhambhani S, Zutshi V, Sachdeva P. Visual inspection of colposcopy: results of a pilot study and analysis of its clinical relevance. Obstet Gynecol
the cervix with acetic acid for cervical intraepithelial lesions. Int J Gynecol Obstet 1995;85(6):1011–6.
2005;88(1):25–30. [33] Durdi GS, Sherigar BY, Dalal AM, Desai BR, Malur PR. Correlation of colposcopy using
[17] Hedge D, Shetty H, Shetty PK, Rai S. Diagnostic value of acetic acid comparing with Reid colposcopic index with histopathology—a prospective study. J Turk Ger Gynecol
conventional Pap smear in the detection of colposcopic biopsy-proved CIN. J Cancer Assoc 2009;10(4):205–7.
Res Ther 2011;7(4):454–8. [34] Ferris DG, Miller MD. Colposcopic accuracy in a residency training program: defining
[18] Sahasrabuddhe VV, Bhosale RA, Kavatkar AN, Nagwanshi CA, Joshi SN, Jenkins CA, competency and proficiency. J Fam Pract 1993;36(5):515–20.
et al. Comparison of visual inspection with acetic acid and cervical cytology to detect [35] Homesley HD, Jobson VW, Reish RL. Use of colposcopically directed, four-quadrant
high-grade cervical neoplasia among HIV-infected women in India. Int J Cancer cervical biopsy by the colposcopy trainee. J Reprod Med 1984;29(5):311–6.
2012;130(1):234–40. [36] Jones DE, Creasman WT, Dombroski RA, Lentz SS, Waeltz JL. Evaluation of the
[19] Sankaranarayanan R, Wesley R, Thara S, Dhakad N, Chandralekha B, Sebastian P, atypical Pap smear. Am J Obstet Gynecol 1987;157(3):544–9.
et al. Test characteristics of visual inspection with 4% acetic acid (VIA) and Lugol's [37] Kierkegaard O, Byrjalsen C, Frandsen KH, Hansen KC, Frydenberg M. Diagnostic
iodine (VILI) in cervical cancer screening in Kerala, India. Int J Cancer 2003; accuracy of cytology and colposcopy in cervical squamous intraepithelial lesions.
106(3):404–8. Acta Obstet Gynecol Scand 1994;73(8):648–51.
[20] Agorastos T, Dinas K, Lloveras B, de Sanjose S, Kornegay JR, Bonti H, et al. Human [38] Mousavi AS, Fakour F, Gilani MM, Behtash N, Ghaemmaghami F, Karimi Zarchi M. A
papillomavirus testing for primary screening in women at low risk of developing prospective study to evaluate the correlation between Reid colposcopic index
cervical cancer. The Greek experience. Gynecol Oncol 2005;96(3):714–20. impression and biopsy histology. J Low Genit Tract Dis 2007;11(3):147–50.
[21] Bigras G, de Marval F. The probability for a Pap test to be abnormal is directly [39] Patil K, Durdi G, Lakshmi KS, Swamy MK. Comparison of diagnostic efficacy of visual
proportional to HPV viral load: results from a Swiss study comparing HPV testing inspection of cervix with acetic acid and pap smear for prevention of cervical cancer:
and liquid-based cytology to detect cervical cancer precursors in 13,842 women. Is VIA superseding pap smear? J SAFOG 2011;3(3):131–4.
Br J Cancer 2005;93(5):575–81. [40] Hsu J, Brożek JL, Terracciano L, Kreis J, Compalati E, Stein AT, et al. Application of
[22] Cárdenas-Turanzas M, Nogueras-Gonzalez GM, Scheurer ME, Adler-Storthz K, GRADE: making evidence-based recommendations about diagnostic tests in clinical
Benedet JL, Beck JR, et al. The performance of human papillomavirus high-risk practice guidelines. Implement Sci 2011;6:62.