Annals of Emergency Medicine
Volume 55, Issue 1 , Pages 51-59.e1, January 2010

Diagnostic Accuracy of Noncontrast Computed Tomography for Appendicitis in Adults: A Systematic Review

  • Veronica Hlibczuk, MD

      Affiliations

    • Division of Emergency Medicine, Columbia University Medical Center, New York–Presbyterian Hospital, New York, NY
    • Corresponding Author InformationAddress for correspondence: Veronica Hlibczuk, MD, Division of Emergency Medicine, Columbia University Medical Center, New York-Presbyterian Hospital, 622 West 168th St, PH1-137, New York, NY 10032; 212-305-2995, fax 212-305-6792
  • ,
  • Judith A. Dattaro, MD

      Affiliations

    • Division of Emergency Medicine, Weill Cornell Medical Center, New York–Presbyterian Hospital, New York, NY
  • ,
  • Zhezhen Jin, PhD

      Affiliations

    • Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY
  • ,
  • Louise Falzon, BA

      Affiliations

    • Mount Sinai Heart, Mount Sinai Medical Center, New York, NY
  • ,
  • Michael D. Brown, MD, MSc

      Affiliations

    • Division of Emergency Medicine, Michigan State University College of Human Medicine, Grand Rapids, MI

Received 25 March 2009; received in revised form 20 May 2009 and 15 June 2009; accepted 24 June 2009. published online 07 September 2009.

Article Outline

Study objective

We seek to determine the diagnostic test characteristics of noncontrast computed tomography (CT) for appendicitis in the adult emergency department (ED) population.

Methods

We conducted a search of MEDLINE, EMBASE, the Cochrane Library, and the bibliographies of previous systematic reviews. Included studies assessed the diagnostic accuracy of noncontrast CT for acute appendicitis in adults by using the final diagnosis at surgery or follow-up at a minimum of 2 weeks as the reference standard. Studies were included only if the CT was completed using a multislice helical scanner. Two authors independently conducted the relevance screen of titles and abstracts, selected studies for the final inclusion, extracted data, and assessed study quality. Consensus was reached by conference, and any disagreements were adjudicated by a third reviewer. Unenhanced CT test performance was assessed with summary receiver operating characteristic curve analysis, with independently pooled sensitivity and specificity values across studies.

Results

The search yielded 1,258 publications; 7 studies met the inclusion criteria and provided a sample of 1,060 patients. The included studies were of high methodological quality with respect to appropriate patient spectrum and reference standard. Our pooled estimates for sensitivity and specificity were 92.7% (95% confidence interval 89.5% to 95.0%) and 96.1% (95% confidence interval 94.2% to 97.5%), respectively; the positive likelihood ratio=24 and the negative likelihood ratio=0.08.

Conclusion

We found the diagnostic accuracy of noncontrast CT for the diagnosis of acute appendicitis in the adult population to be adequate for clinical decisionmaking in the ED setting.

 

Back to Article Outline

Introduction 

Acute appendicitis is frequently in the differential diagnosis of patients presenting to the emergency department (ED) with right lower quadrant abdominal pain. Unfortunately, the diagnosis is difficult, with the classic presentation of periumbilical pain followed by nausea and vomiting, with migration of pain to the right lower quadrant occurring in only 50% to 60% of patients with appendicitis.1 An increased leukocyte count occurs in 70% to 90% of cases but is nonspecific because it can occur in other disease processes.2 Although there has been some conflicting evidence, the use of computed tomography (CT) has been reported to significantly reduce the negative appendectomy rate in many studies.3, 4, 5, 6 Institutions use different combinations of oral, intravenous, and rectal contrast, as well as noncontrast protocols. Noncontrast CT is particularly appealing in today's crowded EDs because there is no delay caused by waiting for oral contrast transit, no risk of contrast-induced nephropathy, and no risk of allergic reaction. However, the use of noncontrast CT has been controversial because of concerns about diagnostic accuracy and the need for repeated scanning with contrast in a subset of patients when the interpretation is inconclusive.7, 8 The goal of this systematic review was to assess the evidence for the use of noncontrast CT for the diagnosis of acute appendicitis among adults.

Editor's Capsule Summary

 

What is already known on this topic

Abdominal computed tomography (CT) is widely used for evaluation of suspected appendicitis, but the utility of CT without contrast remains controversial.

What question this study addressed

What is the diagnostic accuracy of noncontrast CT in the emergency department (ED) evaluation of adults with suspected appendicitis?

What this study adds to our knowledge

This 7-study systematic review of noncontrast CT yielded pooled estimates of sensitivity 93%, specificity 96%, positive likelihood ratio 24, and negative likelihood ratio 0.08, which are comparable to findings of previously published reviews.

How this might change clinical practice

Noncontrast CT has high sensitivity and specificity in the ED diagnosis of adult appendicitis and is a reasonable alternative to contrast CT for this indication.

Back to Article Outline

Materials and Methods 

The clinical question addressed in this systematic review is, in adult patients presenting to the ED with acute abdominal pain and suspected of having acute appendicitis, what are the test characteristics of noncontrast helical CT? A written systematic review protocol was developed to directly address this clinical question and was reviewed and agreed upon by all authors a priori.

All relevant electronic databases were searched for studies assessing the accuracy of noncontrast helical (multislice) CT to diagnose acute appendicitis in adults. No age, language, or publication limits were applied. Databases searched were MEDLINE (Ovid 1950 to March, week 3, 2008), EMBASE, the Cochrane Library (including the Cochrane Central Register of Controlled Clinical Trials, the Database of Abstracts of Review of Effects, and the Health Technology Assessment Database), the American College of Physicians Journal Club, and Emergency Medicine Abstracts (available online at http://ccme.org/). The MEDLINE search used a combination of the following MeSH headings and free-text terms: “appendicitis,” “appendectomy,” “append$,” “tomography,” “x-ray computed,” “contrast media,” “computed tomography,” “ct,” “helical,” “enhanced,” “unenhanced,” “contrast$,” and “noncontrast.” Similar search terms were then adapted for each database. The complete MEDLINE search strategy is provided as Appendix E1 (available online at http://www.annemergmed.com). The MEDLINE search was updated weekly for any new relevant citations until March 2009. The bibliographies of studies meeting the final inclusion criteria and those of previous systematic reviews9, 10, 11, 12 were also searched.

An appropriate patient spectrum and reference standard were deemed essential to the validity of the systematic review. Therefore, specific inclusion and exclusion criteria for selecting studies were applied to ensure that any inferences made according to the results of the meta-analysis were appropriate and applicable to the ED setting.13 Only studies with patients who presented or were referred to the ED with acute abdominal pain suspicious for acute appendicitis, but not immediate candidates for surgery, were included. Studies with a mix of adult and pediatric patients were excluded unless data for those aged 16 years or older were clearly reported or made available from authors who were contacted. Studies were included if the CT was completed using a multislice helical scanner. Studies aimed at assessing the accuracy of a single-row CT were excluded because this technology is no longer used in the United States. Studies were included only if an appropriate reference standard for diagnosis or exclusion of acute appendicitis was used in all patients; in other words, partial verification bias was an exclusion criterion. The diagnosis of appendicitis could be determined at surgery by the surgeon or on assessment of the specimen by a pathologist. Given our requirement that studies include an appropriate patient spectrum, it was expected that not all patients would undergo laparotomy. Therefore, uneventful clinical follow-up for a minimum of 2 weeks was also considered an acceptable reference standard for the exclusion of clinically significant acute appendicitis. Studies were not included if length of clinical follow-up was not reported or could not be confirmed by author contact.

Data Collection and Processing 

Two reviewers (V.H., J.A.D.) independently performed a relevance search; both reviewers examined the titles and abstracts of all references identified to determine whether the article was relevant to the research question. The 2 reviewers then compared their inclusion and exclusion logs, and the level of agreement was calculated with the κ statistic. Any disagreements were adjudicated by a third investigator (M.D.B.).

A data collection form was used to extract data from each study that appeared to satisfy the inclusion criteria. An attempt was made to contact the authors of studies that required data clarification or supplementation. Again, 2 reviewers independently abstracted the data, and consensus was reached by conference between the 2 reviewers (V.H., J.A.D.); disagreements were resolved by the third party (M.D.B.).

As described under the inclusion criteria, an appropriate reference standard was required for a study to be included in the systematic review. Given that an appropriate patient spectrum was required for inclusion, it was recognized that differential verification bias would likely be present among the included studies (ie, some patients would have clinical follow-up as the reference standard as opposed to surgical confirmation).

We elected to be restrictive by including only higher-quality studies based on the 2 key quality measures discussed under the inclusion criteria above; studies had to include an appropriate patient spectrum and use an acceptable reference standard.14, 15 We then assessed the quality of each study according to the most relevant items from the QUADAS tool.14 QUADAS was developed as an evidence-based quality assessment tool to be used in systematic reviews to assess the quality of primary studies of diagnostic accuracy.14 Each item was rated as yes, no, or unclear, as defined by the authors of QUADAS.14 Although results were reported for all 11 items of the QUADAS instrument, there were 6 items considered particularly relevant to the clinical question: (1) did patients receive the same reference standard regardless of the index test result (ie, was differential verification bias avoided)?; (2) were the index test results interpreted without knowledge of the results of the reference standard?; (3) were the reference standard results interpreted without knowledge of the results of the index test?; (4) were the same clinical data available when test results were interpreted as would be available when the test is used in practice (ie, was the test used in clinically applicable circumstances)?; (5) were uninterpretable/indeterminate test results reported?; and (6) were withdrawals from the study explained? No attempt was made to combine the quality assessment into an overall score; instead, the results of the quality assessment were reported in a summary figure.13

Primary Data Analysis 

Unenhanced CT test performance was assessed with the traditional summary receiver operating characteristic (SROC) curve analysis, with independently pooled sensitivity and specificity values across studies using a random-effects model (for technical details, see Appendix E2, available online at http://www.annemergmed.com).16 A subgroup analysis based on CT technology to detect any difference in test performance with multislice scanners (ie, 4, 16, or 64 slice) versus dual-slice scanners was planned. In addition, a sensitivity analysis was planned with a bivariate random-effects regression model that focuses on estimating the average sensitivity and specificity and also allows for adjustment for sources of bias and variability among studies.17 The statistical analysis was performed with RevMan 5.0.16,18 SAS statistical application program (version 9.1; SAS Institute, Inc., Cary, NC), and S-Plus 8 Enterprise (TIBCO Software Inc., Palo Alto, CA). The precision for the test characteristics of noncontrast CT test was reported with 95% confidence intervals (CIs).

The validity of using funnel plots or statistical models to detect publication bias for diagnostic test meta-analysis has not been established, so we did not formally test for the presence of publication bias.13, 19, 20

Back to Article Outline

Results 

There was good agreement between the 2 reviewers (V.H., J.A.D.) for the relevance screen of the 1,161 titles identified in the MEDLINE, Cochrane, and EMBASE databases (κ=0.62). The comprehensive search yielded a total of 1,258 publications (Figure 1). After adjudication of the relevance search, a full article review was completed on the remaining 32 articles. Upon full article review, 15 studies did not meet our inclusion criteria for various reasons such as inappropriate patient spectrum, inadequate data available for extraction, or the use of contrast.8, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34 A further 10 studies were excluded.7, 35, 36, 37, 38, 39, 40, 41, 42, 43 Reasons for excluding 15 studies after the full review and another 10 studies after attempts at author contact are provided in Table 1. Seven studies were included in the final analysis, providing a sample of 1,060 subjects.44, 45, 46, 47, 48, 49, 50

Table 1. Summary of exclusion of studies.
AuthorsReason for Exclusion
Exclusion of 15 articles after full review
Bendeck et al21
Includes pediatric patients

All patients underwent appendectomy

Length of clinical follow-up not stated

Brontvein et al22
Age range of patients not reported

Abdominal pain not specific for appendicitis

Length of clinical follow-up not stated

D'Ippolito et al23
Includes pediatric patients

All patients underwent appendectomy

Kailidou et al24
Results combined contrast with noncontrast CT scans

Keyzer et al25
Could not extract data from low-dose and standard-dose CT scans

Malone et al26
Includes pediatric patients

Morris et al27
Length of clinical follow-up not stated

Specific CT type and protocol not stated

Styrud et al28
Age range of patients not reported

All patients underwent appendectomy

Torbati et al29
Includes pediatric patients

Results combined contrast with noncontrast CT scans

Vadeboncoeur et al8
Age range of patients not reported

Results combined contrast with noncontrast CT scans

Van Breda Vriesman et al30
Includes pediatric patients

All patients underwent appendectomy

Yüksekkaya et al33Includes pediatric patients
Yetkin et al31Could not verify length of clinical follow-up
Yeung et al32
Includes pediatric patients

Suspicion for appendicitis is not high in all patients; included other causes of acute abdomen

Wise et al34Definition of unenhanced CT scan included oral contrast
Exclusion of 10 articles/abstracts after potential inclusion
Cakirer et al35Could not verify length of clinical follow-up
Christopher et al36Includes pediatric patients
Gamanagatti et al37Includes pediatric patients
Heaston et al38
Age range of patients could not be verified

Could not verify length of clinical follow-up

Hershko et al39Could not verify length of clinical follow-up
Lane, Katz et al40Could not verify length of clinical follow-up
Lane, Liu et al7
Includes pediatric patients

Could not verify length of clinical follow-up

Peck et al41
Includes pediatric patients

Could not verify length of clinical follow-up

Poortman et al42Includes pediatric patients
Togawa et al43
Includes pediatric patients

Could not verify length of clinical follow-up

Could not verify type of CT scan

The prevalence of appendicitis among patients presenting or referred to the ED with acute abdominal pain who were suspected of having appendicitis, but were not immediate candidates for surgery, ranged from 20.1% to 84.5%, with a median of 39.3 % (Table 2). As expected, in the majority of studies (6/7) there were patients who did not undergo surgery and were followed up clinically. In all 6 of these studies, none of the patients who were followed up were later diagnosed with appendicitis. The single study that did not rely on clinical follow-up (in't Hof et al)47 used laparoscopy performed by a surgeon blinded to the CT result as the reference standard for all patients.19 Although enrollment was described as including consecutive ED patients suspected of appendicitis, the prevalence of appendicitis in this study was quite high (84.5%), indicating that the results were likely biased toward patients with a more severe presentation compared with the typical ED population presenting with abdominal pain. The sensitivity and specificity of each study are presented in Table 2 and Figure 2.

Table 2. Summary of study characteristics.
Study by First AuthorYearNo.Age RangePrevalence of Appendicitis, %TPTNFPFNSe/Sp
Ashraf20065816–6739.72135020.91/1.00
Ege200229616–6936.5104185340.96/0.98
Horton20004918–6577.63711010.97/1.00
in't Hof200410316–8284.58316040.95/1.00
Keyzer20059416–8131.92659540.87/0.92
Stacher19995623–6139.32134010.95/1.00
Tamburrini200740418–8620.1733101380.90/0.96

TP, True positive; TN, true negative; FP, false positive; FN, false negative; Se, sensitivity; Sp, specificity.

The strict inclusion criteria required that all 7 studies be rated “yes” as having a representative patient spectrum and acceptable reference standard (Figure 3). All 7 studies were also rated “yes” for an acceptable delay between the CT and the measurement of the reference standard; after noncontrast CT, patients were either operated on in a timely manner or had adequate follow-up of at least 2 weeks. Except for the study by in't Hof et al,47 in which all patients had laparoscopy regardless of the CT result, the other studies were rated “no” with respect to avoiding differential verification bias. Differential verification bias was unavoidable in studies that used clinical follow-up as a reference standard when the CT result was negative. Indeterminate (uninterpretable) results were clearly reported in 2 studies, Keyzer et al48 and Tamburrini et al.50 The patients with indeterminate results in the study by Tamburrini et al50 were excluded from their primary analysis and underwent a repeat CT with contrast; retrospective review of the indeterminate noncontrast CT results agreed with the prospective reading of the contrast CT in 73% of observations (κ=0.322).50 The study by Keyzer et al48 reported 6 CT interpretations as inconclusive and included these in the negative CT result group.48 Although the other 5 studies did not separately report the number of indeterminate CT readings, they did include indeterminate scans among their negative CT results. Patient withdrawals were adequately reported in only 2 studies.46, 50

Sensitivity Analyses 

According to the traditional SROC curve analysis,51 there was no evidence of a significant threshold effect (β=–0.3; 95% CI –1.9 to 1.3). The random-effects pooled estimates for sensitivity and specificity were 92.7% (95% CI 89.5% to 95.0%) and 96.1% (95% CI 94.2% to 97.5%), respectively. According to these summary estimates, the positive likelihood ratio=24 and the negative likelihood ratio=0.08. The SROC curve plot is presented in Figure 4. The small size and number of studies precluded any meaningful subgroup analysis, and the sensitivity analysis using the bivariate random-effects method failed to converge.

  • View full-size image.
  • Figure 4. 

    SROC curve analysis of unenhanced CT in the diagnosis of acute appendicitis. Individual studies are depicted as circles. The SROC curve was created using the (unweighted) method as described by Moses et al2 (see text for explanation).

Back to Article Outline

Limitations 

There were a number of studies that were omitted from our systematic review after we were unable to confirm length of follow-up with the authors.35, 39, 40 If we had been able to confirm adequate follow-up, the addition of these studies to our meta-analysis may have affected our overall estimates of sensitivity and specificity.

The inconsistency of reporting inconclusive CT results is an important limitation because at least 1 study has demonstrated a 41% incidence of acute appendicitis with equivocal CT interpretations.41 In the only study within our systematic review that separately reported data on inconclusive CT scans, 24% were considered inconclusive with an associated likelihood ratio=0.9, indicating that a CT result interpreted by the radiologist as equivocal should neither increase nor decrease the clinician's pretest probability of disease. The important clinical question addressing how to treat the patient with an inconclusive CT is an area ripe for further investigation.

The quality assessment demonstrated that the majority of studies were prone to differential verification bias, which may have inflated the estimates for test accuracy.15 For example, if patients with a negative CT result and uneventful clinical follow-up actually had subclinical appendicitis that resolved without treatment, a false-negative result would have been missed. On the other hand, many would argue that cases of subclinical appendicitis that resolve without treatment are not patient-important outcomes. All 7 studies reported adequate blinding of the radiologist interpreting the CT. Although only 1 study clearly reported blinding of the reference standard interpretation, it is unlikely that single blinding had a significant effect on the overall results.15

As described in the “Materials and Methods,” we did not formally test for the presence of publication bias, given the questionable validity of using funnel plots or statistical models to detect publication bias for diagnostic test meta-analysis.13 However, if publication bias were present, it would be expected that our estimates for test sensitivity and specificity would be overstated.

Back to Article Outline

Discussion 

Acute appendicitis remains the most common cause of acute abdominal pain requiring urgent surgery.44 Despite the high prevalence of appendicitis, the diagnosis is still problematic and perforation can occur within 24 hours of the onset of symptoms.12 The lifetime incidence of appendicitis in the Western world is 6.7% for females and 8.6% for males, but the lifetime chance of appendectomy is 23.1% and 12.0%, respectively.47 Traditionally, a high negative appendectomy rate of 10% to 20% has been considered acceptable to minimize the number of missed cases of appendicitis.52 However, removal of a normal appendix is associated with an early complication rate of 7% to 13% and a late complication rate of 4%47; hence, it is not a benign procedure. The clinical presentation of acute appendicitis is often atypical and may mimic other abdominal conditions, confounding its clinical diagnosis and resulting in a clinical diagnostic accuracy of only 60% to 80%.21, 44 Thus, CT has become the favored diagnostic test for adults presenting to the ED with suspected acute appendicitis. Our pooled summary estimate for specificity would suggest that, when unenhanced CT is used, the acceptable negative appendectomy rate should be only approximately 4%.

There are currently many CT protocols for the diagnosis of acute appendicitis, including noncontrast CT, intravenous or oral contrast, rectal contrast, and focused appendiceal imaging versus scanning through the entire abdomen and pelvis. The potential advantages of nonenhanced studies include earlier completion of imaging, thus avoiding delayed diagnoses and allowing more expedient care, which is especially important in the setting of ED crowding; avoidance of exposure to contrast materials; potential cost savings; and decreased patient discomfort and increased patient satisfaction. The usefulness of oral contrast is not clear, and there is considerable variability in oral contrast protocols with respect to type of oral contrast, quantity, and timing of administration. Earlier-generation CT scans may have mandated its use because the longer image acquisition times and artifact caused by respiratory and peristaltic motion led to image degradation. However, helical scanning and multidetector CT with rapid image acquisition times have made the use of oral contrast of questionable value.53 Factors that may contribute to improved accuracy of noncontrast CT interpretation include presence of intra-abdominal fat, increased severity of disease at image acquisition, thin sections (<5 mm), coronal and sagittal reformatting, and the experience of the interpreting radiologist. Difficulties in interpretation of CT may occur when normal, fluid-filled small bowel has the appearance of a dilated enlarged appendix. In addition, varying positions of the cecum and calcification in the right lower quadrant may also be misinterpreted. Diagnostic accuracy may be further compromised in patients with little abdominal and intrapelvic fat45; however, one study that directly addressed this issue did not find a significant difference in CT accuracy with varying body mass index.48

Part of the difficulty in comparing contrast and noncontrast CT is the degree of interobserver variability among radiologists. Simple agreement in nonenhanced CT and contrast-enhanced CT interpretation for the diagnosis of acute appendicitis ranges from 80% to 97% and varies with specific radiologic findings (eg, appendicolith versus wall thickening).53 Studies assessing interobserver variability report fair to excellent agreement according to a κ statistic.54 Of course, the experience and training of the radiologists may have an effect on the accuracy of interpretation.55

Six of the 7 studies in this systematic review included radiologists as investigators and all support the use of noncontrast CT scans for the diagnosis of appendicitis.44, 45, 47, 48, 49, 50 Ashraf et al44 state that a “certain level of experience is required for skillful interpretation” of noncontrast CT and have established an imaging protocol at their institution that includes noncontrast CT for evaluating the appendix. Similarly, Lane et al40 mention that as a result of their study on noncontrast CT for suspected appendicitis, their institution now considers noncontrast CT as an alternative to ultrasonography for diagnosing appendicitis. Therefore, it appears that there is already some degree of acceptance for using noncontrast CT scans among radiologists for the diagnosis of appendicitis.

Two recently published meta-analyses comparing CT and ultrasonography in the diagnosis of appendicitis reported similar results to ours, with respect to the ability of CT to rule out appendicitis. The negative likelihood ratio of 0.08 according to our SROC analysis was consistent with that reported by van Randen et al.52 Terasawa et al11 reported summary estimates of 94% (95% CI 91 to 95%) for sensitivity, 95% (95% CI 93 to 96%) for specificity, and 0.09 for the negative likelihood ratio. Terasawa et al11 observed that the test characteristics among the individual studies were similar despite variation in the use of contrast and CT technology but methodological limitations may have inflated estimates of diagnostic accuracy.

Our systematic review is unique in that it specifically focuses on the diagnostic accuracy of noncontrast CT in adult patients suspected of having acute appendicitis. Reviews by Terasawa et al,11 Weston et al,12 and van Randen et al52 compare the use of CT to ultrasonography in diagnosis of acute appendicitis but include only 2 to 3 studies that assess the accuracy of noncontrast CT and also include pediatric populations. The systematic review by Anderson et al9 focuses on whether oral contrast is necessary for the CT diagnosis of appendicitis in adults but includes rectal contrast in the noncontrast CT group.

As with any diagnostic test that does not have perfect sensitivity or specificity, CT cannot exclude appendicitis with 100% certainty and must be interpreted within the clinical context. Depending on the individual patient's condition and circumstances, clinical judgment must be used when deciding to perform contrast-enhanced or unenhanced CT for suspected appendicitis. Our 7.3% summary estimate for the false-negative rate is within the range of false-negative rates (3% to 17%) reported in a systematic review that included various CT contrast protocols.9 The ultimate goal of CT imaging in patients presenting with abdominal pain suspicious for appendicitis is to make a prompt diagnosis and decrease the rate of appendectomies performed on patients without appendicitis. Although some authors have reported a decrease in the rate of appendectomies performed on patients without appendicitis with the use of helical CT,41 others argue that there has been little change in the rate of surgical intervention or rate of perforation.56 Unfortunately, these observations were made using older technology and this type of outcome data has not been reported since the introduction of multislice CT. Despite the lack of data on long-term outcomes, our systematic review demonstrates high sensitivity and specificity for non-enhanced CT in the diagnosis of acute appendicitis and should be considered as an alternative to contrast CT, particularly in patients with contraindications to contrast such as those at risk for contrast-induced nephropathy.

Back to Article Outline

 

The authors thank Patrick Bossuyt, PhD, for his recommendations about the bivariate method of analysis. The authors also thank Diana Delgado, AHIP, and Paul Albert, MLS, of the Weill Cornell Medical Library for their library and technical assistance. The authors acknowledge the following corresponding authors: Kashif Ashraf, MD, Shivanand Gamanagatti, MD, Marc D. Horton, MD, and Stefania Tamburrini, MD

Back to Article Outline

Appendix E1. Ovid MEDLINE 1950 to March, week 3, 2009 


1.Appendicitis/

2.Appendectomy/

3.append$.tw.

4.or/1-3

5.exp Tomography, X-Ray Computed/

6.exp Contrast Media/

7.ra.fs.

8.computed tomography.tw.

9.cat scan$.tw.

10.ct.tw.

11.helical.tw.

12.enhanced.tw.

13.unenhanced.tw.

14.contrast$.tw.

15.noncontrast.tw.

16.or/5-15

17.4 and 16

18.exp “Sensitivity and Specificity”/

19.sensitivity.tw.

20.specificity.tw.

21.((pre-test or pretest) adj probability).tw.

22.post-test probability.tw.

23.predictive value$.tw.

24.likelihood ratio$.tw.

25.or/18-24

26.17 and 25

Back to Article Outline

Appendix E2. Technical information 

The traditional SROC curve analysis was carried out with a regression model based on unweighted least-squares estimation, along with plots on the difference between the logit of the true-positive rate (TPR) and the logit of the false-positive rate (FPR) (D=logit TPR–logit FPR) on the y axis and the sum (S=logit TPR+logit FPR) on the x axis.1, 2, 3 The y axis (D) is equivalent to the log diagnostic odds ratio, and the x axis (S) is a measure of how the test characteristics vary with the test threshold. A regression equation (D=α+β×S) derived from the SROC curve analysis was used to assess for a threshold effect.4, 5 A threshold effect was assumed to be absent if the β coefficient was near zero, implying a symmetric SROC curve.1, 6 The sensitivity and specificity were pooled independently of each other. The logits of the sensitivity and its variance were calculated, and then the random-effects model of DerSimonian and Laird7 was used for pooling. The pooled logit sensitivity was then transformed back to the original representation. The pooled specificity was obtained similarly. If there was a value of zero in a 2×2 table, then a correction factor of one-half was added to each cell for the logit calculation to avoid numeric problems.1, 7 The validity of using formal statistical tests for assessing heterogeneity in diagnostic meta-analysis has been recently questioned8; therefore, we did not calculate the I-squared statistic for the diagnostic odds ratio as originally described in our protocol.

Back to Article Outline

References 

  1. Birnbaum BA, Wilson SR. Appendicitis at the millennium. Radiology. 2000;215:337–348
  2. Paulson EK, Kalady MF, Pappas TN. Suspected appendicitis. N Engl J Med. 2003;348:236–243
  3. Rao PM, Rhea JT, Novelline RA, et al. Effect of computed tomography of the appendix on treatment of patients and use of hospital resources. N Engl J Med. 1998;338:141–146
  4. Kim K, Rhea JE, Lee CC, et al. Impact of helical computed tomography in clinically evident appendicitis. Emerg Med J. 2008;25:477–481
  5. Balthazar EJ, Rofsky NM, Zucker R. Appendicitis: the impact of computed tomography imaging on negative appendectomy and perforation rates. Am J Gastroenterol. 1998;93:768–771
  6. Antevil JA, Rivera L, Langenberg BL, et al. Computed tomography-based clinical diagnostic pathway for acute appendicitis: prospective validation. J Am Coll Surg. 2006;203:849–856
  7. Lane MJ, Liu DM, Huynh MD, et al. Suspected acute appendicitis: nonenhanced helical CT in 300 consecutive patients. Radiology. 1999;213:341–346
  8. Vadeboncoeur TF, Heister RR, Behling CA, et al. Impact of helical computed tomography on the rate of negative appendicitis. Am J Emer Med. 2006;24:43–47
  9. Anderson BA, Salem L, Flum DR. A systematic review of whether oral contrast is necessary for the computed tomography diagnosis of appendicitis in adults. Am J Surg. 2005;190:474–478
  10. Doria AS, Moineddin R, Kellenberger CJ, et al. US or CT for diagnosis of appendicitis in children and adults? (a meta-analysis). Radiology. 2006;241:83–94
  11. Terasawa T, Blackmore C, Bent S, et al. Systematic review: computed tomography and ultrasonography to detect acute appendicitis in adults and adolescents. Ann Intern Med. 2004;141:537–546
  12. Weston AR, Jackson TJ, Blamey S. Diagnosis of appendicitis in adults by ultrasonography or computed tomography: a systematic review and meta-analysis. Int J Technol Assessm Health Care. 2005;21:368–379
  13. Leeflang MMG, Deeks JJ, Gatsonis C, et al. Cochrane Diagnostic Test Accuracy Working Group Systematic reviews of diagnostic test accuracy. Ann Intern Med. 2008;149:889–897
  14. Whiting P, Rutjes A, Reitsma J, et al. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med Res Methodol. 2003;3:25
  15. Rutjes AW, Reitsma JB, Di Nisio M, et al. Evidence of bias and variation in diagnostic accuracy studies. CMAJ. 2006;174:469–476
  16. Dinnes J, Deeks J, Kirby J, et al. A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy. Health Technol Assess. 2005;9:iii, 1-113
  17. Reitsma JB, Glas AS, Rutjes AW, et al. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005;58:982–990
  18. Review Manager (RevMan) [computer program]. Version 5.0.16. Copenhagen, Denmark: Nordic Cochrane Centre, Cochrane Collaboration; 2008;
  19. Terrin N, Schmid CH, Lau J. In an empirical evaluation of the funnel plot, researchers could not visually identify publication bias. J Clin Epidemiol. 2005;58:894–901
  20. Tatsioni A, Zarin DA, Aronson N, et al. Challenges in systematic reviews of diagnostic technologies. Ann Intern Med. 2005;142(12 pt 2):1048–1055
  21. Bendeck SE, Nino-Murcia M, Berry GJ, et al. Imaging for suspected appendicitis: negative appendectomy and perforation rates. Radiology. 2002;225:131–136
  22. Brontvein AT, Egorov VI, Vitko NK, et al. Radiodiagnosis of acute appendicitis. Khirurgiia (Mosk). 2002;12:13–16
  23. D'Ippolito G, Netto de Mello GG, Szejnfeld J. The value of unenhanced CT in the diagnosis of acute appendicitis. Sao Paulo Med J (Rev Paul Med). 1998;116:1838–1845
  24. Kailidou E, Pikoulis E, Katsiva V, et al. Spiral computerized tomography: a diagnostic aid in patients with atypical acute appendicitis. Acta Chir Belg. 2006;106:47–51
  25. Keyzer C, Tack D, de Maertelaer V, et al. Acute appendicitis: comparison of low-dose and standard-dose unenhanced multi-detector row CT. Radiology. 2004;232:164–172
  26. Malone AJ, Wolf CR, Malmed AS, et al. Diagnosis of acute appendicitis: value of unenhanced CT. AJR Am J Roentgenol. 1993;160:763–766
  27. Morris KT, Kavanagh M, Hansen P, et al. The rational use of computed tomography scans in the diagnosis of appendicitis. Am J Surg. 2002;183:547–550
  28. Styrud J, Josephson T, Eriksson S. Reducing negative appendectomy: evaluation of ultrasonography and computer tomography in acute appendicitis. Int J Qual Health Care. 2000;12:65–68
  29. Torbati SS, Guss DA. Impact of helical computed tomography on the outcomes of emergency department patients with suspected appendicitis. Acad Emerg Med. 2003;10:823–829
  30. Van Breda Vriesman AC, Kole BJ, Puylaert JBCM. Effect of ultrasonography and optional computed tomography on the outcome of appendectomy. Eur Radiol. 2003;13:2278–2282
  31. Yetkin G, Basak M, Işgör A, et al. Can negative appendectomy rate be decreased by using spiral computed tomography without contrast material?. Acta Chir Belg. 2002;102:334–337
  32. Yeung KW, Sheu RS, Chen CY, et al. Evaluation of significance of clinical information and intravenous contrast administration for diagnostic accuracy during emergent abdominal computed tomographic examination. Emerg Radiol. 1997;4:276–282
  33. Yüksekkaya R, Akgül E, İnal M, et al. Unenhanced spiral CT in the diagnosis of acute appendicitis. Tani Girisim Radyol. 2004;10:131–139
  34. Wise SW, Labuski MR, Kasales CJ, et al. Comparative assessment of CT and sonographic techniques for appendiceal imaging. AJR Am J Roentgenol. 2001;176:933–941
  35. Cakirer S, Basak M, Colakoglu B, et al. Diagnosis of acute appendicitis with unenhanced helical CT: a study of 130 patients. Emerg Radiol. 2002;9:155–161
  36. Christopher FL, Lane M, Ward JA, et al. Unenhanced helical CT scanning of the abdomen and pelvis changes disposition of patients presenting to the emergency department with possible acute appendicitis. J Emerg Med. 2002;23:1–7
  37. Gamanagatti S, Vashsht S, Kapoor A, et al. Comparison of graded compression ultrasonography and unenhanced spiral computed tomography in the diagnosis of acute appendicitis. Singapore Med J. 2007;48:80–87
  38. Heaston DR, McClellan JS, Heaston DK. Community hospital experience in 600+ consecutive patients who underwent unenhanced helical CT for suspected appendicitis [abstract]. AJR Am J Roentgenol. 2000;174:53
  39. Hershko DD, Awad N, Fischer D, et al. Focused helical CT using rectal contrast material only as the preferred technique for the diagnosis of suspected acute appendicitis: a prospective, randomized, controlled study comparing three different techniques. Dis Colon Rectum. 2007;50:1223–1229
  40. Lane MJ, Katz DS, Ross BA, et al. Unenhanced helical CT for suspected acute appendicitis. AJR Am J Roentgenol. 1997;168:405–409
  41. Peck J, Peck A, Peck C, et al. The clinical role of noncontrast helical computed tomography in the diagnosis of acute appendicitis. Am J Surg. 2000;180:133–136
  42. Poortman P, Lohle PNM, Schoemaker CMC, et al. Comparison of CT and sonography in the diagnosis of acute appendicitis: a blinded prospective study. AJR Am J Roengenol. 2003;181:1355–1359
  43. Togawa A, Kimura F, Chiku T, et al. Simple way to improve accuracy in diagnosis of quadrant inflammatory disease: how to avoid adverse laparotomy by using plain CT. Hepatogastroenterology. 2005;52:135–138
  44. Ashraf K, Ashraf O, Bari V, et al. Role of focused appendiceal computed tomography in clinically equivocal acute appendicitis. J Pak Med Assoc. 2006;56:200–203
  45. Ege G, Akman H, Sahin A, et al. Diagnostic value of unenhanced helical CT in adult patients with suspected acute appendicitis. Br J Radiol. 2002;75:721–725
  46. Horton MD, Counter SF, Florence MG, et al. A prospective trial of computed tomography and ultrasonography for diagnosing appendicitis in the atypical patient. Am J Surg. 2000;179:379–381
  47. in't Hof KH, van Lankeren W, Krestin GP, et al. Surgical validation of unenhanced helical computed tomography in acute appendicitis. Br J Surg. 2004;91:1641–1645
  48. Keyzer C, Zalcman M, De Maertelaer V, et al. Comparison of US and unenhanced multi-detector row CT in patients suspected of having acute appendicitis. Radiology. 2005;236:527–534
  49. Stacher R, Portugaller H, Preidler KW, et al. Acute appendicitis in unenhanced spiral CT: diagnostic luxury or benefit [in German]?. Rofo. 1999;171:26–31
  50. Tamburrini S, Brunetti A, Brown M, et al. Acute appendicitis: diagnostic value of nonenhanced CT with selective use of contrast in routine clinical settings. Eur Radiol. 2007;17:2055–2061
  51. Irwig L, Macaskill P, Glasziou P, et al. Meta-analytic methods for diagnostic test accuracy. J Clin Epidemiol. 1995;48:119–130
  52. van Randen A, Bipat S, Zwinderman A, et al. Acute appendicitis: meta-analysis of diagnostic performance of CT and graded compression US related to prevalence of disease. Radiology. 2008;249:97–106
  53. Lee S, Coughlin B, Wolfe J, et al. Prospective comparison of helical CT of the abdomen and pelvis without and with oral contrast in assessing acute abdominal pain in adult emergency department patients. Emerg Radiol. 2006;12:150–157
  54. van Randen A, Laméris W, Nio CY, et al. Inter-observer agreement for abdominal CT in unselected patients with acute abdominal pain. Eur Radiol. 2009;19:1394–1407
  55. in't Hof K, Krestin G, Steijerberg E, et al. Interobserver variability in CT scan interpretation for suspected acute appendicitis. Emerg Med J. 2009;26:92–94
  56. Togawa A, Kimura F, Chiku T, et al. Simple way to improve accuracy in diagnosis of quadrant inflammatory disease: how to avoid adverse laparotomy by using plain CT. Hepatogastroenterology. 2005;52:135–138

Back to Article Outline

References 

  1. 1. Dinnes J, Deeks J, Kirby J, et al. A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy. Health Technol Assess. 2005;9:iii, 1-113
  2. 2. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med. 1993;12:1293–1316
  3. 3. Irwig L, Macaskill P, Glasziou P, et al. Meta-analytic methods for diagnostic test accuracy. J Clin Epidemiol. 1995;48:119–130
  4. 4. Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. Clin Chem Lab Med. 2003;41:68–73
  5. 5. Vamvakas EC. Meta-analyses of studies of the diagnostic accuracy of laboratory tests: a review of the concepts and methods. Arch Pathol Lab Med. 1998;122:675–686
  6. 6. Walter SD. Properties of the summary receiver operating characteristic (SROC) curve for diagnostic test data. Stat Med. 2002;21:1237–1256
  7. 7. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7:177–188
  8. 8. Leeflang MMG, Deeks JJ, Gatsonis C, et al. Cochrane Diagnostic Test Accuracy Working Group Systematic reviews of diagnostic test accuracy. Ann Intern Med. 2008;149:889–897

 Supervising editor: Allan B. Wolfson, MD

 Author contributions: VH and JAD conceived the study and reviewed and assessed all relevant studies. VH, JAD, and MDB supervised the conduct of the systematic review and data collection. VH, JAD, and LF conducted the search of all relevant electronic databases, meeting abstracts, and bibliographies. ZJ and MDB provided statistical advice. VH, JAD, ZJ, and MDB analyzed the data. VH and JAD drafted the article, and all authors contributed substantially to its revision. VH takes responsibility for the paper as a whole.

 Funding and support: By Annals policy, all authors are required to disclose any and all commercial, financial, and other relationships in any way related to the subject of this article that might create any potential conflict of interest. The authors have stated that no such relationships exist. See the Manuscript Submission Agreement in this issue for examples of specific conflicts covered by this statement.

 Earn CME Credit: Continuing Medical Education is available for this article at: http://www.ACEP.EMedHome.com.

 Reprints not available from the authors.

 Publication date: Available online September 5, 2009.

 Please see page 52 for the Editor's Capsule Summary of this article.

 Provide feedback on this article at the journal's Web site, www.annemergmed.com.

PII: S0196-0644(09)01140-8

doi:10.1016/j.annemergmed.2009.06.509

Annals of Emergency Medicine
Volume 55, Issue 1 , Pages 51-59.e1, January 2010