|
Background Film mammography has limited sensitivity for the detection of breast cancer in women with radiographically dense breasts. We assessed whether the use of digital mammography would avoid some of these limitations.
Methods A total of 49,528 asymptomatic women presenting for screening mammography at 33 sites in the United States and Canada underwent both digital and film mammography. All relevant information was available for 42,760 of these women (86.3 percent). Mammograms were interpreted independently by two radiologists. Breast-cancer status was ascertained on the basis of a breast biopsy done within 15 months after study entry or a follow-up mammogram obtained at least 10 months after study entry. Receiver-operating-characteristic (ROC) analysis was used to evaluate the results.
Results In the entire population, the diagnostic accuracy of digital and film mammography was similar (difference between methods in the area under the ROC curve, 0.03; 95 percent confidence interval, 0.02 to 0.08; P=0.18). However, the accuracy of digital mammography was significantly higher than that of film mammography among women under the age of 50 years (difference in the area under the curve, 0.15; 95 percent confidence interval, 0.05 to 0.25; P=0.002), women with heterogeneously dense or extremely dense breasts on mammography (difference, 0.11; 95 percent confidence interval, 0.04 to 0.18; P=0.003), and premenopausal or perimenopausal women (difference, 0.15; 95 percent confidence interval, 0.05 to 0.24; P=0.002).
Conclusions The overall diagnostic accuracy of digital and film mammography as a means of screening for breast cancer is similar, but digital mammography is more accurate in women under the age of 50 years, women with radiographically dense breasts, and premenopausal or perimenopausal women. (ClinicalTrials.gov number, NCT00008346
[ClinicalTrials.gov]
.)
The smaller benefit of screening in younger women is probably due to a lower incidence of breast cancer, more rapidly growing tumors, and greater radiographic density of breast tissue in women less than 50 years of age.4 Greater density reduces the sensitivity of mammography5,6 and increases the risk of breast cancer.7,8,9 Digital mammography, which was developed in part to address some of the limitations of film mammography,10 separates image acquisition and display, allowing the optimization of both. Image processing of digital data allows the degree of contrast in the image to be manipulated, so that contrast can be increased in the dense areas of the breast with the lowest contrast.11,12
Despite these apparent differences between the two approaches, previous trials have not found digital mammography to be significantly more accurate than film mammography in the diagnosis of breast cancer.13,14,15,16,17 These studies were limited, however, in that they included only one type of digital detector and had insufficient statistical power to identify relatively small differences in diagnostic accuracy. The Digital Mammographic Imaging Screening Trial (DMIST) was designed to measure relatively small but potentially clinically important differences in diagnostic accuracy between digital and film mammography.
Methods
A detailed account of the design of DMIST has been published previously.18 This trial was conducted by the American College of Radiology Imaging Network. During a two-year period, 49,528 women were recruited to the study at 33 sites. The protocol was approved by the institutional review boards at all sites. All women gave written informed consent. The study was monitored by a data and safety monitoring board. Women who presented for screening mammography at the study sites were eligible to participate unless they reported symptoms, had breast implants, believed they might be pregnant, had undergone mammography for any purpose within the preceding 11 months, or had a history of breast cancer treated with both lumpectomy and radiation.
All participants underwent both digital and film mammography in random order. Five digital-mammography systems were used: the SenoScan (Fischer Medical), the Computed Radiography System for Mammography (Fuji Medical), the Senographe 2000D (General Electric Medical Systems), the Digital Mammography System (Hologic), and the Selenia Full Field Digital Mammography System (Hologic).18
The digital and film examinations for each woman were independently interpreted by two radiologists, one reader for each examination. Readers rated the mammograms using a seven-point malignancy scale suitable for receiver-operating-characteristic (ROC) analysis and the classification of the Breast Imaging Reporting and Data System (BIRADS)19 and recorded whether they recommended that additional tests be performed. A score of 1 on the seven-point malignancy scale indicates a result that is definitely not malignant, a score of 2 findings that are almost definitely not malignant, a score of 3 findings that are probably not malignant, a score of 4 findings that may be malignant, a score of 5 findings that are probably malignant, a score of 6 findings that are almost definitely malignant, and a score of 7 findings that are definitely malignant. A BIRADS score of 0 indicates incomplete data, a score of 1 negative results, a score of 2 benign findings, a score of 3 findings that are probably benign, a score of 4 the presence of a suspicious-appearing abnormality, and a score of 5 findings highly suggestive of cancer.
Readers also rated breast density according to the standard BIRADS scale (extremely dense, heterogeneously dense, scattered fibroglandular densities, and almost completely fat). Radiologists in the United States were all qualified interpreters of mammograms under federal law. Canadian readers met equivalent standards. Each site's lead radiologist was trained in the use of the malignancy scale and trained the site's other readers.
A workup, including a biopsy or aspiration of the suspicious-appearing lesion, was performed if either reader recommended it. A single pathologist or the principal investigator of the study coded all pathological diagnoses on the basis of a review of the cytologic or histologic material or of the local pathology report. All participants were asked to return for a follow-up mammogram at one year.
To establish a reference standard, participants were classified as positive for cancer if breast cancer was pathologically verified within 455 days after the initial study mammogram and negative for cancer if their study records showed negative findings on a pathology report of a biopsy specimen, if the follow-up mammogram at 1 year was normal, or if both criteria were met. The 455-day period gave women more than a year after study entry to undergo follow-up mammography. Some analyses were repeated with the use of an additional reference standard based on information from the first 365 days after initial mammography, an interval used in other publications on screening mammography.5,6,20,21,22,23,24,25,26 The status of participants who were classified as neither positive nor negative for cancer was considered indeterminate if they had a breast biopsy with indeterminate results (owing to insufficient material or an inability to interpret the results); had a follow-up mammogram with a BIRADS score19 of 3, 4, or 5; or died during the follow-up period without receiving a diagnosis of breast cancer. All women whose cancer status was indeterminate had no additional pathological or imaging information available. The reference standard for all other participants who did not fall into these three categories was classified as unknown. Participants with either positive or negative reference-standard status made up the fully verified group.
ROC curves for digital and film mammography were estimated from the pooled data across the study with the use of the malignancy score assigned to each woman at the time of screening mammography and before further workup was conducted. The full areas under the curve (AUCs) were compared with the use of the bivariate, binormal model, which accounts for the paired test design.27,28 A corroborating, nonparametric AUC analysis was also performed.29,30 The AUCs were compared in the entire study cohort (primary study aim) as well as in prespecified subgroups of participants (secondary aims). The latter included subgroups defined according to age (younger than 50 years vs. 50 years or older), breast density (heterogeneously dense or extremely dense vs. less dense), menopausal status (premenopausal or perimenopausal vs. postmenopausal), race (white vs. black vs. other), risk of breast cancer (a lifetime risk of
25 percent vs. <25 percent, as determined by the Gail model31), and the four digital-machine manufacturers. The Bonferroni procedure was used to account for the 15 multiple comparisons in the subgroup analysis, with a P value of 0.003 or less considered to indicate statistical significance.
For descriptive purposes, estimates of the sensitivity, specificity, and positive and negative predictive values of the two methods of mammography were computed on the basis of the seven-point malignancy scale, the BIRADS scale, and the presence or absence of a workup recommendation by the radiologist. For this purpose, the scores for the seven-point malignancy scale were dichotomized as negative (score of 1, 2, or 3) and positive (score of 4, 5, 6, or 7), and the BIRADS ratings were dichotomized as negative (score of 1, 2, or 3) and positive (score of 0, 4, or 5). Results were evaluated for 365 and 455 days of follow-up. McNemar's test was used to compare estimates.
The analysis was confined to the fully verified group. We assessed the effect of missing information on disease status by deriving and comparing estimates of AUCs and sensitivity and specificity using methods for correcting for verification bias in the ROC analysis30 and in the comparisons of sensitivity and specificity.28 Both methods incorporate available information on covariates and assume that the verification status depends only on test outcomes and observed covariates.
Results
Study Population
A total of 49,528 women were enrolled in the trial. Of these, 195 (0.4 percent) were subsequently determined to be ineligible and 194 (0.4 percent) withdrew from the study. In addition, 1489 women (3.0 percent) were excluded from the analysis because the study protocol had not been followed at one participating institution, as determined by on-site audits. Thirty-nine additional women were excluded because the same radiologist interpreted both examinations or the radiologist knew the results of the other examination at the time of interpretation, and 12 were excluded because the examinations were technically inadequate (9 with inadequate film examinations and 3 with inadequate digital examinations). Of the 47,599 remaining women, follow-up information was lacking for 4339 (9.1 percent), and 500 (1.1 percent) had an indeterminate cancer status (474 with follow-up mammograms interpreted as having a BIRADS score of 3, 4, or 5; 20 who had insufficient biopsy specimens or nondiagnostic biopsy findings; 6 who died without receiving a diagnosis of breast cancer; and none of whom had definitive information concerning pathological or imaging results). Thus, we were left with data on 42,760 women (86.7 percent of those eligible) for the primary analysis. All interpreted mammograms other than the listed exclusions were included in the analysis, including those obtained from 203 women who underwent only one type of mammography (188 [0.4 percent] underwent film mammography alone, and 15 [0.04 percent] digital mammography alone, primarily owing to equipment malfunctions). Table 1 lists the characteristics of the eligible women and the women who were included in the analysis.
|
Using the dichotomized seven-point malignancy scale, we found that 223 women (0.5 percent) had both positive digital and positive film mammograms, 947 women (2.2 percent) had only positive digital mammograms, 832 women (1.9 percent) had only positive film mammograms, and 40,553 women (94.8 percent) had neither positive film nor positive digital examinations. For the remaining 205 women (0.5 percent), interpretations for either digital or film mammograms were missing (187 negative and 3 positive film examinations and 15 negative digital mammograms).
Using the dichotomized BIRADS scale, we found that 1249 women (2.9 percent) had both positive digital and positive film mammograms, 2399 women (5.6 percent) had only positive digital mammograms, 2416 women (5.7 percent) had only positive film mammograms, and 36,696 (85.8 percent) had neither positive film nor positive digital examinations.
Breast Cancers
A total of 335 breast cancers were diagnosed in the DMIST cohort on the basis of reference-standard information during the 455 days after study entry (Table 2). Of these 335 cancers, 254 (75.8 percent) were diagnosed within 365 days after study mammography and 81 (24.2 percent) were diagnosed between 366 and 455 days after study mammography. The histologic findings and the stage of the breast cancers detected by the two methods were similar.
|
The diagnostic accuracy of digital and film mammography was similar in the fully verified group, as reflected by a mean (±SE) AUC of 0.78±0.02 for digital mammography and of 0.74±0.02 for film mammography (difference in AUC, 0.03; 95 percent confidence interval, 0.02 to 0.08; P=0.18) (Figure 1A). The AUC for digital mammography also did not vary significantly from that for film mammography according to race, the risk of breast cancer, or the type of digital machine used.
|
Table 3 and Table 4 show estimates of the sensitivity, specificity, and positive predictive value of each method on the basis of the seven-point malignancy scale after 455 days of follow-up and the BIRADS scale after 365 days of follow-up, dichotomized at each possible threshold. The tables also show digital and film mammography in terms of their sensitivities and specificities, computed at the main thresholds specified above. Detailed results of statistical analyses for sensitivity and specificity with the use of the seven-point malignancy scale at the 365-day follow-up and the BIRADS scale at the 455-day follow-up are provided in the Supplementary Appendix (available with the full text of this article at www.nejm.org). When the comparisons of sensitivities and specificities were adjusted for verification bias, the results were qualitatively similar.
|
|
We found that digital mammography was significantly better than conventional film mammography at detecting breast cancer in young women, premenopausal and perimenopausal women, and women with dense breasts. There was no significant difference in diagnostic accuracy between digital and film mammography in the population as a whole or in other predefined subgroups. However, digital mammography offers other advantages over film mammography namely, easier access to images and computer-assisted diagnosis; improved means of transmission, retrieval, and storage of images; and the use of a lower average dose of radiation without a compromise in diagnostic accuracy.32 We believe that the significant improvement in accuracy in specific subgroups of women justifies the use of digital mammography in these groups.
Our results are understandable in the light of the technical advantages of digital mammography over film mammography. In a digital image, the x-ray transmission can be manipulated to enhance visualization of subtle structural changes in tissue over the entire breast. For mammograms, the most problematic areas are those in which cancers can be hidden by adjacent dense tissue owing to small differences in contrast between lesions and the fibroglandular background. The visibility of a subtle mass or cluster of calcifications present in the image can be increased if the image contrast is adjusted.33,34
DMIST did not measure mortality end points. The assumption inherent in the design of the trial is that screening mammography reduces the rate of death from breast cancer and that if digital mammography detects cancers at a rate that equals or exceeds that of film mammography, its use in screening is likely to reduce the risk of death by as much as or more than that conferred by film mammography. The evidence supporting this view is given in Table 2. The cancers detected by digital mammography and missed by film mammography in women under the age of 50 years, women with heterogeneously dense or extremely dense breasts, and premenopausal and perimenopausal women included many invasive and high-grade in situ cases. These are precisely the lesions that must be detected early to save lives through screening. Neither digital nor film mammography found all the breast cancers in the population. Palpable findings and symptoms that develop after screening should be evaluated even if a woman has negative findings on digital mammography.
Why were the sensitivities of both digital and film mammography measured in this study apparently lower than the sensitivities in other studies?20,21,22,23 Estimates of sensitivity depend on the definition used.24 We considered any woman presenting with breast cancer within 455 days after study entry to have been positive for breast cancer at the time of her initial screening mammogram. All women with negative findings on mammography at study entry who had breast cancer at the annual follow-up mammography were thus considered to have false negative results for the analysis. The longer follow-up interval was selected to allow study sites to complete the one-year follow-up and subsequent workup. Some of the cancers detected up to 455 days after study entry were probably present at the time of the initial mammogram, but the use of the 455-day follow-up interval for reporting estimates of diagnostic accuracy is unconventional. Table 4 gives estimates of the diagnostic performance of both digital and film mammography at all cutoff points of the BIRADS scale during the 365-day follow-up period. This allows our estimates of diagnostic performance to be compared with those of others.22,23,25
Although the lead radiologists at each site were trained in the use of the seven-point malignancy scale and they then trained the other radiologists interpreting mammograms, this scale has not been used in other large, published studies. Our results using the BIRADS or follow-up scales can more readily be compared with those published elsewhere.5,6,25 In addition, the percentage of the total population recalled for further workup (14.0 percent) is relatively high, because women underwent two screening tests (digital and film mammography), not just one. The call-back rate of 8.4 percent for both digital and film mammography is similar to or lower than those reported elsewhere for U.S. screening programs.21,26,35
One of the major impediments to the adoption of digital mammography will be its cost: digital systems currently cost approximately 1.5 to 4 times as much as film systems. As part of DMIST, we are performing a formal cost-effectiveness analysis and study of the quality of life of asymptomatic women.
Supported by grants from the National Cancer Institute (U01CA80098, U01CA80098-S1, U01CA79778, and U0179778-S1).
We are indebted to the many people at the headquarters of the American College of Radiology Imaging Network and at the recruiting sites for their important contributions to the study; to the radiologists, physicists, and research associates at the clinical sites; to Dennis Fryback, Anna Tosteson, Shahla Masood, Bruce Hillman, Mitchell Schnall, Thomas Caldwell, Stephen King, Charles Apgar, Irene Mahon, Sophia Sabina, Bernadine Dunning, Jamie Downs, Tess Thompson, Heather Wallace, Elaine Pakuris, Donna Hartfeil, Jessie Flaim-Spetsas, Boris Ginsburgs, Sharon Jones, Maria Oh, Rex Welsh, Tim Welsh, Fraser Wilton, Anthony Levering, Anita Murray, Brenda Young, Cheryl Crozier, Mary Kelly Truran, Chris Steward, Thomas Iarocci, Crystal Wright, Janet Vogel, Karan Boparai, Rolma Mancinow, Josephine Schloesser, Sharlene Snowdon, Vish Iyer, JoAnn Stetz, Robert Smith, and the other members of the data and safety monitoring board; to Aili Bloomquist, Gordon Mawdsley, Sam Shen, Mary Brown, Elodia Cole, Beverly Currence, Cherie Kuzmiak, Ann Sherman, Jason Hauser, Dag Pavic, Marcia Koomen, Robert McLelland, Richard Clark, Christopher Parham, Robyn Ellison, Carolyn Kylstra, Sharon Weeks, Rachel Campbell, Emily Wilde, and the following members of the American College of Radiology Biostatistics Center: Lucy Hanna, Alicia Toledano, Ben Herman, Minran Li, Jean Cormack, Prashni Paliwal, Shang-Ying Shiu, and Helga Marques; and to the late Jo-Ann D'Amato for her important work on this project.
Source Information
From the Departments of Radiology and Biomedical Engineering, the Biomedical Research Imaging Center, and the Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill (E.D.P.); the Center for Statistical Sciences, Brown University, Providence, R.I. (C.G., S.A.); the Department of Radiology, Feinberg School of Medicine, Northwestern University, Chicago (E.H.); the Departments of Medical Imaging (M.Y., R.J.) and Medical Biophysics (M.Y.), University of Toronto, Toronto; the Department of Radiology, Beth Israel Deaconess Medical Center, Boston (J.K.B.); the Department of Radiology, University of Pennsylvania Medical School, Philadelphia (E.F.C.); the Department of Radiology, University of Iowa, Iowa City (L.L.F.); the Department of Radiology, University of California at Los Angeles, Los Angeles (L.B.); the Department of Radiology, Emory University, Atlanta (C.D.); and the Department of Radiology, William Beaumont Hospital, Royal Oak, Mich. (M.R.).
This article was published at www.nejm.org on September 16, 2005.
Address reprint requests to Dr. Pisano at etta_pisano{at}med.unc.edu.
References
The following persons served as principal investigators (PIs) or lead physicists (LPs) at the DMIST clinical sites: Allegheny General Hospital, Pittsburgh W. Poller (PI), J. Och (LP); Beth Israel Deaconess Medical Center, Boston J. Baum (PI), R. Zamenhof (LP); Brown University, Providence, R.I. B. Schepps (PI), D. Shearer (LP); Columbia University, New York S.J. Smith (PI), E. Nickoloff (LP); Elizabeth Wende Breast Clinic, Rochester, N.Y. E. Bonaccio (PI), M. Zuley (PI), A. Tibold (LP); Emory University, Atlanta C. D'Orsi (PI), P. Sprawls (LP); Johns Hopkins University, Baltimore N. Khouri (PI), M. Mahesh (LP); LaGrange Hospital, LaGrange, Ill. T. Merrill (PI), C. Vyborny (PI), R. Nishikawa (LP); Lahey Clinic, Burlington, Mass. R.B. Shah (PI), N. Shaikh (LP); Massachusetts General Hospital, Boston D. Georgian-Smith (PI), J. Quattrochi (LP); Memorial Sloan-Kettering Cancer Center, New York M. Cohen (PI), R. Fleischman (LP); H. Lee Moffitt Cancer Center, Tampa, Fla. A.P. Romilly (PI), K. Coleman (LP); Monmouth County Hospital, Long Branch, N.J. M. Staiger (PI), T. Piccoli (LP); Mount Sinai University, New York S. Feig (PI), Jose Burgos (LP); Northwestern University, Chicago R.E. Hendrick (PI), E. Berns (LP); Shore Memorial Hospital, Somers Point, N.J. R. Menghetti (PI), J. Law (LP); Thomas Jefferson University, Philadelphia C. Piccoli (PI), A. Maidment (LP), E. Gingold (LP); University of California at Davis, Davis K. Lindfors (PI), A. Seibert (LP), J. Boone (LP); University of California at Los Angeles, Los Angeles L. Bassett (PI), V. Cooper (LP); University of Cincinnati, Cincinnati M. Mahoney (PI), R. Samaratunga (LP); University of Colorado, Denver P. Isaacs (PI), J. Lewin (PI), F. Larke (LP); University of Iowa, Iowa City L. Fajardo (PI), K. Berbaum (LP), M. Madsen (LP); University of North Carolina, Chapel Hill E. Pisano (PI), R.E. Johnston (LP); University of Pennsylvania, Philadelphia E. Conant (PI), M. O'Shea (LP), A. Maidment (LP); University of Texas Southwestern Medical Center, Dallas W.P. Evans III (PI), M. Hatab (LP); University of Toronto, Toronto M. Yaffe (PI), A. Bloomquist (LP), G. Mawdsley (LP); University of Virginia, Charlottesville J. Harvey (PI), M. Williams (LP); University of Washington, Seattle A. Freitas (PI), K. Kanal (LP); Washington Radiology Associates, Washington, D.C. L. Glassman (PI), J. Greenberg (PI), M. Goodwill (LP); Washington University, St. Louis D. Farria (PI), G. Fletcher (LP); William Beaumont Hospital, Royal Oak, Mich. M. Rebner (PI), D. Bakalyar (LP).
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Related Letters:
Digital and Film Mammography
Crystal P., Strano S., Keen J. D., Ebell M. H., Gatsonis C., Pisano E. D., Hendrick E.
Extract |
Full Text |
PDF
N Engl J Med 2006;
354:765-767, Feb 16, 2006.
Correspondence
This article has been cited by other articles:
HOME | SUBSCRIBE | SEARCH | CURRENT ISSUE | PAST ISSUES | COLLECTIONS | PRIVACY | HELP | beta.nejm.org Comments and questions? Please contact us. The New England Journal of Medicine is owned, published, and copyrighted © 2008 Massachusetts Medical Society. All rights reserved. |