The New England Journal of Medicine
e-mail icon  FREE NEJM E-TOC    HOME   |   SUBSCRIBE   |   CURRENT ISSUE   |   PAST ISSUES   |   COLLECTIONS   |    Advanced Search
Sign in | Get NEJM's E-Mail Table of Contents — Free | Subscribe
 
Special Article
PreviousPrevious
Volume 328:1237-1243 April 29, 1993 Number 17
NextNext

Advances in Diagnostic Imaging and Overestimations of Disease Prevalence and the Benefits of Therapy
William C. Black, and H. Gilbert Welch

 

Tools and Services
-Add to Personal Archive
-Add to Citation Manager
-Notify a Friend
-E-mail When Cited

More Information
-PubMed Citation
Over the past two decades a vast new armamentarium of diagnostic techniques has revolutionized the practice of medicine. The entire human body can now be imaged in exquisite anatomical detail. Computed tomography (CT), magnetic resonance imaging (MRI), and ultrasonography routinely "section" patients into slices less than a centimeter thick. Abnormalities can be detected well before they produce any clinical signs or symptoms. Undoubtedly, these technological advances have enhanced the physician's potential for understanding disease and treating patients.

Unfortunately, these technological advances also create confusion that may ultimately be harmful to patients. Consider the case of prostate cancer. Although the prevalence of clinically apparent prostate cancer in men 60 to 70 years of age is only about 1 percent,1 over 40 percent of men in their 60s with normal rectal examinations have been found to have histologic evidence of the disease2. Consequently, because the prostate is studied increasingly by transrectal ultrasonography3,4 and MRI, which can detect tumors too small to palpate, the reported prevalence of prostate cancer increases. In addition, the increased detection afforded by imaging can confuse the evaluation of therapeutic effectiveness. As the spectrum of detected prostate cancer becomes broader with the addition of tumors too small to palpate, the reported survival from the time of diagnosis improves regardless of the actual effect of the new tests and treatments5.

In this article, we explain how advances in diagnostic imaging create confusion in two crucial areas of medical decision making: establishing how much disease there is and defining how well treatment works. Although others have described these effects in the narrow context of mass screening6,7 and in a few clinical situations, such as the staging of lung cancer,5 these consequences of modern imaging increasingly pervade everyday medicine. Besides describing the misperceptions of disease prevalence and therapeutic effectiveness, we explain how the increasing use of sophisticated diagnostic imaging promotes a cycle of increasing intervention that often confers little or no benefit. Finally, we offer suggestions that may minimize these problems.

Prevalence of Disease

As a general principle, the prevalence of any disease increases with the observer's ability to detect the abnormalities associated with the disease. In the case of diagnostic imaging, the ability to detect an anatomical abnormality is closely related to the size of the abnormality. Thus, as technological advances make it possible for imaging equipment to detect smaller abnormalities, more are found and the prevalence of the associated disease increases accordingly. The problem is illustrated if one asks the deceptively simple question, "How many islands surround Britain's coast"8? There is no one correct answer, because the number of islands increases with the resolution of the map on which they are counted. There are analogous relations between prevalence and diagnostic scrutiny for several familiar diseases.

Abdominal Aortic Aneurysms

Two decades ago, abdominal aortic aneurysms were detected mainly by palpation. The threshold in size for palpation of an aneurysm is about 5 cm when the clinician is directed to look for the abnormality in the investigational setting and considerably higher when the clinician is not so directed9. Today, however, most abdominal aortic aneurysms are detected by ultrasonography or CT, which have a detection threshold well below 3 cm, the most commonly used criterion for the diagnosis of these aneurysms.

A recent screening study of 201 patients at high risk (men between the ages of 60 and 75 years with hypertension or coronary artery disease) showed how much ultrasonography can affect the reported prevalence of abdominal aortic aneurysms. As shown in Figure 1, five aneurysms were detected on physical examination as a "definite pulsatile mass," and four of the five were 5 cm in diameter or larger9. However, 18 aneurysms 3.6 cm or more in diameter were detected in the same population by ultrasound. Of the 13 aneurysms not detected by physical examination, 1 measured >= 5 cm, 5 measured >= 4 cm, and 13 measured >= 3.6 cm. Thus, from the perspective of the clinician performing physical examinations, the prevalence of abdominal aortic aneurysms in this high-risk population was only 2 percent, and the modal (i.e., the most common) size was larger than 5.0 cm. From the perspective of the ultrasonographer, however, the prevalence of aneurysms in this population was 9 percent, and the modal size was less than 4.0 cm.


View larger version (20K):
[in this window]
[in a new window]
 
Figure 1. Real and Apparent Distributions of Abdominal Aortic Aneurysms, According to Size, in 201 High-Risk Patients, as Determined by Ultrasound and Physical Examination, Respectively.

Data are from Lederle et al9.

 
These relations between size, prevalence, and detection threshold can explain why the reported incidence of abdominal aortic aneurysms increased by seven times in the Mayo Clinic area after the introduction of ultrasonography and why the increase was greatest -- by more than 10 times -- for small aneurysms (<5 cm)10.

Cancer

For diseases defined on the basis of macroscopic size criteria, such as abdominal aortic aneurysms, the amount of detectable subclinical disease has been fairly well defined by the imaging techniques in current use. Therefore, their prevalences and incidences cannot increase much with future improvements in imaging. However, for diseases defined microscopically, such as cancer, the reservoir of detectable subclinical disease is huge. The prevalence and incidence of cancer have the potential to rise continually as detection thresholds are lowered by advances in imaging.

This potential is most clearly illustrated in the case of the thyroid gland, which has probably been more closely scrutinized for cancer than any other internal organ. According to the Connecticut Tumor Registry,1 the prevalence of clinically apparent thyroid cancer (tumor size >2 cm) is only about 0.1 percent in adults between the ages of 50 and 70 years. By slicing the thyroid at 2.5-mm intervals at autopsy, however, Harach et al.11 found at least one papillary carcinoma in 36 percent of Finnish adults of comparable ages. In addition, Harach et al. realized that the probability that they would observe under the microscope a tumor with a diameter smaller than the distance between slices was equal to the diameter of the tumor divided by the distance between slices. For example, given the 2.5-mm interval between slices, they reasoned that all tumors larger than 2.5 mm in diameter but only one fifth of tumors with a diameter of 0.5 mm (i.e., 0.5/2.5) would be seen under the microscope. Applying this reasoning to the size distribution of the tumors they observed, Harach et al. reconstructed the likely size distribution for thyroid cancer in their patients studied at autopsy (Figure 2) and concluded that the prevalence of histologically verifiable papillary carcinoma was close to, if not equal to, 100 percent if one could look at thin enough slices of the gland.


View larger version (15K):
[in this window]
[in a new window]
 
Figure 2. Real (Estimated) and Apparent (Observed) Distributions of Thyroid Papillary Carcinomas, According to Size, in 101 Patients Studied at Autopsy.

Data are from Harach et al11. See the text for details about the adjustment.

 
Harach's observations raise two important questions. How many histologically verifiable cancers are there in other organs, and what is meant by "cancer"? If cancer is defined as a "cellular tumor the natural course of which is fatal,"12 then the findings of Harach et al. seriously challenge the validity of the accepted pathologic gold standard. Large-scale screening has not been recommended for thyroid cancer, however, and therefore these questions are more immediately relevant to cancers that are being more aggressively pursued, such as breast cancer.

Breast Cancer

Before the widespread use of mammography, most breast cancers were discovered on physical examination, as palpable lumps. In one of the few studies to assess directly the accuracy of physical examination in screening for breast cancer, only 27 percent of tumors more than 1.0 cm in diameter and 10 percent of those less than 1.0 cm in diameter were detected by physical examination13. However, the mean size of breast cancers detected by state-of-the-art screening mammography is about 1.0 cm,14,15 and many of the cancers detected as microcalcifications are only a few millimeters in size.

Again, prevalence depends on the degree of scrutiny. According to the Connecticut Tumor Registry, clinically apparent breast cancer afflicts about 1 percent of all women between the ages of 40 and 50 years1. In a recent medicolegal autopsy study, however, small foci of breast cancer were found in 39 percent of women in this age group16. Most cancers were in the form of ductal carcinoma in situ. Furthermore, over 45 percent of the women with cancer had two or more lesions, and over 40 percent had bilateral lesions. Although it has been argued that such small in situ lesions are not detected by and are therefore irrelevant to screening mammography, about half the lesions in that study16 were detected, usually as microcalcifications, on postmortem plain-film radiography of the resected breasts. Because of continual technical improvements and increasingly broad criteria for the interpretation of mammograms, the detection threshold for breast cancer has fallen considerably since the time of the Breast Cancer Screening Project of the Health Insurance Plan of Greater New York17 (1963 to 1975). This can explain the increased prevalence of cancer on mammographic screening, from 2.717 to 7.614 per 1000 examinations (with the incidence increasing from 1.517 to 3.214 per 1000 examinations). The lower detection threshold can also explain the increase in the percentage of carcinomas in situ (stage 0) among all mammographically detected cancers -- from 12.7 percent17 to over 30 percent15,18,19. The principal indication for biopsy has changed from suspicious mass to suspicious microcalcifications. This can explain why the reported incidence of breast cancer has increased and why most of the increase is in smaller lesions, particularly ductal carcinoma in situ20.

As the foregoing examples illustrate, there are large reservoirs of clinically occult disease. In fact, this is true of most primary cancers and metastases that have been closely scrutinized2,11,16,21,22,23,24,25,26. For such diseases, the observed prevalence can increase considerably as detection thresholds are lowered by advances in imaging (Table 1).

View this table:
[in this window]
[in a new window]
 
Table 1. Effect of Using Clinical and Microscopical Detection Thresholds in Three Cancers.

 
Therapeutic Effectiveness

Not only are advances in imaging changing physicians' perspectives on the prevalence of disease, but they are also distorting their perceptions of the natural history of disease and its response to medical intervention. Because only a tiny fraction of the tests and treatment strategies used in routine practice have been subjected to randomized trials, physicians must rely heavily on less rigorous methods of evaluation, which usually entail comparisons with historical controls. Whether or not these comparisons are based on published series in the literature or on the physician's recollections of personal experience, they are subject to lead-time and length biases5,6,7,27.

Lead-Time Bias

Lead-time bias pertains to comparisons that are not adjusted for the timing of the diagnosis. If survival is measured from the time of diagnosis, as is usual, then the comparison between patients who are given diagnoses earlier on the basis of the test and those given diagnoses on the basis of clinical findings is a biased one, regardless of the real effect of the earlier diagnosis. In the simple case in which earlier diagnosis has no real effect on the length of survival, the new test will appear to prolong survival (by the amount of time between detection with the test and clinical diagnosis). Therefore, the comparison should be adjusted by subtracting the lead time from the group with test-based diagnoses. In general, however, this adjustment cannot be made when a new test becomes available, because the rate of disease progression and hence the lead time afforded by testing are unknown. Furthermore, this adjustment for lead time assumes that test-detected cases progress at the same rate as those that eventually present clinically. When there is variability in the rate of disease progression, as is usually the case, then this assumption is incorrect and introduces a second bias.

Length Bias

Length bias pertains to comparisons that are unadjusted for the rate of progression of disease. The probability that a disease will be detected by testing is directly proportional to the length of its detectable preclinical phase, which is inversely related to its rate of progression (Figure 3). Therefore, disease detected by testing tends to progress less rapidly than disease that would ultimately present clinically in the absence of testing. Furthermore, the effect of the length bias increases in magnitude as the detection threshold of the test is lowered and the spectrum of detected disease is broadened to include the cases progressing the least rapidly (Figure 4). Among these may be cases that would regress, remain stable, or progress too slowly to become clinically apparent during the patient's lifetime. Some authors have described these as cases of "pseudodisease"28 and consider this aspect of length bias separately, as a bias of overdiagnosis29.


View larger version (14K):
[in this window]
[in a new window]
 
Figure 3. Influence of the Rate of Disease Progression on the Probability of Detection.

The length of each arrow represents the length of the detectable preclinical phase, from the time of initial detectability to the time of clinical diagnosis (Dx). Testing at a single moment in time would detect (bold arrows) four slowly progressive cases, but only two rapidly progressive cases. Cases not detected by the test (gray arrows) are diagnosed clinically either before or after the time of testing.

 

View larger version (41K):
[in this window]
[in a new window]
 
Figure 4. Influence of the Detection Threshold on the Spectrum of Detected Disease.

Disease develops in a cohort of patients, and some cases are subsequently detected by testing (stippled area). Only the rapidly progressive cases are detected by the standard test (left panel), whereas the advanced test, which has a lower detection threshold, detects all cases (right panel). These include the slowly progressive cases, which do not reach the clinical threshold before the patient dies of other causes.

 
Unless one can follow a cohort over time, there is no way of accurately estimating the probability that a subclinically detected abnormality will naturally progress to an adverse outcome. The probability of such an outcome is mathematically constrained, however, by the prevalence of the detected abnormality. The upper limit of this probability can be derived from reasoning that dates to the 17th century, when vital statistics were first collected30. If the number of persons dying from a specific disease is fixed, then the probability that a person with the disease will eventually die from it is inversely related to the prevalence of the disease. Therefore, given fixed mortality rates, an increase in the detection of a potentially fatal disease decreases the likelihood that the disease detected in any one person will be fatal.

This constraint is particularly relevant to the detection of subclinical cancer. As the detection threshold of diagnostic imaging decreases to the level of pathological inspection, the upper bound of the probability of dying of detected cancer becomes small. Table 1 shows how small these upper bounds would become given the relatively constant probabilities of eventually dying of breast,31 prostate,31 and thyroid32 cancer.

Apparent, Real, and Spurious Effects

From the perspective of the clinician reviewing case series in the literature or from personal experience, in which patients are tracked from the time of diagnosis, the apparent effect (usually positive) of a new diagnostic test that lowers the detection threshold is equal to the real effect (variable) plus the spurious effects (always positive) of the lead-time and length biases27:

Apparent effect = Real effect + Spurious effect.

The individual effects of the lead-time and length biases may be impossible to disentangle and quantify. Two recent randomized trials demonstrate, however, that the combined effect of these biases -- the spurious effect -- can be the chief component of the apparent effect, with the real effect being zero or negative. In the Malmo mammographic screening trial, women over the age of 45 were randomly assigned to either regular mammography or no screening. The case fatality rate tracked from the time of diagnosis was 15 percent for breast cancers detected in the control group and only 3 percent for breast cancers detected at the time of screening33. This apparent reduction of 80 percent in mortality was entirely attributed to lead-time and length biases, however, because there was no difference in mortality from breast cancer tracked from the time of randomization. In the Czech lung cancer screening trial,34 men at high risk were randomly assigned to either chest radiography twice a year or no screening. The five-year survival was 23 percent for lung cancers diagnosed in the study group and 0 percent for those diagnosed in the control group. Again, this apparent improvement was entirely attributed to lead-time and length biases, because mortality from lung cancer was actually higher in the screened group, indicating that the real effect of screening and subsequent intervention was negative.

It should be emphasized that these studies do not prove screening mammography and chest radiography to be futile. Had the study conditions -- the methods of testing, the modes of therapy, the linkages between interpretation of the film and treatment, or the targeted populations -- been different, the effects of screening might have been better (or worse). These studies do, however, demonstrate examples of screening that was ineffective and yet appeared to be highly effective from the ordinary clinical perspective (that in which patients were tracked from the time of diagnosis). These disparities between the real and apparent effects of screening mammography and chest radiography are especially disturbing in the light of the fact that these are the only screening strategies using diagnostic imaging that have ever been evaluated in a randomized trial.

Lead-time and length biases pertain not only to changes that lower the threshold for detecting disease, but also to new treatments that are applied at the same time. Whether or not new therapy is more effective than old therapy, patients given diagnoses with the use of lower detection thresholds will appear to have better outcomes than their historical controls because of these biases. Consequently, new therapies often appear promising35 and could even replace older therapies that are more effective or have fewer side effects. Because the decision to treat or to investigate the need for treatment further is increasingly influenced by the results of diagnostic imaging, lead-time and length biases increasingly pervade medical practice.

The Cycle of Increasing Intervention

Misperceptions of disease prevalence and therapeutic effectiveness can promote a cycle of increasing medical intervention, despite the best intentions of all parties. The cycle usually begins with some form of increased testing that lowers the threshold for detecting disease, such as technical improvement in imaging tests, more frequent testing, or closer scrutiny of the images. This immediately leads to a higher diagnostic yield of the disease and a spectrum of milder cases. These effects are almost always interpreted as indicating progress and provide immediate reinforcement for the increased testing, despite the caveat that earlier detection is a double-edged sword7. Unfortunately, the assessment of diagnostic accuracy often contributes to the confusion, because the conventional gold standards are surgical or pathological inspection rather than outcomes for patients. Tests that are more sensitive (at a fixed rate of false positive results) are accepted as better, even though they detect a broader spectrum of disease that includes a subgroup whose natural history and response to intervention are unknown. Consequently, the assessment legitimizes the use of the more sensitive imaging test and becomes a distraction from the fundamental question: How should patients with this newly detectable subclinical disease be treated?

Over time, the reported incidence and prevalence of the detected disease increase. In addition, because of lead-time and length biases, the patients' outcomes usually appear to improve, whether or not there is real improvement. Thus, the apparent increase in the number of detected cases and the apparent improvement in the outcome per case detected reinforce the initial increase in testing and treatment and encourage even more use in the future36. Unless they are interrupted by astute clinicians, testing and treatment may become even more frequent as long as there remain undetected cases of disease and new means of detecting them. This cycle pertains both to individual patients, who may get caught in a cascade of interventions,37 and to large populations of patients, who may be subjected to increasingly intensive screening.

That this cycle has strong potential to occur in the absence of any benefit is strongly supported by studies of screening for lung cancer. Higher rates of detection, resectability, and five-year survival (from the time of diagnosis) in the screened populations were reported during the early phases of the four most recent randomized trials38,39,40,41. Each of these trials eventually demonstrated, however, that screening did not reduce mortality from lung cancer34,42,43,44. Despite these findings in lung cancer, the same potentially misleading measures of success are being used today to justify aggressive testing and treatment for other diseases, such as breast cancer. Increased detection of minimal cancers is almost always reported as progress,45 and longer survival (from the time of diagnosis) is commonly used to justify mammographic screening in women under 50 years of age,18 who constitute about half of all women screened in the United States14.

Although we have focused on a few specific diseases in asymptomatic patients, these misperceptions about disease prevalence and therapeutic effectiveness are relevant to a wide range of conditions. Patients are now more likely to be given diagnoses of a variety of conditions, such as gallstones,46 herniated disks,47 meniscal tears,48 deep venous thrombosis,49 and pulmonary embolism,50 whether or not they are symptomatic, whether or not their diagnoses are responsible for the symptoms, and whether or not the patients benefited from medical intervention. In addition, advances in imaging can result in upward migration of the disease stage, regardless of its severity5. This greatly complicates the assessment of therapeutic effectiveness because it makes stage-specific historical comparisons invalid. For example, Feinstein et al.5 have demonstrated how the illusion of increased survival for patients at all stages of lung cancer was created by the diagnostic techniques developed in the 1970s, which have since been replaced by even newer techniques. In fact, the benefit of more accurate staging for cancer in general has not been directly demonstrated, and there is indirect evidence that the overall benefit has been small or nonexistent51.

Conclusions

The past two decades have produced dramatic technological advances in diagnostic imaging. Undoubtedly, many patients have benefited from these advances, particularly those that permit the faster and safer diagnosis of symptomatic, treatable disease. However, technological progress has also created confusion, which needs to be recognized and dealt with. Despite clinicians' best intentions, many patients may have been labeled with diseases they do not really have, and many have been given therapy they do not really need.

Much of the confusion resulting from advances in diagnostic imaging could be eliminated if diseases were categorized more carefully according to size or anatomical extent. Data on prevalence, natural history, and therapeutic effectiveness should be explicitly related to size. For the sake of consistency and precision, size should be recorded in standard dimensional units, such as centimeters and cubic centimeters, as opposed to subjective impressions and overly broad categorizations, such as present versus absent. Stratification according to size and adjustment for the sensitivity of the detection method would make statistics on prevalence less dependent on the constantly changing methods of detection. By minimizing lead-time and length biases, stratification according to size would improve the reliability of historical comparisons used to assess the effectiveness of new tests and treatment strategies. In addition, this approach would help to define those newly detectable strata of disease for which the effectiveness of intervention is unknown, thereby providing the opportunity for prospective trials of alternative interventions, including watchful waiting52,53,54. Finally, size stratification of prevalence and effectiveness would foster evaluations of diagnostic accuracy that are similarly stratified. The accuracy of imaging tests should be measured with regard to the anatomical extent of disease, not simply its presence or absence55. This would facilitate the integration of data on prevalence, effectiveness, and diagnostic accuracy for the determination of probabilities of disease56 and clinical usefulness.

All these recommendations will take time to implement. Meanwhile, clinicians can heed the following advice. First, expect the incidence and prevalence of diseases detectable by imaging to increase in the future. Some increases may be predictable on the basis of autopsy studies or other intensive cross-sectional prevalence studies in sample populations. Others may not be so predictable. All types of increases should be expected. The temptation to act aggressively must be tempered by the knowledge that the natural history of a newly detectable disease is unknown. For many diseases, the overall mortality rate has not changed, and the increased prevalence means that the prognosis for any given patient with the diagnosis has actually improved.

Second, expect that advances in imaging will be accompanied by apparent improvements in therapeutic outcomes. The effect of lead-time and length biases may be potent, and clinicians should be skeptical of reported improvements that are based on historical and other comparisons not controlled for the anatomical extent of disease and the rate of progression. Clinicians may even consider that the opposite may be true -- i.e., real outcomes may have worsened because of more aggressive interventions.

Finally, consider maintaining conventional clinical thresholds for treating disease until well-controlled trials prove the benefit of doing otherwise. This will require patience. A well-designed randomized clinical trial takes time. So does accumulating enough experience on outcomes from nonexperimental methods that can be used to control for the extent of disease and the rate of progression. From the point of view of both patients and policy, it is time well spent.

Dr. Welch was supported by the Department of Veterans Affairs Career Development Program, Health Services Research and Development.

We are indebted to Drs. R. Peter Mogielnicki, Robert F. Nease, Jr., Harold M. Swartz, John H. Wasson, and John E. Wennberg for their critique of this manuscript.


Source Information

From the Center for the Evaluative Clinical Sciences, Dartmouth Medical School, Hanover, N.H. (W.C.B., H.G.W.); the Department of Radiology, Dartmouth-Hitchcock Medical Center, Lebanon, N.H. (W.C.B.); and the Medical Service, Veterans Affairs Hospital, White River Junction, Vt. (H.G.W.).

Address reprint requests to Dr. Black at the Center for the Evaluative Clinical Sciences, Dartmouth Medical School, 318 Strausenburgh Hall, Hanover, NH 03755-3863.

References

  1. Feldman AR, Kessler L, Myers MH, Naughton MD. The prevalence of cancer: estimates based on the Connecticut Tumor Registry. N Engl J Med 1986;315:1394-1397. [Abstract]
  2. Montie JE, Wood DP Jr, Pontes E, Boyett JM, Levin HS. Adenocarcinoma of the prostate in cytoprostatectomy specimens removed for bladder cancer. Cancer 1989;63:381-385. [CrossRef][Medline]
  3. Cooner WH, Mosley BR, Rutherford CL Jr, et al. Prostate cancer detection in a clinical urological practice by ultrasonography, digital rectal examination and prostate specific antigen. J Urol 1990;143:1146-1154. [Medline]
  4. Lee F, Littrup PJ, Torp-Pedersen ST, et al. Prostate cancer: comparison of transrectal US and digital rectal examination for screening. Radiology 1988;168:389-394. [Free Full Text]
  5. Feinstein AR, Sosin DM, Wells CK. The Will Rogers phenomenon: stage migration and new diagnostic techniques as a source of misleading statistics for survival in cancer. N Engl J Med 1985;312:1604-1608. [Abstract]
  6. Feinleib M, Zelen M. Some pitfalls in the evaluation of screening programs. Arch Environ Health 1969;19:412-415. [Medline]
  7. Cole P, Morrison AS. Basic issues in population screening for cancer. J Natl Cancer Inst 1980;64:1263-1272.
  8. Islands, clusters, and percolation; diameter-number relations. In: Mandelbrot BB. The fractal geometry of nature. Rev. ed. New York: W.H. Freeman, 1983:116.
  9. Lederle FA, Walker JM, Reinke DB. Selective screening for abdominal aortic aneurysms with physical examination and ultrasound. Arch Intern Med 1988;148:1753-1756. [Abstract]
  10. Melton LJ III, Bickerstaff LK, Hollier LH, et al. Changing incidence of abdominal aortic aneurysms: a population-based study. Am J Epidemiol 1984;120:379-386. [Free Full Text]
  11. Harach HR, Franssila KO, Wasenius VM. Occult papillary carcinoma of the thyroid: a "normal" finding in Finland: a systematic autopsy study. Cancer 1985;56:531-538. [CrossRef][Medline]
  12. Cancer. In: Dorland's illustrated medical dictionary. 27th ed. Philadelphia: W.B. Saunders, 1988:261.
  13. Hicks MJ, Davis JR, Layton JM, Present AJ. Sensitivity of mammography and physical examination of the breast for detecting breast cancer. JAMA 1979;242:2080-2083. [Abstract]
  14. Sickles EA, Ominsky SH, Sollitto RA, Galvin HB, Monticciolo DL. Medical audit of a rapid-throughput mammography screening practice: methodology and results of 27,114 examinations. Radiology 1990;175:323-327. [Free Full Text]
  15. Kopans DB. The positive predictive value of mammography. AJR Am J Roentgenol 1992;158:521-526. [Free Full Text]
  16. Nielsen M, Thomsen JL, Primdahl S, Dyreborg U, Andersen JA. Breast cancer and atypia among young and middle-aged women: a study of 110 medicolegal autopsies. Br J Cancer 1987;56:814-819. [Medline]
  17. Shapiro S. Evidence on screening for breast cancer from a randomized trial. Cancer 1977;39:Suppl:2772-2782. [CrossRef][Medline]
  18. Stacey-Clear A, McCarthy KA, Hall DA, et al. Breast cancer survival among women under age 50: is mammography detrimental? Lancet 1992;340:991-994. [CrossRef][Medline]
  19. Meyer JE, Eberlein TJ, Stomper PC, Sonnenfeld MR. Biopsy of occult breast lesions: analysis of 1261 abnormalities. JAMA 1990;263:2341-2343. [Abstract]
  20. White E, Lee CY, Kristal AR. Evaluation of the increase in breast cancer incidence in relation to mammography use. J Natl Cancer Inst 1990;82:1524-1528. 
  21. Bottinger LE, Hallberg D, Schreeb TV. Renal carcinoma as an accidental finding. Acta Chir Scand 1964;127:158-161. [Medline]
  22. Grinnell RS, Lane N. Benign and malignant adenomatous polyps and papillary adenomas of the colon and rectum: an analysis of 1,856 tumors in 1,335 patients. Int Abstr Surg 1958;106:519-38.
  23. Hellsten S, Berge T, Wehlin L. Unrecognized renal cell carcinoma: clinical and diagnostic aspects. Scand J Urol Nephrol 1981;15:269-272. [Medline]
  24. Reinig JW, Dwyer AJ, Miller DL, et al. Liver metastasis detection: comparative sensitivities of MR imaging and CT scanning. Radiology 1987;162:43-47. [Free Full Text]
  25. Healy ME, Hesselink JR, Press GA, Middleton MS. Increased detection of intracranial metastases with intravenous Gd-DTPA. Radiology 1987;165:619-624. [Free Full Text]
  26. Xipell JM. The incidence of benign renal nodules (a clinicopathologic study). J Urol 1971;106:503-506. [Medline]
  27. Black WC, Ling A. Is earlier diagnosis really better? The misleading effects of lead time and length biases. AJR Am J Roentgenol 1990;155:625-630. [Free Full Text]
  28. Morrison AS. Screening in chronic disease. Vol. 7 of Monographs in epidemiology and biostatistics. New York: Oxford University Press, 1985:22-3.
  29. Miller AB. Screening and detection. In: Bland KI, Copeland EM, eds. The breast. Philadelphia: W.B. Saunders, 1991:419.
  30. Graunt J. Foundations of vital statistics. In: Newman JR, ed. The world of mathematics. Vol. 3. Redmond, Wash.: Tempus Books, 1988:1399-413.
  31. Seidman H, Mushinski MH, Gelb SK, Silverberg E. Probabilities of eventually developing or dying of cancer -- United States, 1985. CA Cancer J Clin 1985;35:36-56. [Free Full Text]
  32. National Center for Health Statistics. Vital statistics of the United States, 1986. Vol. 2. Mortality. Part B. Washington, D.C.: Government Printing Office, 1986. (DHHS publication no. (PHS) 88-1114.)
  33. Andersson I, Aspegren K, Janzon L, et al. Mammographic screening and mortality from breast cancer: the Malmo mammographic screening trial. BMJ 1988;297:943-948.
  34. Kubik A, Parkin DM, Khlat M, Erban J, Polak J, Adamec M. Lack of benefit from semi-annual screening for cancer of the lung: follow-up report of a randomized controlled trial on a population of high-risk males in Czechoslovakia. Int J Cancer 1990;45:26-33. [Medline]
  35. Sacks H, Chalmers TC, Smith H Jr. Randomized versus historical controls for clinical trials. Am J Med 1982;72:233-240. [CrossRef][Medline]
  36. Smith SJ, Bosniak MA, Megibow AJ, Hulnick DH, Horii SC, Raghavendra BN. Renal cell carcinoma: earlier discovery and increased detection. Radiology 1989;170:699-703. [Free Full Text]
  37. Mold JW, Stein HF. The cascade effect in the clinical care of patients. N Engl J Med 1986;314:512-514. [Medline]
  38. Frost JK, Ball WC Jr, Levin ML, et al. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Johns Hopkins study. Am Rev Respir Dis 1984;130:549-554. [Medline]
  39. Flehinger BJ, Melamed MR, Zaman MB, Heelan RT, Perchick WB, Martini N. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Memorial Sloan-Kettering study. Am Rev Respir Dis 1984;130:555-560. [Medline]
  40. Fontana RS, Sanderson DR, Taylor WF, et al. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Mayo Clinic study. Am Rev Respir Dis 1984;130:561-565. [Medline]
  41. Kubik A, Polak J. Lung cancer detection: results of a randomized prospective study in Czechoslovakia. Cancer 1986;57:2427-2437. [CrossRef][Medline]
  42. Sanderson DR. Lung cancer screening: the Mayo study. Chest 1986;89:Suppl:324S-324S. [Medline]
  43. Tockman MS. Survival and mortality from lung cancer in a screened population: the Johns Hopkins study. Chest 1986;89:Suppl:324S-325S.
  44. Martini N. Results of the Memorial Sloan-Kettering study in screening for early lung cancer. Chest 1986;89:Suppl:325S-325S. [Free Full Text]
  45. Linver MN, Paster SB, Rosenberg RD, Key CR, Stidley CA, King WV. Improvement in mammography interpretation skills in a community radiology practice after dedicated teaching courses: 2-year medical audit of 38,633 cases. Radiology 1992;184:39-43. [Erratum, Radiology 1992;184:878.] [Free Full Text]
  46. Hopper KD, Landis JR, Meilstrup JW, McCauslin MA, Sechtin AG. The prevalence of asymptomatic gallstones in the general population. Invest Radiol 1991;26:939-945. [Medline]
  47. Boden SD, Davis DO, Dina TS, Patronas NJ, Wiesel SW. Abnormal magnetic-resonance scans of the lumbar spine in asymptomatic subjects. J Bone Joint Surg [Am] 1990;72:403-8.
  48. Kornick J, Trefelner E, McCarthy S, Lange R, Lynch K, Jokl P. Meniscal abnormalities in the asymptomatic population at MR imaging. Radiology 1990;177:463-465. [Free Full Text]
  49. Vaccaro JP, Cronan JJ, Dorfman GS. Outcome analysis of patients with normal compression US examinations. Radiology 1990;175:645-649. [Free Full Text]
  50. Huisman MV, Buller HR, ten Cate JW, et al. Unexpected high prevalence of silent pulmonary embolism in patients with deep venous thrombosis. Chest 1989;95:498-502. [Free Full Text]
  51. Bailar JC III, Smith EM. Progress against cancer? N Engl J Med 1986;314:1226-1232. [Abstract]
  52. Scott RAP, Ashton HA, Kay DN. Abdominal aortic aneurysm in 4237 screened patients: prevalence, development and management over 6 years. Br J Surg 1991;78:1122-1125. [Medline]
  53. Sickles EA. Periodic mammographic follow-up of probably benign lesions: results in 3,184 consecutive cases. Radiology 1991;179:463-468. [Free Full Text]
  54. Bosniak MA. The small ( <= 3.0 cm) renal parenchymal tumor: detection, diagnosis, and controversies. Radiology 1991;179:307-317. [Erratum, Radiology 1991;181:289.] [Free Full Text]
  55. Diagnostic and spectral markers. In: Feinstein AR. Clinical epidemiology: the architecture of clinical research. Philadelphia: W.B. Saunders, 1985:597-631.
  56. Black WC, Dwyer AJ. Local versus global measures of accuracy: an important distinction for diagnostic imaging. Med Decis Making 1990;10:226-273.

 

Tools and Services
-Add to Personal Archive
-Add to Citation Manager
-Notify a Friend
-E-mail When Cited

More Information
-PubMed Citation

This article has been cited by other articles:



HOME  |  SUBSCRIBE  |  SEARCH  |  CURRENT ISSUE  |  PAST ISSUES  |  COLLECTIONS  |  PRIVACY  |  HELP  |  beta.nejm.org

Comments and questions? Please contact us.

The New England Journal of Medicine is owned, published, and copyrighted © 2008 Massachusetts Medical Society. All rights reserved.