Over the past two decades a vast new armamentarium of diagnostictechniques has revolutionized the practice of medicine. Theentire human body can now be imaged in exquisite anatomicaldetail. Computed tomography (CT), magnetic resonance imaging(MRI), and ultrasonography routinely "section" patients intoslices less than a centimeter thick. Abnormalities can be detectedwell before they produce any clinical signs or symptoms. Undoubtedly,these technological advances have enhanced the physician's potentialfor understanding disease and treating patients.
Unfortunately, these technological advances also create confusionthat may ultimately be harmful to patients. Consider the caseof prostate cancer. Although the prevalence of clinically apparentprostate cancer in men 60 to 70 years of age is only about 1percent,1 over 40 percent of men in their 60s with normal rectalexaminations have been found to have histologic evidence ofthe disease2. Consequently, because the prostate is studiedincreasingly by transrectal ultrasonography3,4 and MRI, whichcan detect tumors too small to palpate, the reported prevalenceof prostate cancer increases. In addition, the increased detectionafforded by imaging can confuse the evaluation of therapeuticeffectiveness. As the spectrum of detected prostate cancer becomesbroader with the addition of tumors too small to palpate, thereported survival from the time of diagnosis improves regardlessof the actual effect of the new tests and treatments5.
In this article, we explain how advances in diagnostic imagingcreate confusion in two crucial areas of medical decision making:establishing how much disease there is and defining how welltreatment works. Although others have described these effectsin the narrow context of mass screening6,7 and in a few clinicalsituations, such as the staging of lung cancer,5 these consequencesof modern imaging increasingly pervade everyday medicine. Besidesdescribing the misperceptions of disease prevalence and therapeuticeffectiveness, we explain how the increasing use of sophisticateddiagnostic imaging promotes a cycle of increasing interventionthat often confers little or no benefit. Finally, we offer suggestionsthat may minimize these problems.
Prevalence of Disease
As a general principle, the prevalence of any disease increaseswith the observer's ability to detect the abnormalities associatedwith the disease. In the case of diagnostic imaging, the abilityto detect an anatomical abnormality is closely related to thesize of the abnormality. Thus, as technological advances makeit possible for imaging equipment to detect smaller abnormalities,more are found and the prevalence of the associated diseaseincreases accordingly. The problem is illustrated if one asksthe deceptively simple question, "How many islands surroundBritain's coast"8? There is no one correct answer, because thenumber of islands increases with the resolution of the map onwhich they are counted. There are analogous relations betweenprevalence and diagnostic scrutiny for several familiar diseases.
Abdominal Aortic Aneurysms
Two decades ago, abdominal aortic aneurysms were detected mainlyby palpation. The threshold in size for palpation of an aneurysmis about 5 cm when the clinician is directed to look for theabnormality in the investigational setting and considerablyhigher when the clinician is not so directed9. Today, however,most abdominal aortic aneurysms are detected by ultrasonographyor CT, which have a detection threshold well below 3 cm, themost commonly used criterion for the diagnosis of these aneurysms.
A recent screening study of 201 patients at high risk (men betweenthe ages of 60 and 75 years with hypertension or coronary arterydisease) showed how much ultrasonography can affect the reportedprevalence of abdominal aortic aneurysms. As shown in Figure 1,five aneurysms were detected on physical examination as a"definite pulsatile mass," and four of the five were 5 cm indiameter or larger9. However, 18 aneurysms 3.6 cm or more indiameter were detected in the same population by ultrasound.Of the 13 aneurysms not detected by physical examination, 1measured 5 cm, 5 measured 4 cm, and 13 measured 3.6 cm. Thus,from the perspective of the clinician performing physical examinations,the prevalence of abdominal aortic aneurysms in this high-riskpopulation was only 2 percent, and the modal (i.e., the mostcommon) size was larger than 5.0 cm. From the perspective ofthe ultrasonographer, however, the prevalence of aneurysms inthis population was 9 percent, and the modal size was less than4.0 cm.
Figure 1. Real and Apparent Distributions of Abdominal Aortic Aneurysms, According to Size, in 201 High-Risk Patients, as Determined by Ultrasound and Physical Examination, Respectively.
These relations between size, prevalence, and detection thresholdcan explain why the reported incidence of abdominal aortic aneurysmsincreased by seven times in the Mayo Clinic area after the introductionof ultrasonography and why the increase was greatest -- by morethan 10 times -- for small aneurysms (<5 cm)10.
Cancer
For diseases defined on the basis of macroscopic size criteria,such as abdominal aortic aneurysms, the amount of detectablesubclinical disease has been fairly well defined by the imagingtechniques in current use. Therefore, their prevalences andincidences cannot increase much with future improvements inimaging. However, for diseases defined microscopically, suchas cancer, the reservoir of detectable subclinical disease ishuge. The prevalence and incidence of cancer have the potentialto rise continually as detection thresholds are lowered by advancesin imaging.
This potential is most clearly illustrated in the case of thethyroid gland, which has probably been more closely scrutinizedfor cancer than any other internal organ. According to the ConnecticutTumor Registry,1 the prevalence of clinically apparent thyroidcancer (tumor size >2 cm) is only about 0.1 percent in adultsbetween the ages of 50 and 70 years. By slicing the thyroidat 2.5-mm intervals at autopsy, however, Harach et al.11 foundat least one papillary carcinoma in 36 percent of Finnish adultsof comparable ages. In addition, Harach et al. realized thatthe probability that they would observe under the microscopea tumor with a diameter smaller than the distance between sliceswas equal to the diameter of the tumor divided by the distancebetween slices. For example, given the 2.5-mm interval betweenslices, they reasoned that all tumors larger than 2.5 mm indiameter but only one fifth of tumors with a diameter of 0.5mm (i.e., 0.5/2.5) would be seen under the microscope. Applyingthis reasoning to the size distribution of the tumors they observed,Harach et al. reconstructed the likely size distribution forthyroid cancer in their patients studied at autopsy (Figure 2)and concluded that the prevalence of histologically verifiablepapillary carcinoma was close to, if not equal to, 100 percentif one could look at thin enough slices of the gland.
Figure 2. Real (Estimated) and Apparent (Observed) Distributions of Thyroid Papillary Carcinomas, According to Size, in 101 Patients Studied at Autopsy.
Data are from Harach et al11. See the text for details about the adjustment.
Harach's observations raise two important questions. How manyhistologically verifiable cancers are there in other organs,and what is meant by "cancer"? If cancer is defined as a "cellulartumor the natural course of which is fatal,"12 then the findingsof Harach et al. seriously challenge the validity of the acceptedpathologic gold standard. Large-scale screening has not beenrecommended for thyroid cancer, however, and therefore thesequestions are more immediately relevant to cancers that arebeing more aggressively pursued, such as breast cancer.
Breast Cancer
Before the widespread use of mammography, most breast cancerswere discovered on physical examination, as palpable lumps.In one of the few studies to assess directly the accuracy ofphysical examination in screening for breast cancer, only 27percent of tumors more than 1.0 cm in diameter and 10 percentof those less than 1.0 cm in diameter were detected by physicalexamination13. However, the mean size of breast cancers detectedby state-of-the-art screening mammography is about 1.0 cm,14,15and many of the cancers detected as microcalcifications areonly a few millimeters in size.
Again, prevalence depends on the degree of scrutiny. Accordingto the Connecticut Tumor Registry, clinically apparent breastcancer afflicts about 1 percent of all women between the agesof 40 and 50 years1. In a recent medicolegal autopsy study,however, small foci of breast cancer were found in 39 percentof women in this age group16. Most cancers were in the formof ductal carcinoma in situ. Furthermore, over 45 percent ofthe women with cancer had two or more lesions, and over 40 percenthad bilateral lesions. Although it has been argued that suchsmall in situ lesions are not detected by and are thereforeirrelevant to screening mammography, about half the lesionsin that study16 were detected, usually as microcalcifications,on postmortem plain-film radiography of the resected breasts.Because of continual technical improvements and increasinglybroad criteria for the interpretation of mammograms, the detectionthreshold for breast cancer has fallen considerably since thetime of the Breast Cancer Screening Project of the Health InsurancePlan of Greater New York17 (1963 to 1975). This can explainthe increased prevalence of cancer on mammographic screening,from 2.717 to 7.614 per 1000 examinations (with the incidenceincreasing from 1.517 to 3.214 per 1000 examinations). The lowerdetection threshold can also explain the increase in the percentageof carcinomas in situ (stage 0) among all mammographically detectedcancers -- from 12.7 percent17 to over 30 percent15,18,19. Theprincipal indication for biopsy has changed from suspiciousmass to suspicious microcalcifications. This can explain whythe reported incidence of breast cancer has increased and whymost of the increase is in smaller lesions, particularly ductalcarcinoma in situ20.
As the foregoing examples illustrate, there are large reservoirsof clinically occult disease. In fact, this is true of mostprimary cancers and metastases that have been closely scrutinized2,11,16,21,22,23,24,25,26.For such diseases, the observed prevalence can increase considerablyas detection thresholds are lowered by advances in imaging (Table 1).
Table 1. Effect of Using Clinical and Microscopical Detection Thresholds in Three Cancers.
Therapeutic Effectiveness
Not only are advances in imaging changing physicians' perspectiveson the prevalence of disease, but they are also distorting theirperceptions of the natural history of disease and its responseto medical intervention. Because only a tiny fraction of thetests and treatment strategies used in routine practice havebeen subjected to randomized trials, physicians must rely heavilyon less rigorous methods of evaluation, which usually entailcomparisons with historical controls. Whether or not these comparisonsare based on published series in the literature or on the physician'srecollections of personal experience, they are subject to lead-timeand length biases5,6,7,27.
Lead-Time Bias
Lead-time bias pertains to comparisons that are not adjustedfor the timing of the diagnosis. If survival is measured fromthe time of diagnosis, as is usual, then the comparison betweenpatients who are given diagnoses earlier on the basis of thetest and those given diagnoses on the basis of clinical findingsis a biased one, regardless of the real effect of the earlierdiagnosis. In the simple case in which earlier diagnosis hasno real effect on the length of survival, the new test willappear to prolong survival (by the amount of time between detectionwith the test and clinical diagnosis). Therefore, the comparisonshould be adjusted by subtracting the lead time from the groupwith test-based diagnoses. In general, however, this adjustmentcannot be made when a new test becomes available, because therate of disease progression and hence the lead time affordedby testing are unknown. Furthermore, this adjustment for leadtime assumes that test-detected cases progress at the same rateas those that eventually present clinically. When there is variabilityin the rate of disease progression, as is usually the case,then this assumption is incorrect and introduces a second bias.
Length Bias
Length bias pertains to comparisons that are unadjusted forthe rate of progression of disease. The probability that a diseasewill be detected by testing is directly proportional to thelength of its detectable preclinical phase, which is inverselyrelated to its rate of progression (Figure 3). Therefore, diseasedetected by testing tends to progress less rapidly than diseasethat would ultimately present clinically in the absence of testing.Furthermore, the effect of the length bias increases in magnitudeas the detection threshold of the test is lowered and the spectrumof detected disease is broadened to include the cases progressingthe least rapidly (Figure 4). Among these may be cases thatwould regress, remain stable, or progress too slowly to becomeclinically apparent during the patient's lifetime. Some authorshave described these as cases of "pseudodisease"28 and considerthis aspect of length bias separately, as a bias of overdiagnosis29.
Figure 3. Influence of the Rate of Disease Progression on the Probability of Detection.
The length of each arrow represents the length of the detectable preclinical phase, from the time of initial detectability to the time of clinical diagnosis (Dx). Testing at a single moment in time would detect (bold arrows) four slowly progressive cases, but only two rapidly progressive cases. Cases not detected by the test (gray arrows) are diagnosed clinically either before or after the time of testing.
Figure 4. Influence of the Detection Threshold on the Spectrum of Detected Disease.
Disease develops in a cohort of patients, and some cases are subsequently detected by testing (stippled area). Only the rapidly progressive cases are detected by the standard test (left panel), whereas the advanced test, which has a lower detection threshold, detects all cases (right panel). These include the slowly progressive cases, which do not reach the clinical threshold before the patient dies of other causes.
Unless one can follow a cohort over time, there is no way ofaccurately estimating the probability that a subclinically detectedabnormality will naturally progress to an adverse outcome. Theprobability of such an outcome is mathematically constrained,however, by the prevalence of the detected abnormality. Theupper limit of this probability can be derived from reasoningthat dates to the 17th century, when vital statistics were firstcollected30. If the number of persons dying from a specificdisease is fixed, then the probability that a person with thedisease will eventually die from it is inversely related tothe prevalence of the disease. Therefore, given fixed mortalityrates, an increase in the detection of a potentially fatal diseasedecreases the likelihood that the disease detected in any oneperson will be fatal.
This constraint is particularly relevant to the detection ofsubclinical cancer. As the detection threshold of diagnosticimaging decreases to the level of pathological inspection, theupper bound of the probability of dying of detected cancer becomessmall. Table 1 shows how small these upper bounds would becomegiven the relatively constant probabilities of eventually dyingof breast,31 prostate,31 and thyroid32 cancer.
Apparent, Real, and Spurious Effects
From the perspective of the clinician reviewing case seriesin the literature or from personal experience, in which patientsare tracked from the time of diagnosis, the apparent effect(usually positive) of a new diagnostic test that lowers thedetection threshold is equal to the real effect (variable) plusthe spurious effects (always positive) of the lead-time andlength biases27:
Apparent effect = Real effect + Spurious effect.
The individual effects of the lead-time and length biases maybe impossible to disentangle and quantify. Two recent randomizedtrials demonstrate, however, that the combined effect of thesebiases -- the spurious effect -- can be the chief componentof the apparent effect, with the real effect being zero or negative.In the Malmo mammographic screening trial, women over the ageof 45 were randomly assigned to either regular mammography orno screening. The case fatality rate tracked from the time ofdiagnosis was 15 percent for breast cancers detected in thecontrol group and only 3 percent for breast cancers detectedat the time of screening33. This apparent reduction of 80 percentin mortality was entirely attributed to lead-time and lengthbiases, however, because there was no difference in mortalityfrom breast cancer tracked from the time of randomization. Inthe Czech lung cancer screening trial,34 men at high risk wererandomly assigned to either chest radiography twice a year orno screening. The five-year survival was 23 percent for lungcancers diagnosed in the study group and 0 percent for thosediagnosed in the control group. Again, this apparent improvementwas entirely attributed to lead-time and length biases, becausemortality from lung cancer was actually higher in the screenedgroup, indicating that the real effect of screening and subsequentintervention was negative.
It should be emphasized that these studies do not prove screeningmammography and chest radiography to be futile. Had the studyconditions -- the methods of testing, the modes of therapy,the linkages between interpretation of the film and treatment,or the targeted populations -- been different, the effects ofscreening might have been better (or worse). These studies do,however, demonstrate examples of screening that was ineffectiveand yet appeared to be highly effective from the ordinary clinicalperspective (that in which patients were tracked from the timeof diagnosis). These disparities between the real and apparenteffects of screening mammography and chest radiography are especiallydisturbing in the light of the fact that these are the onlyscreening strategies using diagnostic imaging that have everbeen evaluated in a randomized trial.
Lead-time and length biases pertain not only to changes thatlower the threshold for detecting disease, but also to new treatmentsthat are applied at the same time. Whether or not new therapyis more effective than old therapy, patients given diagnoseswith the use of lower detection thresholds will appear to havebetter outcomes than their historical controls because of thesebiases. Consequently, new therapies often appear promising35and could even replace older therapies that are more effectiveor have fewer side effects. Because the decision to treat orto investigate the need for treatment further is increasinglyinfluenced by the results of diagnostic imaging, lead-time andlength biases increasingly pervade medical practice.
The Cycle of Increasing Intervention
Misperceptions of disease prevalence and therapeutic effectivenesscan promote a cycle of increasing medical intervention, despitethe best intentions of all parties. The cycle usually beginswith some form of increased testing that lowers the thresholdfor detecting disease, such as technical improvement in imagingtests, more frequent testing, or closer scrutiny of the images.This immediately leads to a higher diagnostic yield of the diseaseand a spectrum of milder cases. These effects are almost alwaysinterpreted as indicating progress and provide immediate reinforcementfor the increased testing, despite the caveat that earlier detectionis a double-edged sword7. Unfortunately, the assessment of diagnosticaccuracy often contributes to the confusion, because the conventionalgold standards are surgical or pathological inspection ratherthan outcomes for patients. Tests that are more sensitive (ata fixed rate of false positive results) are accepted as better,even though they detect a broader spectrum of disease that includesa subgroup whose natural history and response to interventionare unknown. Consequently, the assessment legitimizes the useof the more sensitive imaging test and becomes a distractionfrom the fundamental question: How should patients with thisnewly detectable subclinical disease be treated?
Over time, the reported incidence and prevalence of the detecteddisease increase. In addition, because of lead-time and lengthbiases, the patients' outcomes usually appear to improve, whetheror not there is real improvement. Thus, the apparent increasein the number of detected cases and the apparent improvementin the outcome per case detected reinforce the initial increasein testing and treatment and encourage even more use in thefuture36. Unless they are interrupted by astute clinicians,testing and treatment may become even more frequent as longas there remain undetected cases of disease and new means ofdetecting them. This cycle pertains both to individual patients,who may get caught in a cascade of interventions,37 and to largepopulations of patients, who may be subjected to increasinglyintensive screening.
That this cycle has strong potential to occur in the absenceof any benefit is strongly supported by studies of screeningfor lung cancer. Higher rates of detection, resectability, andfive-year survival (from the time of diagnosis) in the screenedpopulations were reported during the early phases of the fourmost recent randomized trials38,39,40,41. Each of these trialseventually demonstrated, however, that screening did not reducemortality from lung cancer34,42,43,44. Despite these findingsin lung cancer, the same potentially misleading measures ofsuccess are being used today to justify aggressive testing andtreatment for other diseases, such as breast cancer. Increaseddetection of minimal cancers is almost always reported as progress,45and longer survival (from the time of diagnosis) is commonlyused to justify mammographic screening in women under 50 yearsof age,18 who constitute about half of all women screened inthe United States14.
Although we have focused on a few specific diseases in asymptomaticpatients, these misperceptions about disease prevalence andtherapeutic effectiveness are relevant to a wide range of conditions.Patients are now more likely to be given diagnoses of a varietyof conditions, such as gallstones,46 herniated disks,47 meniscaltears,48 deep venous thrombosis,49 and pulmonary embolism,50whether or not they are symptomatic, whether or not their diagnosesare responsible for the symptoms, and whether or not the patientsbenefited from medical intervention. In addition, advances inimaging can result in upward migration of the disease stage,regardless of its severity5. This greatly complicates the assessmentof therapeutic effectiveness because it makes stage-specifichistorical comparisons invalid. For example, Feinstein et al.5have demonstrated how the illusion of increased survival forpatients at all stages of lung cancer was created by the diagnostictechniques developed in the 1970s, which have since been replacedby even newer techniques. In fact, the benefit of more accuratestaging for cancer in general has not been directly demonstrated,and there is indirect evidence that the overall benefit hasbeen small or nonexistent51.
Conclusions
The past two decades have produced dramatic technological advancesin diagnostic imaging. Undoubtedly, many patients have benefitedfrom these advances, particularly those that permit the fasterand safer diagnosis of symptomatic, treatable disease. However,technological progress has also created confusion, which needsto be recognized and dealt with. Despite clinicians' best intentions,many patients may have been labeled with diseases they do notreally have, and many have been given therapy they do not reallyneed.
Much of the confusion resulting from advances in diagnosticimaging could be eliminated if diseases were categorized morecarefully according to size or anatomical extent. Data on prevalence,natural history, and therapeutic effectiveness should be explicitlyrelated to size. For the sake of consistency and precision,size should be recorded in standard dimensional units, suchas centimeters and cubic centimeters, as opposed to subjectiveimpressions and overly broad categorizations, such as presentversus absent. Stratification according to size and adjustmentfor the sensitivity of the detection method would make statisticson prevalence less dependent on the constantly changing methodsof detection. By minimizing lead-time and length biases, stratificationaccording to size would improve the reliability of historicalcomparisons used to assess the effectiveness of new tests andtreatment strategies. In addition, this approach would helpto define those newly detectable strata of disease for whichthe effectiveness of intervention is unknown, thereby providingthe opportunity for prospective trials of alternative interventions,including watchful waiting52,53,54. Finally, size stratificationof prevalence and effectiveness would foster evaluations ofdiagnostic accuracy that are similarly stratified. The accuracyof imaging tests should be measured with regard to the anatomicalextent of disease, not simply its presence or absence55. Thiswould facilitate the integration of data on prevalence, effectiveness,and diagnostic accuracy for the determination of probabilitiesof disease56 and clinical usefulness.
All these recommendations will take time to implement. Meanwhile,clinicians can heed the following advice. First, expect theincidence and prevalence of diseases detectable by imaging toincrease in the future. Some increases may be predictable onthe basis of autopsy studies or other intensive cross-sectionalprevalence studies in sample populations. Others may not beso predictable. All types of increases should be expected. Thetemptation to act aggressively must be tempered by the knowledgethat the natural history of a newly detectable disease is unknown.For many diseases, the overall mortality rate has not changed,and the increased prevalence means that the prognosis for anygiven patient with the diagnosis has actually improved.
Second, expect that advances in imaging will be accompaniedby apparent improvements in therapeutic outcomes. The effectof lead-time and length biases may be potent, and cliniciansshould be skeptical of reported improvements that are basedon historical and other comparisons not controlled for the anatomicalextent of disease and the rate of progression. Clinicians mayeven consider that the opposite may be true -- i.e., real outcomesmay have worsened because of more aggressive interventions.
Finally, consider maintaining conventional clinical thresholdsfor treating disease until well-controlled trials prove thebenefit of doing otherwise. This will require patience. A well-designedrandomized clinical trial takes time. So does accumulating enoughexperience on outcomes from nonexperimental methods that canbe used to control for the extent of disease and the rate ofprogression. From the point of view of both patients and policy,it is time well spent.
Dr. Welch was supported by the Department of Veterans AffairsCareer Development Program, Health Services Research and Development.
We are indebted to Drs. R. Peter Mogielnicki, Robert F. Nease,Jr., Harold M. Swartz, John H. Wasson, and John E. Wennbergfor their critique of this manuscript.
Source Information
From the Center for the Evaluative Clinical Sciences, Dartmouth Medical School, Hanover, N.H. (W.C.B., H.G.W.); the Department of Radiology, Dartmouth-Hitchcock Medical Center, Lebanon, N.H. (W.C.B.); and the Medical Service, Veterans Affairs Hospital, White River Junction, Vt. (H.G.W.).
Address reprint requests to Dr. Black at the Center for the Evaluative Clinical Sciences, Dartmouth Medical School, 318 Strausenburgh Hall, Hanover, NH 03755-3863.
References
Feldman AR, Kessler L, Myers MH, Naughton MD. The prevalence of cancer: estimates based on the Connecticut Tumor Registry. N Engl J Med 1986;315:1394-1397. [Abstract]
Montie JE, Wood DP Jr, Pontes E, Boyett JM, Levin HS. Adenocarcinoma of the prostate in cytoprostatectomy specimens removed for bladder cancer. Cancer 1989;63:381-385. [CrossRef][Medline]
Cooner WH, Mosley BR, Rutherford CL Jr, et al. Prostate cancer detection in a clinical urological practice by ultrasonography, digital rectal examination and prostate specific antigen. J Urol 1990;143:1146-1154. [Medline]
Lee F, Littrup PJ, Torp-Pedersen ST, et al. Prostate cancer: comparison of transrectal US and digital rectal examination for screening. Radiology 1988;168:389-394. [Free Full Text]
Feinstein AR, Sosin DM, Wells CK. The Will Rogers phenomenon: stage migration and new diagnostic techniques as a source of misleading statistics for survival in cancer. N Engl J Med 1985;312:1604-1608. [Abstract]
Feinleib M, Zelen M. Some pitfalls in the evaluation of screening programs. Arch Environ Health 1969;19:412-415. [Medline]
Cole P, Morrison AS. Basic issues in population screening for cancer. J Natl Cancer Inst 1980;64:1263-1272.
Islands, clusters, and percolation; diameter-number relations. In: Mandelbrot BB. The fractal geometry of nature. Rev. ed. New York: W.H. Freeman, 1983:116.
Lederle FA, Walker JM, Reinke DB. Selective screening for abdominal aortic aneurysms with physical examination and ultrasound. Arch Intern Med 1988;148:1753-1756. [Free Full Text]
Melton LJ III, Bickerstaff LK, Hollier LH, et al. Changing incidence of abdominal aortic aneurysms: a population-based study. Am J Epidemiol 1984;120:379-386. [Free Full Text]
Harach HR, Franssila KO, Wasenius VM. Occult papillary carcinoma of the thyroid: a "normal" finding in Finland: a systematic autopsy study. Cancer 1985;56:531-538. [CrossRef][Medline]
Hicks MJ, Davis JR, Layton JM, Present AJ. Sensitivity of mammography and physical examination of the breast for detecting breast cancer. JAMA 1979;242:2080-2083. [Free Full Text]
Sickles EA, Ominsky SH, Sollitto RA, Galvin HB, Monticciolo DL. Medical audit of a rapid-throughput mammography screening practice: methodology and results of 27,114 examinations. Radiology 1990;175:323-327. [Free Full Text]
Kopans DB. The positive predictive value of mammography. AJR Am J Roentgenol 1992;158:521-526. [Free Full Text]
Nielsen M, Thomsen JL, Primdahl S, Dyreborg U, Andersen JA. Breast cancer and atypia among young and middle-aged women: a study of 110 medicolegal autopsies. Br J Cancer 1987;56:814-819. [Medline]
Shapiro S. Evidence on screening for breast cancer from a randomized trial. Cancer 1977;39:Suppl:2772-2782. [CrossRef][Medline]
Stacey-Clear A, McCarthy KA, Hall DA, et al. Breast cancer survival among women under age 50: is mammography detrimental? Lancet 1992;340:991-994. [CrossRef][Medline]
Meyer JE, Eberlein TJ, Stomper PC, Sonnenfeld MR. Biopsy of occult breast lesions: analysis of 1261 abnormalities. JAMA 1990;263:2341-2343. [Free Full Text]
White E, Lee CY, Kristal AR. Evaluation of the increase in breast cancer incidence in relation to mammography use. J Natl Cancer Inst 1990;82:1524-1528.
Bottinger LE, Hallberg D, Schreeb TV. Renal carcinoma as an accidental finding. Acta Chir Scand 1964;127:158-161. [Medline]
Grinnell RS, Lane N. Benign and malignant adenomatous polyps and papillary adenomas of the colon and rectum: an analysis of 1,856 tumors in 1,335 patients. Int Abstr Surg 1958;106:519-38.
Hellsten S, Berge T, Wehlin L. Unrecognized renal cell carcinoma: clinical and diagnostic aspects. Scand J Urol Nephrol 1981;15:269-272. [Medline]
Reinig JW, Dwyer AJ, Miller DL, et al. Liver metastasis detection: comparative sensitivities of MR imaging and CT scanning. Radiology 1987;162:43-47. [Free Full Text]
Healy ME, Hesselink JR, Press GA, Middleton MS. Increased detection of intracranial metastases with intravenous Gd-DTPA. Radiology 1987;165:619-624. [Free Full Text]
Xipell JM. The incidence of benign renal nodules (a clinicopathologic study). J Urol 1971;106:503-506. [Medline]
Black WC, Ling A. Is earlier diagnosis really better? The misleading effects of lead time and length biases. AJR Am J Roentgenol 1990;155:625-630. [Free Full Text]
Morrison AS. Screening in chronic disease. Vol. 7 of Monographs in epidemiology and biostatistics. New York: Oxford University Press, 1985:22-3.
Miller AB. Screening and detection. In: Bland KI, Copeland EM, eds. The breast. Philadelphia: W.B. Saunders, 1991:419.
Graunt J. Foundations of vital statistics. In: Newman JR, ed. The world of mathematics. Vol. 3. Redmond, Wash.: Tempus Books, 1988:1399-413.
Seidman H, Mushinski MH, Gelb SK, Silverberg E. Probabilities of eventually developing or dying of cancer -- United States, 1985. CA Cancer J Clin 1985;35:36-56. [Free Full Text]
National Center for Health Statistics. Vital statistics of the United States, 1986. Vol. 2. Mortality. Part B. Washington, D.C.: Government Printing Office, 1986. (DHHS publication no. (PHS) 88-1114.)
Andersson I, Aspegren K, Janzon L, et al. Mammographic screening and mortality from breast cancer: the Malmo mammographic screening trial. BMJ 1988;297:943-948.
Kubik A, Parkin DM, Khlat M, Erban J, Polak J, Adamec M. Lack of benefit from semi-annual screening for cancer of the lung: follow-up report of a randomized controlled trial on a population of high-risk males in Czechoslovakia. Int J Cancer 1990;45:26-33. [Medline]
Sacks H, Chalmers TC, Smith H Jr. Randomized versus historical controls for clinical trials. Am J Med 1982;72:233-240. [CrossRef][Medline]
Smith SJ, Bosniak MA, Megibow AJ, Hulnick DH, Horii SC, Raghavendra BN. Renal cell carcinoma: earlier discovery and increased detection. Radiology 1989;170:699-703. [Free Full Text]
Mold JW, Stein HF. The cascade effect in the clinical care of patients. N Engl J Med 1986;314:512-514. [Medline]
Frost JK, Ball WC Jr, Levin ML, et al. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Johns Hopkins study. Am Rev Respir Dis 1984;130:549-554. [Medline]
Flehinger BJ, Melamed MR, Zaman MB, Heelan RT, Perchick WB, Martini N. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Memorial Sloan-Kettering study. Am Rev Respir Dis 1984;130:555-560. [Medline]
Fontana RS, Sanderson DR, Taylor WF, et al. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Mayo Clinic study. Am Rev Respir Dis 1984;130:561-565. [Medline]
Kubik A, Polak J. Lung cancer detection: results of a randomized prospective study in Czechoslovakia. Cancer 1986;57:2427-2437. [CrossRef][Medline]
Sanderson DR. Lung cancer screening: the Mayo study. Chest 1986;89:Suppl:324S-324S. [Medline]
Tockman MS. Survival and mortality from lung cancer in a screened population: the Johns Hopkins study. Chest 1986;89:Suppl:324S-325S.
Martini N. Results of the Memorial Sloan-Kettering study in screening for early lung cancer. Chest 1986;89:Suppl:325S-325S. [Free Full Text]
Linver MN, Paster SB, Rosenberg RD, Key CR, Stidley CA, King WV. Improvement in mammography interpretation skills in a community radiology practice after dedicated teaching courses: 2-year medical audit of 38,633 cases. Radiology 1992;184:39-43. [Erratum, Radiology 1992;184:878.] [Free Full Text]
Hopper KD, Landis JR, Meilstrup JW, McCauslin MA, Sechtin AG. The prevalence of asymptomatic gallstones in the general population. Invest Radiol 1991;26:939-945. [Medline]
Boden SD, Davis DO, Dina TS, Patronas NJ, Wiesel SW. Abnormal magnetic-resonance scans of the lumbar spine in asymptomatic subjects. J Bone Joint Surg [Am] 1990;72:403-8.
Kornick J, Trefelner E, McCarthy S, Lange R, Lynch K, Jokl P. Meniscal abnormalities in the asymptomatic population at MR imaging. Radiology 1990;177:463-465. [Free Full Text]
Vaccaro JP, Cronan JJ, Dorfman GS. Outcome analysis of patients with normal compression US examinations. Radiology 1990;175:645-649. [Free Full Text]
Huisman MV, Buller HR, ten Cate JW, et al. Unexpected high prevalence of silent pulmonary embolism in patients with deep venous thrombosis. Chest 1989;95:498-502. [Free Full Text]
Bailar JC III, Smith EM. Progress against cancer? N Engl J Med 1986;314:1226-1232. [Abstract]
Scott RAP, Ashton HA, Kay DN. Abdominal aortic aneurysm in 4237 screened patients: prevalence, development and management over 6 years. Br J Surg 1991;78:1122-1125. [Medline]
Sickles EA. Periodic mammographic follow-up of probably benign lesions: results in 3,184 consecutive cases. Radiology 1991;179:463-468. [Free Full Text]
Bosniak MA. The small ( 3.0 cm) renal parenchymal tumor: detection, diagnosis, and controversies. Radiology 1991;179:307-317. [Erratum, Radiology 1991;181:289.] [Free Full Text]
Diagnostic and spectral markers. In: Feinstein AR. Clinical epidemiology: the architecture of clinical research. Philadelphia: W.B. Saunders, 1985:597-631.
Black WC, Dwyer AJ. Local versus global measures of accuracy: an important distinction for diagnostic imaging. Med Decis Making 1990;10:226-273.
Enderling, H., Anderson, A. R.A., Chaplain, M. A.J., Beheshti, A., Hlatky, L., Hahnfeldt, P.
(2009). Paradoxical Dependencies of Tumor Dormancy and Progression on Basic Cell Kinetics. Cancer Res.
69: 8814-8821
[Abstract][Full Text]
Choyke, P. L.
(2009). Commentary on "Computed Tomography in the Diagnosis of Adrenal Disease" and "Nonfunctioning Adrenal Masses: Incidental Discovery on Computed Tomography". Am. J. Roentgenol.
192: 568-570
[Full Text]
Almog, N., Ma, L., Raychowdhury, R., Schwager, C., Erber, R., Short, S., Hlatky, L., Vajkoczy, P., Huber, P. E., Folkman, J., Abdollahi, A.
(2009). Transcriptional Switch of Dormant Tumors to Fast-Growing Angiogenic Phenotype. Cancer Res.
69: 836-844
[Abstract][Full Text]
Ma, J., Waxman, D. J.
(2008). Combination of antiangiogenesis with chemotherapy for more effective cancer treatment. Molecular Cancer Therapeutics
7: 3670-3684
[Abstract][Full Text]
Cao, Y., Langer, R.
(2008). A review of Judah Folkman's remarkable achievements in biomedicine. Proc. Natl. Acad. Sci. USA
105: 13203-13205
[Full Text]
LARSEN, T. K., MELLE, I., FRIIS, S., JOA, I., JOHANNESSEN, J. O., OPJORDSMOEN, S., SIMONSEN, E., VAGLUM, P., McGLASHAN, T. H.
(2007). One-year effect of changing duration of untreated psychosis in a single catchment area. Br. J. Psychiatry
191: s128-s132
[Abstract][Full Text]
Beliveau, R., Gingras, D.
(2007). Role of nutrition in preventing cancer. cfp
53: 1905-1911
[Abstract][Full Text]
Lin, E. Y., Pollard, J. W.
(2007). Tumor-Associated Macrophages Press the Angiogenic Switch in Breast Cancer. Cancer Res.
67: 5064-5066
[Abstract][Full Text]
Hollingsworth, J. M., Miller, D. C., Daignault, S., Hollenbeck, B. K.
(2006). Rising incidence of small renal masses: a need to reassess treatment effect.. JNCI J Natl Cancer Inst
98: 1331-1334
[Abstract][Full Text]
Almog, N., Henke, V., Flores, L., Hlatky, L., Kung, A. L., Wright, R. D., Berger, R., Hutchinson, L., Naumov, G. N., Bender, E., Akslen, L. A., Achilles, E.-G., Folkman, J.
(2006). Prolonged dormancy of human liposarcoma is associated with impaired tumor angiogenesis. FASEB J.
20: 947-949
[Abstract][Full Text]
Narazaki, M., Tosato, G.
(2006). Tumor cell populations differ in angiogenic activity: a model system for spontaneous angiogenic switch can tell us why.. JNCI J Natl Cancer Inst
98: 294-295
[Full Text]
Naumov, G. N., Bender, E., Zurakowski, D., Kang, S.-Y., Sampson, D., Flynn, E., Watnick, R. S., Straume, O., Akslen, L. A., Folkman, J., Almog, N.
(2006). A model of human tumor dormancy: an angiogenic switch from the nonangiogenic phenotype.. JNCI J Natl Cancer Inst
98: 316-325
[Abstract][Full Text]
Concato, J., Wells, C. K., Horwitz, R. I., Penson, D., Fincke, G., Berlowitz, D. R., Froehlich, G., Blake, D., Vickers, M. A., Gehr, G. A., Raheb, N. H., Sullivan, G., Peduzzi, P.
(2006). The Effectiveness of Screening for Prostate Cancer: A Nested Case-Control Study. Arch Intern Med
166: 38-43
[Abstract][Full Text]
Swensen, S. J., Jett, J. R., Hartman, T. E., Midthun, D. E., Mandrekar, S. J., Hillman, S. L., Sykes, A.-M., Aughenbaugh, G. L., Bungum, A. O., Allen, K. L.
(2005). CT Screening for Lung Cancer: Five-year Prospective Experience. Radiology
235: 259-265
[Abstract][Full Text]
McCaffery, K. J, Barratt, A. L
(2004). Assessing psychosocial/quality of life outcomes in screening: how do we do it better?. J. Epidemiol. Community Health
58: 968-970
[Full Text]
Wisnivesky, J. P., Yankelevitz, D., Henschke, C. I.
(2004). The Effect of Tumor Size on Curability of Stage I Non-small Cell Lung Cancers. Chest
126: 761-765
[Abstract][Full Text]
Barratt, A., Trevena, L., Davey, H. M, McCaffery, K.
(2004). Use of decision aids to support informed choices about screening. BMJ
329: 507-510
[Full Text]
Penson, R. T., Daniels, K. J., Lynch, T. J. Jr.
(2004). Too Old to Care?. The Oncologist
9: 343-352
[Abstract][Full Text]
Hsu, R. M., Chan, D. Y., Siegelman, S. S.
(2004). Small Renal Cell Carcinomas: Correlation of Size with Tumor Stage, Nuclear Grade, and Histologic Subtype. Am. J. Roentgenol.
182: 551-557
[Abstract][Full Text]
Sunshine, J. H., Applegate, K. E.
(2004). Technology Assessment for Radiologists. Radiology
230: 309-314
[Abstract][Full Text]
Schwartz, L. M., Woloshin, S., Fowler, F. J. Jr, Welch, H. G.
(2004). Enthusiasm for Cancer Screening in the United States. JAMA
291: 71-78
[Abstract][Full Text]
Kaplan, R. M., Ganiats, T. G., Frosch, D. L.
(2004). Diagnostic and Treatment Decisions in US Healthcare. J Health Psychol
9: 29-40
[Abstract]
Fenton, J. J., Deyo, R. A.
(2003). Patient Self-Referral for Radiologic Screening Tests: Clinical and Ethical Concerns. J Am Board Fam Med
16: 494-501
[Abstract][Full Text]
Clarfield, A. M.
(2003). The Decreasing Prevalence of Reversible Dementias: An Updated Meta-analysis. Arch Intern Med
163: 2219-2229
[Abstract][Full Text]
Mechanic, D.
(2003). Is The Prevalence Of Mental Disorders A Good Measure Of The Need For Services?. Health Aff (Millwood)
22: 8-20
[Abstract][Full Text]
Wisnivesky, J. P., Mushlin, A. I., Sicherman, N., Henschke, C.
(2003). The Cost-Effectiveness of Low-Dose CT Screening for Lung Cancer: Preliminary Results of Baseline Screening. Chest
124: 614-621
[Abstract][Full Text]
Al-Shahi, R., Bhattacharya, J. J., Currie, D. G., Papanastassiou, V., Ritchie, V., Roberts, R. C., Sellar, R. J., Warlow, C. P.
(2003). Prospective, Population-Based Detection of Intracranial Vascular Malformations in Adults: The Scottish Intracranial Vascular Malformation Study (SIVMS). Stroke
34: 1163-1169
[Abstract][Full Text]
Swensen, S. J., Jett, J. R., Hartman, T. E., Midthun, D. E., Sloan, J. A., Sykes, A.-M., Aughenbaugh, G. L., Clemens, M. A.
(2003). Lung Cancer Screening with CT: Mayo Clinic Experience. Radiology
226: 756-761
[Abstract][Full Text]
Blum, J., Rinne, N., Gould, M. K., Chirikos, T. N., Hazelton, T., Tockman, M., Clark, R.
(2003). Lung Cancer Screening Debate. Chest
123: 653-655
[Full Text]
Hoberman, A., Charron, M., Hickey, R. W., Baskin, M., Kearney, D. H., Wald, E. R.
(2003). Imaging Studies after a First Febrile Urinary Tract Infection in Young Children. NEJM
348: 195-202
[Abstract][Full Text]
Gottlieb, R. H., La, T. C., Erturk, E. N., Sotack, J. L., Voci, S. L., Holloway, R. G., Syed, L., Mikityansky, I., Tirkes, A. T., Elmarzouky, R., Zwemer, F. L., Joseph, J. V., Davis, D., DiGrazio, W. J., Messing, E. M.
(2002). CT in Detecting Urinary Tract Calculi: Influence on Patient Imaging and Clinical Outcomes. Radiology
225: 441-449
[Abstract][Full Text]
Black, W. C., Brant-Zawadzki, M. N.
(2002). Invited Commentary * Author's Response. RadioGraphics
22: 1536-1539
[Full Text]
Herman, C. R., Gill, H. K., Eng, J., Fajardo, L. L.
(2002). Screening for Preclinical Disease: Test and Disease Characteristics. Am. J. Roentgenol.
179: 825-831
[Full Text]
Swensen, S. J.
(2002). CT Screening for Lung Cancer. Am. J. Roentgenol.
179: 833-836
[Full Text]
UDAGAWA, T., FERNANDEZ, A., ACHILLES, E.-G., FOLKMAN, J., D'AMATO, R. J.
(2002). Persistence of microscopic human cancers in mice: alterations in the angiogenic balance accompanies loss of tumor dormancy. FASEB J.
16: 1361-1370
[Abstract][Full Text]
Blumenthal, H. T.
(2002). Guest Editorial: The Autopsy in Gerontological Research: A Retrospective. Journals of Gerontology Series A: Biological Sciences and Medical Sciences
57: M433-437
[Full Text]
The Evidence-Based Radiology Working Group, T.
(2001). Evidence-based Radiology: A New Approach to the Practice of Radiology. Radiology
220: 566-575
[Abstract][Full Text]
Hobby, J. L., Dixon, A. K., Bearcroft, P. W. P., Tom, B. D. M., Lomas, D. J., Rushton, N., Matthewson, M. H.
(2001). MR Imaging of the Wrist: Effect on Clinical Diagnosis and Patient Care. Radiology
220: 589-593
[Abstract][Full Text]
Obuchowski, N. A., Graham, R. J., Baker, M. E., Powell, K. A.
(2001). Ten Criteria for Effective Screening: Their Application to Multislice CT Screening for Pulmonary and Colorectal Cancers. Am. J. Roentgenol.
176: 1357-1362
[Full Text]
Berlin, L.
(2001). Dot Size, Lead Time, Fallibility, and Impact on Survival: Continuing Controversies in Mammography. Am. J. Roentgenol.
176: 1123-1130
[Full Text]
Jarvik, J. G.
(2001). The Research Framework. Am. J. Roentgenol.
176: 873-878
[Full Text]
Conio, M, Cameron, A J, Romero, Y, Branch, C D, Schleck, C D, Burgart, L J, Zinsmeister, A R, Melton, L J III, Locke, G R III
(2001). Secular trends in the epidemiology and outcome of Barrett's oesophagus in Olmsted County, Minnesota. Gut
48: 304-309
[Abstract][Full Text]
Bolondi, L, Sofia, S, Siringo, S, Gaiani, S, Casali, A, Zironi, G, Piscaglia, F, Gramantieri, L, Zanetti, M, Sherman, M
(2001). Surveillance programme of cirrhotic patients for early diagnosis and treatment of hepatocellular carcinoma: a cost effectiveness analysis. Gut
48: 251-259
[Abstract][Full Text]
Godby, C. J., Peticolas, A., Carter, J. M, Ghebrehewet, S., Woloshin, S., Schwartz, L., Welch, H G., Sox, H. C
(2000). Women's attitudes to false positive mammography results. BMJ
321: 1409-1409
[Full Text]
Black, W. C.
(2000). Anatomic Extent of Disease: A Critical Variable in Reports of Diagnostic Accuracy. Radiology
217: 319-320
[Full Text]
Black, W. C.
(2000). Overdiagnosis: An Underrecognized Cause of Confusion and Harm in Cancer Screening. JNCI J Natl Cancer Inst
92: 1280-1282
[Full Text]
Welch, H. G., Schwartz, L. M., Woloshin, S.
(2000). Are Increasing 5-Year Survival Rates Evidence of Success Against Cancer?. JAMA
283: 2975-2978
[Abstract][Full Text]
Moody-Ayers, S. Y., Wells, C. K., Feinstein, A. R.
(2000). "Benign" Tumors and "Early Detection" in Mammography-Screened Patients of a Natural Cohort With Breast Cancer. Arch Intern Med
160: 1109-1115
[Abstract][Full Text]
Medina, L. S.
(1999). Study Design and Analysis in Neuroradiology: A Practical Approach. Am. J. Neuroradiol.
20: 1584-1596
[Full Text]
Hoberman, A., Wald, E. R., Hickey, R. W., Baskin, M., Charron, M., Majd, M., Kearney, D. H., Reynolds, E. A., Ruley, J., Janosky, J. E.
(1999). Oral Versus Initial Intravenous Therapy for Urinary Tract Infections in Young Febrile Children. Pediatrics
104: 79-86
[Abstract][Full Text]
Black, W. C.
(1999). RESPONSES - Re: Trends in Reported Incidence of Primary Malignant Brain Tumors in Children in the United States. JNCI J Natl Cancer Inst
91: 648a-649a
[Full Text]
Fisher, E. S., Welch, H. G.
(1999). Avoiding the Unintended Consequences of Growth in Medical Care: How Might More Be Worse?. JAMA
281: 446-453
[Abstract][Full Text]
Schwartz, L. M., Woloshin, S., Welch, H. G.
(1999). Risk Communication in Clinical Practice: Putting Cancer in Context. J Natl Cancer Inst Monogr
1999: 124-133
[Abstract][Full Text]
Goodman, D. C., Little, G. A.
(1998). General Pediatrics, Neonatology, and the Law of Diminishing Returns. Pediatrics
102: 396-398
[Full Text]
King, J. T. Jr., Justice, A. C., Aron, D. C.
(1997). Management of Incidental Pituitary Microadenomas: A Cost-Effectiveness Analysis. J. Clin. Endocrinol. Metab.
82: 3625-3632
[Abstract][Full Text]
Black, W. C., Nease, R. F., Welch, H. G.
(1997). Determining Transition Probabilities from Mortality Rates and Autopsy Findings. Med Decis Making
17: 87-93
[Abstract]
Collins, M. M., Barry, M. J.
(1996). Controversies in Prostate Cancer Screening: Analogies to the Early Lung Cancer Screening Debate. JAMA
276: 1976-1979
[Abstract]
Welch, H. G.
(1996). Questions about the Value of Early Intervention. NEJM
334: 1472-1473
[Full Text]
Wennberg, D. E., Kellett, M. A., Dickens, J. D. Jr, Malenka, D. J., Keilson, L. M., Keller, R. B.
(1996). The Association Between Local Diagnostic Testing Intensity and Invasive Cardiac Procedures. JAMA
275: 1161-1164
[Abstract]
Verrilli, D., Welch, H. G.
(1996). The Impact of Diagnostic Testing on Therapeutic Interventions. JAMA
275: 1189-1191
[Abstract]
Bauer, T. W.
(1996). Editorial - Identification of Orthopaedic Wear Debris. JBJS
78: 479-81
[Full Text]
Albertsen, P. C., Fryback, D. G., Storer, B. E., Kolon, T. F., Fine, J.
(1995). Long-term Survival Among Men With Conservatively Treated Localized Prostate Cancer. JAMA
274: 626-631
[Abstract]
Eisenberg, L.
(1995). Medicine--Molecular, Monetary, or More Than Both?. JAMA
274: 331-334
[Abstract]
Welch, H. G., Black, W. C., Fisher, E. S.
(1995). Case-Mix Adjustment: Making Bad Apples Look Good. JAMA
273: 772-773
[Abstract]
Potosky, A. L., Miller, B. A., Albertsen, P. C., Kramer, B. S.
(1995). The Role of Increasing Detection in the Rising Incidence of Prostate Cancer. JAMA
273: 548-552
[Abstract]
Franklin, C., Friedman, Y., Wong, T., Hu, T.-C.
(1995). Improving Long-term Prognosis for Survivors of Mechanical Ventilation in Patients With AIDS With PCP and Acute Respiratory Failure: Five-Year Follow-up of Intensive Care Unit Discharges. Arch Intern Med
155: 91-95
[Abstract]
Krahn, M. D., Mahoney, J. E., Eckman, M. H., Trachtenberg, J., Pauker, S. G., Detsky, A. S.
(1994). Screening for Prostate Cancer: A Decision Analytic View. JAMA
272: 773-780
[Abstract]
Deyo, R. A.
(1994). Magnetic Resonance Imaging of the Lumbar Spine -- Terrific Test or Tar Baby?. NEJM
331: 115-116
[Full Text]
Fleming, C.
(1993). Treatment Strategies for Prostate Cancer-Reply. JAMA
270: 1693-1693
[Abstract]