|
Background Breast cancers contain a minority population of cancer cells characterized by CD44 expression but low or undetectable levels of CD24 (CD44+CD24–/low) that have higher tumorigenic capacity than other subtypes of cancer cells.
Methods We compared the gene-expression profile of CD44+CD24–/low tumorigenic breast-cancer cells with that of normal breast epithelium. Differentially expressed genes were used to generate a 186-gene "invasiveness" gene signature (IGS), which was evaluated for its association with overall survival and metastasis-free survival in patients with breast cancer or other types of cancer.
Results There was a significant association between the IGS and both overall and metastasis-free survival (P<0.001, for both) in patients with breast cancer, which was independent of established clinical and pathological variables. When combined with the prognostic criteria of the National Institutes of Health, the IGS was used to stratify patients with high-risk early breast cancer into prognostic categories (good or poor); among patients with a good prognosis, the 10-year rate of metastasis-free survival was 81%, and among those with a poor prognosis, it was 57%. The IGS was also associated with the prognosis in medulloblastoma (P=0.004), lung cancer (P=0.03), and prostate cancer (P=0.01). The prognostic power of the IGS was increased when combined with the wound-response (WR) signature.
Conclusions The IGS is strongly associated with metastasis-free survival and overall survival for four different types of tumors. This genetic signature of tumorigenic breast-cancer cells was even more strongly associated with clinical outcomes when combined with the WR signature in breast cancer.
We used gene-expression profiling of tumorigenic breast-cancer cells to develop prognostic tools to assess survival in patients with breast cancer. Gene-expression profiling has been used to predict clinical outcomes in breast cancer.5,6,7,8,9,10,11 We identified 186 genes that are differentially expressed in tumorigenic breast-cancer cells and normal breast epithelium and generated a gene signature that differs substantially from previously reported gene signatures in breast cancer.5,6,12 Since this signature indicated the likelihood of a tumor to metastasize — a process involving invasion — we called it the "invasiveness" gene signature (IGS). An important finding was that the IGS was associated with the risk of death and metastasis not only in breast cancer but also in lung cancer, prostate cancer, and medulloblastoma. This finding suggests that the IGS represents general biologic features shared by several different types of tumor.
Methods
The methods used to isolate tumorigenic breast-cancer cells and normal breast epithelium are described in detail in the Supplementary Appendix, available with the full text of this article at www.nejm.org. Those used for RNA amplification, microarray analysis, real-time polymerase chain reaction, normalization of microarray data, and generation of the IGS are also described in the Supplementary Appendix.
The flow cytometric and molecular biologic experiments were performed by an academic author, who analyzed the results. An employee of the sponsor analyzed the microarray data, identified the IGS, and performed the statistical analysis. Both the academic authors and the employees of the sponsor had access to and held the microarray data and contributed to the study design, the interpretation of the results, and the writing of the manuscript. Dr. Clarke designed the study and contributed to the analysis of the results and the writing of the manuscript; he made the decision to publish the manuscript and vouches for the completeness of the data and for the analysis.
Patient Data
Patient information, including both clinical data and gene-expression data, was obtained from two independent sources: the Netherlands Cancer Institute database, which included data for 295 consecutive patients with early breast cancer10,12 (available at http://www.ncbi.nlm.nih.gov/geo/) and the Erasmus Medical Center database, which included data for 286 patients with lymph-node–negative breast cancer7,8 (available at http://microarray-pubs.stanford.edu/wound_NKI). (A description of the transformation and analysis of the patient data is in the Supplementary Appendix.)
Statistical Analysis
Average linkage clustering was performed with the open-source Cluster software (version 3.0),13 and the results were visualized with the use of the open-source TreeView software (version 1.0.13).14 A Pearson correlation coefficient for the correlation between the expression data for each patient and the average expression of the gene signature (the average for the six samples of tumorigenic breast-cancer cells used to generate the IGS) was calculated with the use of the expression level of each of the 186 genes (or a subgroup of the genes available in a specific database). Patients were grouped according to the correlation values, with 0 or the average correlation coefficient used as the threshold. For the breast-cancer databases, we chose the calculated Pearson correlation coefficient of 0 as the threshold to separate patients into two groups, one with gene-expression profiles that were related to the IGS (correlation coefficient, >0) and the other with profiles unrelated to the IGS (correlation coefficient,
0).
Because the number of patients with medulloblastoma, lung cancer, or prostate cancer for whom data were available was small, we divided the patients into two groups of approximately the same size, using the average correlation coefficient as the threshold. Overall survival and metastasis-free survival in the two groups were analyzed and compared by the Kaplan–Meier method. Differences in survival were tested for statistical significance by the log-rank test with the use of GraphPad Prism software, version 4.03 (GraphPad Software). Univariate and multivariate analyses of survival with the use of the Cox proportional-hazards method were performed with the use of the open-source R software, version 2.1.0 (www.r-project.org).
Results
Generation of the IGS
We previously identified tumorigenic breast-cancer cells on the basis of their expression of the cell-surface proteins CD44 and CD24, which can be used to distinguish between tumorigenic and nontumorigenic breast-cancer cells.1 In this study, we defined normal breast-epithelium cells as those that were positive for epithelial-specific antigen or CD10 (cell-surface proteins that mark luminal epithelial or myoepithelial mammary cells). On microarray analysis, we searched for genes differentially expressed between tumorigenic breast-cancer cells isolated from six different breast cancers (see Table 1 in the Supplementary Appendix) and normal breast-epithelium cells derived from three reduction mammoplasties. In five samples of breast cancers with sufficient numbers of cells for analysis, we confirmed that the CD44+CD24–/low lineage cells, but not the other cancer cells, formed tumors in the mouse tumor assay (see Table 1 in the Supplementary Appendix). A set of 186 genes were selected on the basis of a significant difference, by a factor of 2, in the expression levels between tumorigenic breast-cancer cells and normal breast epithelium. A level of significance of 0.005 by Student's t-test was chosen for the comparison (for a description of the procedure, see the Methods section, Figure 1, and Table 2, all in the Supplementary Appendix).
Table 1 presents the classification of the 186 genes in the IGS (for a detailed annotation of the IGS, see Gene Annotation in the Supplementary Appendix). The list includes the genes involved in the nuclear factor-
B pathway, the RAS–mitogen-activated protein kinase pathway, and epigenetic control of gene expression (see the Results section in the Supplementary Appendix). These pathways have been shown to play critical roles in tumorigenesis, cell differentiation, and development. We compared the IGS signature with previously reported gene signatures in breast cancer.15 Of the 186 genes in the IGS, only 6 overlapped with those in the wound-response (WR) signature,12 and none of the genes in the IGS were found in other signatures, suggesting that the IGS is a new signature.
|
A Pearson correlation coefficient for the correlation between the expression value of the genes in the sample obtained from each patient in the database and that of the genes in the IGS was calculated, and the correlation coefficient was tested for its association with clinical outcomes in a survival analysis with the use of a Cox proportional-hazards model. With the use of the Pearson correlation coefficient, patients were divided into two groups: one group with gene-expression profiles that were related positively to the IGS (correlation coefficient, >0) and the other group with gene-expression profiles that were related negatively to the IGS (correlation coefficient,
0). A positive correlation was associated with reduced metastasis-free survival and reduced overall survival; the univariate hazard ratio for metastasis or death was 1.3 (P<0.001) and 1.4 (P<0.001), respectively, for each increase of 0.1 in the correlation coefficient (Table 2). The estimated 10-year rates of overall survival and metastasis-free survival were 98% and 82%, respectively, among patients in the group with a correlation coefficient of 0 or less, and 62% and 54%, respectively, in the group with a correlation coefficient greater than 0 (Figure 1).
|
|
0) and with relapse rates of 43% (96 of 224 patients) and 16% (10 of 62 patients), respectively (P=0.001 by the chi-square test). The IGS and Clinical and Pathological Criteria
To determine whether the association between the IGS and the clinical outcome in patients with breast cancer was independent of standard clinical and pathological criteria, the 295 patients with breast cancer identified in the Netherlands Cancer Institute database were stratified according to tumor size, lymph-node status, histologic grade, and estrogen-receptor status. A univariate Cox proportional-hazards model was used to evaluate the association of the IGS with the clinical outcome in each category (Table 3). The association between the IGS and the risk of death or metastasis was significant regardless of tumor size or lymph-node status (P<0.05). Furthermore, the IGS could be used to stratify tumors with intermediate differentiation into good and poor prognostic subcategories (hazard ratio for a poor prognosis, 1.6; 95% confidence interval [CI], 1.2 to 2.1; P<0.001), but was less useful for stratifying tumors with poor and good differentiation (P>0.05). The association with the outcome was significant for tumors that were positive for estrogen receptor but not for those that were negative for estrogen receptor. In a multivariate Cox proportional-hazards analysis (Table 4), the association between the IGS and death or metastasis was independent of tumor differentiation, patients' age, and estrogen-receptor status (P<0.05).
|
|
The use of adjuvant chemotherapy in breast cancer is based on the prognostic criteria of the NIH (http://consensus.nih.gov) and the St. Gallen criteria.16 We identified two groups of patients in the database of the Netherlands Cancer Institute who were at high risk for death or metastasis (284 identified according to the NIH prognostic criteria and 273 according to the St. Gallen criteria) and who would ordinarily be treated with chemotherapy. With the use of the Pearson correlation coefficient and the Kaplan–Meier method, the 284 patients identified on the basis of the NIH criteria were stratified according to the IGS. Of those patients, 21% (60 patients) had tumors with gene-expression profiles that correlated negatively with the IGS (correlation coefficient,
0); among these 60 patients, metastases developed in 13% (8 patients), and the 10-year rate of metastasis-free survival was 81% (see Figure 2A and 2B in the Supplementary Appendix). By contrast, the gene-expression profile was strongly correlated with the IGS (correlation coefficient, >0) in 224 of the 284 patients; among these patients, metastatic disease developed in 41% (91 patients), and the 10-year rate of metastasis-free survival was 57% (P<0.001). When the same analysis was performed for the 273 high-risk patients identified on the basis of the St. Gallen criteria, the results were almost identical (P<0.001) (see Figure 2C and 2D in the Supplementary Appendix).
Of the 295 patients with breast cancer in the Netherlands Cancer Institute database, 185 never received adjuvant chemotherapy. In this subgroup we evaluated the IGS independently of the confounding effects of adjuvant treatment. With the correlation threshold set at 0, the IGS signature divided the 185 patients into two groups: 42 patients with IGS-negative tumors (correlation coefficient,
0) and 143 patients with IGS-positive tumors (correlation coefficient, >0). Kaplan–Meier survival curves showed that the relapse rates at 10 years differed significantly in the two groups: 12% (5 patients) and 43% (62 patients), respectively (P<0.001).
Combined Use of the IGS and WR Gene Signature
The 512-gene WR signature is derived from transcriptional profiling of serum-stimulated fibroblasts. This signature correlates with overall survival and metastasis-free survival in patients with breast cancer.8 The IGS and the WR signature are representations of different biologic phenomena and are based on nonoverlapping lists of genes. We compared the signatures and determined whether the result would be better with the two combined than with either signature alone. For this purpose, we used data on 262 patients in the Netherlands Cancer Institute database. When each signature was used alone, the significance of the association with the outcome was similar (see the Results section, Figure 3, and Table 3a, all in the Supplementary Appendix). In a multivariate Cox proportional-hazards analysis the IGS and the WR signature performed independently, with a hazard ratio for metastasis of 1.3 (95% CI, 1.1 to 1.5; P=0.001) and 1.2 (95% CI, 1.1 to 1.4; P=0.003), respectively (see Table 3b in the Supplementary Appendix). This result suggested that the two signatures combined would perform better than either alone. Using the full data on the 295 patients from the Netherlands Cancer Institute database and both the IGS and the WR signature, we found that after 10 years of follow-up, metastatic disease had developed in 20%, 31%, and 53% of patients with tumors that were negative for both signatures, positive for one signature, or positive for both signatures, respectively (Figure 2). The results were almost identical when the analysis was performed for the subgroup of patients at high risk for relapse, according to the prognostic criteria of both the NIH and St. Gallen (see Figure 4 in the Supplementary Appendix).
|
To evaluate the contribution to the prognostic capacity of the IGS of transcriptional features that are specific to tumorigenic breast-cancer cells, as compared with nontumorigenic breast-cancer cells, we compared the transcriptional profiles of autologous pairs of tumorigenic and nontumorigenic breast-cancer cell populations that had been directly isolated from the three primary samples (see the Methods section and Table 1 in the Supplementary Appendix). On the basis of the 186 genes in the IGS, gene-expression profiles derived from tumorigenic breast-cancer cells correlated significantly with overall survival and metastasis-free survival at 10 years (P<0.001 for both comparisons), as compared with expression profiles from nontumorigenic breast-cancer cells (P=0.3 for overall survival, and P=0.06 for metastasis-free survival) (see Figure 5 in the Supplementary Appendix). To validate this finding, we compared the gene-expression profiles of both tumorigenic and paired primary nontumorigenic breast-cancer cells with those of normal breast epithelium and evaluated the genes differentially expressed (tumorigenic breast-cancer cells vs. normal breast epithelium, and nontumorigenic breast-cancer cells vs. normal breast epithelium). The gene-expression profiles generated from tumorigenic breast-cancer cells were more closely associated with the outcome than were the gene-expression profiles generated from nontumorigenic breast-cancer cells (P<0.05) (see Figure 6 in the Supplementary Appendix).
On the basis of these observations, we investigated whether the IGS could be applied to other types of cancer by testing the association of the IGS with overall survival and relapse-free survival in patients with lung cancer, prostate cancer, or medulloblastoma. With the use of the Pearson correlation analysis, data on 62 patients with lung cancer, 60 with medulloblastoma, and 21 with prostate cancer were analyzed, and patients were categorized according to tumors that were IGS-negative (a correlation coefficient that was less than or equal to the average value) and tumors that were IGS-positive (a correlation coefficient that was greater than the average value). With all three types of cancer, a gene-expression profile that was closely correlated with the IGS signature was associated with rates of overall or relapse-free survival that were less than 50% at 5 years (Figure 3A). Overall survival or relapse-free survival was higher among patients with a tumor that was negatively correlated with the IGS (>60% among those with lung cancer and >80% among those with medulloblastoma or prostate cancer; P=0.03, P=0.004, and P=0.001, respectively) (Figure 3B and 3C).
|
We compared the gene-expression profiles of CD44+CD24–/low tumorigenic breast-cancer cells and normal breast epithelium, using gene-expression microarrays. We identified a unique list of 186 differentially expressed genes and defined a gene signature that we named IGS. The association between the IGS and both overall survival and tumor recurrence was significant not only in patients with breast cancer but also in those with lung cancer, prostate cancer, or medulloblastoma. The contribution of the tumorigenic breast-cancer–cell component of the IGS was essential to its association with the clinical outcome. These results suggest the clinical relevance of the CD44+CD24–/low tumorigenic subclass of breast-cancer cells.
There are several possible explanations for the association between the IGS and clinical outcomes in four different types of tumors. First, the signature may detect the genetic fingerprints of invasion pathways that are activated in aggressive tumors. Second, since the signature was derived from cells that behave like stem cells in mouse xenograft assays, it may detect an increased number of "cancer stem cells" in tumors.1,2,3,4 Tumors with a large number of cancer stem cells may be more likely to metastasize than tumors with a small number of cancer stem cells. The observation that the association between the IGS and the clinical outcome can be improved when the IGS is combined with the 512-gene WR signature is consistent with a model in which self-renewing cancer stem cells represent the seeds of cancer and the tumor microenvironment the soil that promotes the growth of those seeds.17 Third, the IGS may detect transcriptional profiles associated with mutations that arrest cells in an immature state of differentiation and function as markers of more aggressive tumors.
In summary, we found that with the use of the IGS, patients with early breast cancer who are at high risk for metastasis or death can be stratified into two groups with substantially different 10-year relapse rates (13% vs. 41%), and more than 90% of patients in whom metastatic breast cancer develops can be identified. There was an association between the IGS and the clinical outcome in patients who had breast tumors with intermediate-grade differentiation, a group whose prognosis is difficult to assess. Further validation and refinement of the IGS by characterization of the gene subclass that acts synergistically with the WR signature or other gene signatures will help to establish and exploit the full clinical value of the IGS.
Supported by grants from the National Cancer Institute (T32 CA009357 to Dr. Chen, and CA104987 and CA126524 to Dr. Clarke), the Breast Cancer Research Foundation, the Morton Foundation, and the Virginia and D.K. Ludwig Foundation (to Dr. Clarke), and the Fondazione Italiana per la Ricerca sul Cancro and the California Institute for Regenerative Medicine (to Dr. Dalerba).
Drs. Wang, Gurney, Hoey, and Lewicki report being employees of Oncomed Pharmaceuticals, a biotechnology company that has applied for patents related to the gene signature described in this study; and Dr. Clarke, being a paid member of the advisory board of Oncomed Pharmaceuticals and owning stock in the company. No other potential conflict of interest relevant to this article was reported.
We thank T. Iwashita for discussions of the RNA amplification protocol; D. Adams, M. White, and A.-M. Des Lauriers for help with cell sorting; J. Washburn and J. MacDonald for help with the microarray analysis; and D. Renner-Chuey for help with collection of the tumor samples.
Source Information
From the Departments of Internal Medicine (R.L., G.Y.C., P.D., M.F.C.) and Statistics (K.S.), University of Michigan, Ann Arbor; Oncomed Pharmaceuticals, Mountain View, CA (X.W., A.G., T.H., J.L.); the Stanford Institute for Stem Cell Biology and Regenerative Medicine, Palo Alto, CA (P.D., M.F.C.); and the Department of Genetics, Stanford University School of Medicine, Stanford, CA (G.S.).
Drs. Liu and Wang contributed equally to this article, and Drs. Chen and Dalerba contributed equally to this article.
References
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Related Letters:
A Gene Signature in Breast Cancer
Bertucci F., Cervera N., Birnbaum D., Wennmalm K., Miller L. D., Bergh J., Woodward W. A., Lucci A., Cristofanilli M., the Advanced Research Center for Micrometastatic Disease , Clarke M. F., Liu R., Wang X.
Extract |
Full Text |
PDF
N Engl J Med 2007;
356:1887-1888, May 3, 2007.
Correspondence
This article has been cited by other articles:
HOME | SUBSCRIBE | SEARCH | CURRENT ISSUE | PAST ISSUES | COLLECTIONS | PRIVACY | TERMS OF USE | HELP | beta.nejm.org Comments and questions? Please contact us. The New England Journal of Medicine is owned, published, and copyrighted © 2009 Massachusetts Medical Society. All rights reserved. |