The Advent of Genomic Medicine
The recent completion of the draft sequence of the human genome2,3 and related developments have increased interest in genetics, but confusion remains among health professionals and the public about the role of genetic information in medical practice. Inaccurate beliefs about genetics persist, including the view that in the past it had no effect on the practice of medicine and that its influence today is pervasive. In fact, for decades knowledge of genetics has had a large role in the health care of a few patients and a small role in the health care of many. We have recently entered a transition period in which specific genetic knowledge is becoming critical to the delivery of effective health care for everyone.
If genetics has been misunderstood, genomics is even more mysterious. What, exactly, is the difference? Genetics is the study of single genes and their effects. "Genomics,"4 a term coined only 15 years ago, is the study not just of single genes, but of the functions and interactions of all the genes in the genome. Genomics has a broader and more ambitious reach than does genetics. The science of genomics rests on direct experimental access to the entire genome and applies to common conditions, such as breast cancer5 and colorectal cancer,6 human immunodeficiency virus (HIV) infection,7 tuberculosis,8 Parkinson's disease,9 and Alzheimer's disease.10 These common disorders are also all due to the interactions of multiple genes and environmental factors. They are thus known as multifactorial disorders. Genetic variations in these disorders may have a protective or a pathologic role in the expression of diseases.
The role of genomics in health care is in part highlighted by the decreasing effect of certain environmental factors, such as infectious agents, on the burden of disease. Genomics also contributes to the understanding of such important infectious diseases as the acquired immunodeficiency syndrome (AIDS)7 and tuberculosis.8
The following two case vignettes illustrate how knowledge of genomics may lead to better management of common medical conditions.
Thirty-four-year-old Kathleen becomes pregnant and sees a new physician for her first prenatal visit. Her medical history is remarkable for an episode of deep venous thrombosis five years earlier while she was taking oral contraceptives; her mother had had deep venous thrombosis when pregnant with Kathleen. Her physician suspects that Kathleen has a hereditary thrombophilia and obtains blood tests to screen for a genetic predisposition to thrombosis. Kathleen proves to be among the approximately 4 percent of Americans who are heterozygous for a mutation in factor V known as factor V Leiden that increases the risk of thrombotic events. On the basis of this knowledge and her history of possibly estrogen-related thromboembolism, she is treated with prophylactic subcutaneous heparin for the balance of her pregnancy. She remains asymptomatic and delivers a healthy, term infant.
Four-year-old John has acute lymphoblastic leukemia and tolerates induction and consolidation chemotherapy well, with minimal side effects. As a key part of his maintenance-treatment protocol, he begins to receive oral mercaptopurine daily, but because a genetic test shows that John is homozygous for a mutation in the gene that encodes thiopurine S-methyltransferase, an enzyme that inactivates mercaptopurine, he receives a greatly reduced dose. Only a few years ago, about 1 in 300 patients had serious, sometimes lethal, hematopoietic adverse effects during mercaptopurine therapy. Although John is in this at-risk minority, a simple genetic test, which is now routine for patients beginning mercaptopurine therapy, alerts his physicians to this genetic predisposition. They reduce his dose of mercaptopurine and carefully monitor his blood levels, ensuring that the drug levels remain therapeutic, rather than toxic. John subsequently has an uneventful several-year maintenance period and achieves complete remission.
The Human Genome
These are two examples of genomic medicine, the application of our rapidly expanding knowledge of the human genome (Figure 1) to medical practice. Much is known, but much remains mysterious. We know that less than 2 percent of the human genome codes for proteins, while over 50 percent represents repeat sequences of several types, whose function is less well understood. These stretches of repetitive sequences, sometimes wrongly dismissed as "junk DNA," constitute an informative historical record of evolutionary biology, provide a rich source of information for population genetics and medical genetics, and by introducing changes into coding regions, are active agents for change within the genome.2
Genes are distributed unevenly across the human genome (Figure 1). Certain chromosomes, particularly 17, 19, and 22, are relatively gene dense as compared with others, such as 4, 8, 13, 18, and Y.3 Moreover, gene density varies within each chromosome, being highest in areas rich in the bases cytosine and guanine, rather than adenine and thymine.2,3 Chromosomes 13, 18, and 21, the three autosomes with the fewest genes, are also the three for which the occurrence of trisomy (i.e., three copies of a chromosome) is compatible with viability.
Not all genes reside on nuclear chromosomes; several dozen involved with energy metabolism are on the mitochondrial chromosome.13 Since ova are rich in mitochondria and sperm are not, mitochondrial DNA is usually inherited from the mother. Therefore, mitochondrial genes and diseases due to DNA-sequence variants in them are transmitted in a matrilineal pattern that is distinctly different from the pattern of inheritance of nuclear genes.
Monogenic Conditions
Over the course of the 20th century, a combination of theoretical insights, basic-science research, and clinical observation elucidated the inheritance of single-gene, or monogenic, disorders (also known as mendelian disorders, since they are transmitted in a manner consonant with Mendel's laws of inheritance). Modes of inheritance have been established for thousands of conditions caused by mutations in single genes; these have been catalogued in a textbook11 and, more recently, in an online compendium14 known as Online Mendelian Inheritance in Man (OMIM). For nearly 100 years, autosomal dominant, autosomal recessive, and X-linked modes of inheritance have been understood and known to cause human disease. In the past few decades, other mechanisms of monogenic inheritance have been described. These include mitochondrial inheritance,13 imprinting (a mechanism by which the effects of certain genes depend on whether they are inherited through the mother or through the father),15 uniparental disomy (the occasional situation in which both members of 1 pair of a person's 23 pairs of chromosomes derive from one parent),16 and expanding trinucleotide repeats (a phenomenon in which a sequence of three base pairs that is normally repeated a number of times in a row in the genome becomes repeated by more than the normal number of times, sometimes causing disease).17
Most single-gene conditions are uncommon. Even the commonest, such as hereditary hemochromatosis (approximate incidence, 1 in 300 persons), cystic fibrosis (approximate incidence, 1 in 3000), alpha1-antitrypsin deficiency (approximate incidence, 1 in 1700), or neurofibromatosis (approximate incidence, 1 in 3000), affect no more than 1 in several hundred people in the United States. However, the total effect of monogenic conditions is substantial, from both the individual patient's and public health perspectives, and increased understanding of genetics has already begun to improve the health of some patients with such conditions. The delineation of the mechanisms by which genetic factors cause monogenic disorders has provided important information about basic pathophysiological processes that underlie related disorders that occur with far greater frequency than do these genetic disorders. For instance, insights regarding familial hypercholesterolemia, a genetic disorder that affects only 1 of every 500 people in the United States, were instrumental to understanding the pathophysiology of atherosclerosis, which affects a large fraction of the population, and the development of the statin drugs, which are among the most frequently prescribed medications.18
Types of Mutation
There are a number of ways to categorize mutations. One is according to the causative mechanism, whereas another is according to their functional effect. When classified according to the mechanism, point mutations (that is, changes in a single DNA base in the sequence) are the most common. There are many types of point mutations. One type is a missense mutation (Figure 3), a substitution that leads to an alternative amino acid, because of the way in which it changes the three-base sequence, or codon, that codes for an amino acid. Nonsense mutations (Figure 3) are a more dramatically deleterious type of point mutation that change the codon to a "stop" codon, a codon that causes the termination of the protein instead of producing an amino acid. Another type of mutation is the frame-shift mutation (Figure 3), which changes the reading frame of the gene downstream from it, often leading to a premature stop codon.
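The categories of point mutation, and the effect of a frame shift, can be illustrated with a short Python sketch. It is illustrative only: the 12-base sequence and the function names `translate` and `classify_substitution` are invented for this example and do not come from a real gene or library; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def classify_substitution(dna, pos, new_base):
    """Classify a single-base substitution at 0-based index `pos` (hypothetical helper)."""
    start = pos - pos % 3                      # first base of the affected codon
    old_codon = dna[start:start + 3]
    offset = pos % 3
    new_codon = old_codon[:offset] + new_base + old_codon[offset + 1:]
    old_aa, new_aa = CODON_TABLE[old_codon], CODON_TABLE[new_codon]
    if new_aa == old_aa:
        return "silent"
    if new_aa == "*":
        return "nonsense"
    return "missense"


# Made-up sequence: ATG CTG AAG GGT encodes Met-Leu-Lys-Gly.
dna = "ATGCTGAAGGGT"
print(translate(dna))                          # MLKG
print(classify_substitution(dna, 4, "C"))      # CTG -> CCG (Leu -> Pro): missense
print(classify_substitution(dna, 5, "A"))      # CTG -> CTA (Leu -> Leu): silent
print(classify_substitution(dna, 6, "T"))      # AAG -> TAG (Lys -> stop): nonsense

# Frame shift: deleting one base changes every downstream codon.
# Here the new frame reads ATG TGA, a premature stop codon.
frameshifted = dna[:3] + dna[4:]
print(translate(frameshifted))                 # M
```

In this toy sequence the single-base deletion truncates the protein after its first amino acid, which is the "premature stop codon" outcome described above.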
Although mutations can cause disease by a variety of means, the most common is loss of function. Loss-of-function mutations alter the phenotype by decreasing the quantity or the functional activity of a protein. For instance, mutations in the glucose-6-phosphate dehydrogenase (G6PD) gene on the X chromosome decrease the functional activity of this enzyme, leading to acute hemolytic anemia if a male (who would, of course, have only one copy of the X chromosome) with the mutation is exposed to certain drugs, including sulfonamides, primaquine, and nitrofurantoins. Since genes do not exist just to handle pharmacologic agents, variants that cause a more severe deficiency of glucose-6-phosphate dehydrogenase also lead to hemolytic anemia when affected males ingest fava beans (favism), since this enzyme is also important in the degradation of a component of the beans.19
Some mutations cause disease through a gain of function, whereby the protein takes on some new, toxic function. Expanding exonic CAG trinucleotide repeats that cause disorders including Huntington's disease and spinocerebellar ataxia appear to lead to neuropathologic abnormalities by producing proteins that function abnormally because of expanded polyglutamine tracts (CAG codes for the amino acid glutamine).20 Such gain-of-function mutations are often dominantly inherited, since a single copy of the mutant gene can alter function.
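As a rough illustration of what "expanded" means here, the following Python sketch counts the longest uninterrupted run of CAG codons in a sequence. The sequences, the repeat counts (5 and 12), and the helper name `longest_repeat_run` are assumptions made for the example; they are not clinical thresholds and are not drawn from any real gene.

```python
import re


def longest_repeat_run(dna, unit="CAG"):
    """Return the largest number of consecutive copies of `unit` found in `dna`."""
    runs = re.findall(f"(?:{re.escape(unit)})+", dna)
    return max((len(run) // len(unit) for run in runs), default=0)


# Made-up coding sequences: a start codon, a CAG tract, then CCG and a stop codon.
typical = "ATG" + "CAG" * 5 + "CCGTAA"
expanded = "ATG" + "CAG" * 12 + "CCGTAA"

for label, seq in (("typical", typical), ("expanded", expanded)):
    n = longest_repeat_run(seq)
    # When the tract is in frame, each CAG codon adds one glutamine (Q) to the protein.
    print(label, n, "Q" * n)
```

Because each in-frame CAG codon specifies glutamine, the repeat count equals the length of the polyglutamine tract in the encoded protein.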
One might assume that mutations in the approximately 98.5 percent of the genome that does not code for proteins do not affect the phenotype. Indeed, most do not. But others are regulatory mutations that may ultimately prove as important in the etiologic process of common diseases as the coding region variants. Regulatory mutations act by altering the expression of a gene. For instance, a regulatory mutation might lead to the loss of expression of a gene, to unexpected expression in a tissue in which it is usually silent, or to a change in the time at which it is expressed. Examples of regulatory mutations associated with disease are those in the flanking region of the FMR1 gene (causing fragile X syndrome),21 the insulin gene flanking region (increasing the risk of type 1 diabetes mellitus),22 a regulatory site of the type I collagen gene (increasing the risk of osteoporosis),23 and an intronic regulatory site of the calpain-10 gene (increasing the risk of type 2 diabetes mellitus).24
Mutations can also decrease the risk of a disease. One example of this is a 32-bp deletion (a frame shift) in a chemokine receptor gene, CCR5. Persons who are homozygous for this deletion prove almost completely resistant to infection with HIV type 1, and those who are heterozygous for the deletion have slower progression from infection to AIDS. These effects arise because CCR5 is an important part of the mechanism by which HIV enters the cell.25
Genes in Common Disease
The study of genomics will most likely make its greatest contribution to health by revealing mechanisms of common, complex diseases, such as hypertension, diabetes, and asthma. So far, most genes involved in common diseases have been identified by virtue of their high penetrance; that is, the mutations lead to disease in a fairly large proportion of people who have them. Examples include mutations in BRCA1 and BRCA2 (increasing the risk of breast and ovarian cancer),26 HNPCC (increasing the risk of hereditary nonpolyposis colorectal cancer),6 MODY 1, MODY 2, and MODY 3 (increasing the risk of diabetes),27 and the gene for α-synuclein (causing Parkinson's disease).9 One can think of these as near-mendelian subgroups of disease within a larger group of affected persons. If a person has such a mutation, the likelihood of disease is great. However, each of these highly penetrant mutations associated with common disease has a prevalence in the general population of only one in several hundred to several thousand people.
From a public health perspective, genes with mutations that are less highly penetrant but much more prevalent have a greater effect on the population than genes that are highly penetrant but uncommon. Such mutations have been reported in genes such as APC (which increases the risk of colorectal cancer)28 and factor V Leiden (which increases the risk of thrombosis).29
An example of the relative contributions of rare, highly penetrant mutations as opposed to common, less penetrant ones is seen in Alzheimer's disease. Rare mutations in presenilin 1, presenilin 2, or the β-amyloid precursor protein gene are highly penetrant causes of early-onset Alzheimer's disease; indeed, Alzheimer's disease develops by the age of 60 years in most people who are heterozygous for a mutation in one of these genes.30 However, because so few people carry a mutation in any of these genes, these mutations play a part in fewer than 1 percent of cases of Alzheimer's disease.31 In contrast, the apolipoprotein E ε4 allele also increases the risk of late-onset Alzheimer's disease (and atherosclerosis32), but more subtly. One representative Finnish study found that Alzheimer's disease develops during the mid-70s in approximately 8 percent of persons who are heterozygous for the ε4 allele and 21 percent of those who are homozygous for it, as compared with 3 percent of those with no ε4 allele.33 Nonetheless, because approximately 26 percent of the U.S. population is heterozygous and 2 percent is homozygous34 for the apolipoprotein E ε4 allele, this genetic factor has a role in many more cases of Alzheimer's disease than do the mutations in the genes for presenilin 1, presenilin 2, and β-amyloid precursor protein combined.
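The arithmetic behind this comparison can be sketched in a few lines of Python. This is a back-of-the-envelope illustration, not an epidemiologic estimate: it assumes that the Finnish risk figures can be applied directly to the U.S. genotype frequencies quoted above, and it derives the 72 percent non-carrier share as the remainder (100 - 26 - 2). The function name `case_breakdown` is hypothetical.

```python
# genotype: (share of population, risk of Alzheimer's disease by the mid-70s)
# Shares and risks are the figures quoted in the text; 0.72 is derived as 1 - 0.26 - 0.02.
GENOTYPES = {
    "no e4": (0.72, 0.03),
    "heterozygous": (0.26, 0.08),
    "homozygous": (0.02, 0.21),
}


def case_breakdown(genotypes, baseline="no e4"):
    """Return (overall risk, share of cases in carriers, share of cases in excess of baseline)."""
    total = sum(share * risk for share, risk in genotypes.values())
    base_risk = genotypes[baseline][1]
    in_carriers = sum(
        share * risk for name, (share, risk) in genotypes.items() if name != baseline
    )
    # Cases that would not be expected if carriers had the non-carrier risk.
    excess = sum(share * (risk - base_risk) for share, risk in genotypes.values())
    return total, in_carriers / total, excess / total


total, carrier_share, excess_share = case_breakdown(GENOTYPES)
print(f"cases per 1000 people: {1000 * total:.1f}")              # 46.6
print(f"share of cases in e4 carriers: {carrier_share:.1%}")     # 53.6%
print(f"share of cases in excess of baseline: {excess_share:.1%}")  # 35.6%
```

Under these assumptions, roughly half of all cases occur in ε4 carriers and about a third are in excess of the non-carrier baseline, as against the fewer than 1 percent of cases linked to the rare, highly penetrant mutations.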
Variation in the Human Genome
One characteristic of the human genome with medical and social relevance is that, on average, two unrelated persons share over 99.9 percent of their DNA sequences.3 However, given the more than 3 billion base pairs that constitute the human genome, this also means that the DNA sequences of two unrelated humans vary at millions of bases. Since a person's genotype represents the blending of parental genotypes, we are each thus heterozygous at about 3 million bases. Many efforts are currently under way, in both the academic and commercial sectors, to catalogue these variants, commonly referred to as "single-nucleotide polymorphisms" (SNPs), and to correlate these specific genotypic variations with specific phenotypic variations relevant to health.
Some SNP-phenotype correlations occur as a direct result of the influence of the SNP on health. More commonly, however, the SNP is merely a marker of biologic diversity that happens to correlate with health because of its proximity to the genetic factor that is actually the cause. In this sense, the term "proximity" is only a rough measure of physical closeness; instead, it connotes that, as genetic material has passed through 5000 generations from our common African ancestral pool, recombination between the SNP and the actual genetic factor has occurred only rarely. In genetic terms, the SNP and the actual genetic factor are said to be in linkage disequilibrium (Figure 4).
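Linkage disequilibrium can be made concrete with the standard population-genetics measures D and r², which are not given in this article but are widely used. The Python sketch below computes them from allele and haplotype frequencies and shows how quickly an association is expected to decay over 5000 generations for a given recombination fraction, under the textbook relation D_t = D_0(1 - c)^t. All frequencies and recombination fractions are invented for illustration.

```python
def ld_coefficient(p_ab, p_a, p_b):
    """D: observed haplotype frequency minus the frequency expected under independence."""
    return p_ab - p_a * p_b


def r_squared(p_ab, p_a, p_b):
    """Squared correlation between the alleles at the two loci (0 = none, 1 = complete)."""
    d = ld_coefficient(p_ab, p_a, p_b)
    return d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))


def ld_remaining(recomb_fraction, generations):
    """Fraction of the initial D expected to survive after the given number of generations."""
    return (1 - recomb_fraction) ** generations


# Hypothetical SNP allele (frequency 0.2) and causal variant (frequency 0.2)
# that occur together on 18 percent of chromosomes.
print(ld_coefficient(0.18, 0.2, 0.2))
print(r_squared(0.18, 0.2, 0.2))

# Over 5000 generations, a tightly linked pair keeps most of its association,
# whereas a more loosely linked pair loses nearly all of it.
print(ld_remaining(1e-5, 5000))
print(ld_remaining(1e-3, 5000))
```

The second pair of calls reflects the point made above: a SNP remains a useful marker only when recombination between it and the causal variant has been rare over the generations since the common ancestral pool.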
Conclusions
Except for monozygotic twins, each person's genome is unique. All physicians will soon need to understand the concept of genetic variability, its interactions with the environment, and its implications for patient care. With the sequencing of the human genome only months from its finish, the practice of medicine has now entered an era in which the individual patient's genome will help determine the optimal approach to care, whether it is preventive, diagnostic, or therapeutic. Genomics, which has quickly emerged as the central basic science of biomedical research, is poised to take center stage in clinical medicine as well.
Editor's note: As they are published, the articles in the Genomic Medicine Series will be available without charge at http://www.nejm.org.
Source Information
From the National Human Genome Research Institute, National Institutes of Health, Bethesda, Md.
Address reprint requests to Dr. Guttmacher at Bldg. 31, Rm. 4B09, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892-2152, or at guttmach{at}mail.nih.gov.
References
1 gene. Nat Genet 1996;14:203-205.
Glossary
The following terms are used in the text or figures of this article or others in the Genomic Medicine Series. (For a "talking glossary" of many genetics terms, see http://www.genome.gov/glossary.cfm.)
Allele An alternative form of a gene.
Alternative splicing A regulatory mechanism by which variations in the incorporation of a gene's exons, or coding regions, into messenger RNA lead to the production of more than one related protein, or isoform.
Autosomes All of the chromosomes except for the sex chromosomes and the mitochondrial chromosome.
Centromere The constricted region near the center of a chromosome that has a critical role in cell division.
Codon A three-base sequence of DNA or RNA that specifies a single amino acid.
Conservative mutation A change in a DNA or RNA sequence that leads to the replacement of one amino acid with a biochemically similar one.
Epigenetic A term describing nonmutational phenomena, such as methylation and histone modification, that modify the expression of a gene.
Exon A region of a gene that codes for a protein.
Frame-shift mutation The addition or deletion of a number of DNA bases that is not a multiple of three, thus causing a shift in the reading frame of the gene. This shift leads to a change in the reading frame of all parts of the gene that are downstream from the mutation, often leading to a premature stop codon and ultimately, to a truncated protein.
Gain-of-function mutation A mutation that produces a protein that takes on a new or enhanced function.
Genomics The study of the functions and interactions of all the genes in the genome, including their interactions with environmental factors.
Genotype A person's genetic makeup, as reflected by his or her DNA sequence.
Haplotype A group of nearby alleles that are inherited together.
Heterozygous Having two different alleles at a specific autosomal (or X chromosome in a female) gene locus.
Homozygous Having two identical alleles at a specific autosomal (or X chromosome in a female) gene locus.
Intron A region of a gene that does not code for a protein.
Linkage disequilibrium The nonrandom association in a population of alleles at nearby loci.
Loss-of-function mutation A mutation that decreases the production or function of a protein (or does both).
Missense mutation Substitution of a single DNA base that results in a codon that specifies an alternative amino acid.
Monogenic Caused by a mutation in a single gene.
Motif A DNA-sequence pattern within a gene that, because of its similarity to sequences in other known genes, suggests a possible function of the gene, its protein product, or both.
Multifactorial Caused by the interaction of multiple genetic and environmental factors.
Nonconservative mutation A change in the DNA or RNA sequence that leads to the replacement of one amino acid with a very dissimilar one.
Nonsense mutation Substitution of a single DNA base that results in a stop codon, thus leading to the truncation of a protein.
Penetrance The likelihood that a person carrying a particular mutant gene will have an altered phenotype.
Phenotype The clinical presentation or expression of a specific gene or genes, environmental factors, or both.
Point mutation The substitution of a single DNA base in the normal DNA sequence.
Regulatory mutation A mutation in a region of the genome that does not encode a protein but affects the expression of a gene.
Repeat sequence A stretch of DNA bases that occurs in the genome in multiple identical or closely related copies.
Silent mutation Substitution of a single DNA base that produces no change in the amino acid sequence of the encoded protein.
Single-nucleotide polymorphism (SNP) A common variant in the genome sequence; the human genome contains about 10 million SNPs.
Stop codon A codon that leads to the termination of a protein rather than to the addition of an amino acid. The three stop codons are TGA, TAA, and TAG.
The New England Journal of Medicine is owned, published, and copyrighted © 2009 Massachusetts Medical Society. All rights reserved.