Author Affiliations: Department of Statistical Genetics (Dr Visscher) and Department of Molecular Epidemiology (Dr Montgomery), Queensland Institute of Medical Research, Brisbane, Australia.
Many common complex diseases such as hypertension, diabetes, coronary heart disease, psychiatric disorders, and some cancers have a genetic etiology. Despite enormous efforts over the last few decades, little real progress was made in finding the genes and causal variants involved. Genome-wide association studies, in which hundreds of thousands of DNA markers are tested (usually in a case-control design) for association with disease, provide the first effective approach to search for genetic variants that contribute to the complex etiology of common human diseases.
In the last 3 years, almost 1000 variants associated with a range of human traits and common diseases have been identified using genome-wide association methods (Figure).1 -Â 2 To date, most of these studies have been in populations of European descent.
SNPs indicates single-nucleotide polymorphisms. Data are adapted from the National Human Genome Research Institute1 at http://www.genome.gov/gwastudies/.
Genome-wide search strategies developed from advances in genotyping technology, greater understanding of the structure of common variation in the human genome, and continued advances in computing power and software tools. Commercial genotyping platforms can type as many as 1 million single-nucleotide polymorphisms (SNPs) on a single chip, capturing (tagging) most variation between individuals in a single experiment. Instead of genotyping per sample the 10 million to 15 million common SNPs that segregate in the population for each sample, a much smaller subset of approximately 500Â 000 SNPs is sufficient to cover common variation in the genome. The flipside of this redundancy is that SNPs that are statistically associated with disease are unlikely to be causal and will be correlated with an ungenotyped causal variant.
Genome-wide association studies have provided insights about disease, in particular: (1) for almost any disease that has been investigated, there are SNP variants common in the population (with an allele frequency >5%) that are robustly associated with disease; (2) most of these variants are in genes that contribute to biological pathways that were previously not known to be involved in disease or are nowhere near a known protein-coding gene; (3) the effect sizes of associated SNPs are typically small with odds ratios of risk alleles in the range of approximately 1.1 to 1.5; (4) for any particular disease, accumulating the effects of many different SNPs associated with a disease usually explains only a small fraction of the familial risk (or heritability); and (5) not all diseases and traits are alike in genetic architecture. For example, in age-related macular degeneration, approximately 50% of genetic variation has been accounted for by only 6 loci,3 whereas for adult height, only 6% of genetic variation has been accounted for by approximately 50 loci.4 In the iron homeostasis pathway, several common SNPs have been reported that each explain 5% or more of genetic variation.5
Failure to account for much of the genetic variation or “missing heritability”6 has divided the community with respect to the success or failure of genome-wide association studies. Since the common goal is to understand pathways to disease and develop improved methods of prevention, diagnosis, and treatment, it is important to understand what the current flood of associated SNPs reveals about disease biology and gene regulation, why so little genetic variation has been accounted for, and what experimental approaches might lead the identification of causal variants and mechanisms.
The main hypotheses that explain the missing heritability are that either there are many variants with odds ratios so small they do not reach formal statistical significance despite large genome-wide association studies, or causal variants are not tagged well by SNPs on the commercial chips (eg, because these variants occur at lower frequency or are in areas of the genome in which it is difficult to develop SNP assays).
If most causal variants have small odds ratios, many loci must be contributing to genetic variation for disease risk and therefore, all individuals must carry their own, probably unique, portfolio of risk variants. Mathematical polygenic models for disease are well developed and are consistent with the observation that some cases appear to be sporadic whereas other cases have a family history. A recent genome-wide association analysis of schizophrenia showed that the data were consistent with a substantial proportion of disease risk that is polygenic,7 suggesting that hundreds or even thousands of variants contribute to risk—similar to the findings for human height. Another consequence of many genes being involved is that there must be widespread pleiotropy, ie, comorbidity due to shared genetic factors. Indeed, genome-wide association analysis of schizophrenia and bipolar disorder revealed a strong overlap in risk.7 Thus, genetic analysis can lead to reassessment of diagnostic criteria.
Resequencing of genomic regions is uncovering much new variation. Emerging evidence points to both common and rare variants contributing to disease risk. For example, Nejentsev et al8 resequenced a candidate gene for type 1 diabetes and detected 4 new variants at approximately 1% frequency that in total contributed more to variation in risk in the population than a single common variant in the same gene detected by a previous genome-wide association study. Causal variants at lower frequencies must have very large odds ratios to contribute substantially to variation in disease risk. If not, combined resequencing and genotyping projects to detect such variants will need to be even larger than current genome-wide association studies.
Whatever the source of the missing heritability, genome-wide association results have provided novel insights into the pathogenesis of many diseases. Variants that increase the risk of type 2 diabetes influence β-cell development and function, and focus attention on insulin secretion in the development of disease.9 Discoveries in inflammatory bowel disease have highlighted the importance of the autophagy pathway in disease development. Better understanding of disease pathogenesis has already opened up new avenues for research likely to provide opportunities for drug discovery in the future.
In addition to providing new insights into disease biology, another consideration is whether and how genome-wide association studies will affect clinical outcome. Commercial companies are offering personalized genomic risk profiles based on genome-wide association results directly to consumers. However, since for most diseases, at present, only a small proportion of familial risk has been accounted for, most risk profiles closely follow the population average and the value of these profiles is questionable. If a substantial proportion of genetic variation has been accounted for by genetic markers, then risk predictors could be constructed that are more informative than current predictors based on family history. Risk prediction profiles can also be used to investigate genetic comorbidity and to evaluate use of current diagnostic criteria in closely related conditions.
Whole genome resequencing will uncover new variation but will be constrained by sample size in detecting association between rare variants and disease. Ever larger genome-wide association studies will continue to provide insight into the genetic architecture of complex diseases. Although genome-wide association studies have facilitated critical discovery in understanding the genetics of disease, much more remains to be done to translate the flood of knowledge into clinical practice.
Corresponding Author: Peter M. Visscher, PhD, Queensland Statistical Genetics, Queensland Institute of Medical Research, 300 Herston Rd, Brisbane, Australia 4006 (peter.visscher@qimr.edu.au).
Financial Disclosures: None reported.
Funding/Support: This work was supported by the Australian National Health and Medical Research Council.
Role of the Sponsors: The funding organization had no role in the preparation, review, or approval of the manuscript.
Country-Specific Mortality and Growth Failure in Infancy and Yound Children and Association With Material Stature
Use interactive graphics and maps to view and sort country-specific infant and early dhildhood mortality and growth failure data and their association with maternal
Instructions
Comments are moderated and will appear on the site at the discretion of the Journal of American Medical Association editors. Comments should not exceed 500 words of text and 10 references.
Do not submit personal medical questions or information that could identify a specific patient, questions about a particular case, or general inquiries to an author. Only content that has not been published, posted, or submitted elsewhere should be submitted. By submitting this Comment, you and any coauthors transfer copyright to the journal if your Comment is posted.
* = Required Field
Disclosure of Any Conflicts of Interest* Indicate all relevant conflicts of interest of each author below, including all relevant financial interests, activities, and relationships within the past 3 years including, but not limited to, employment, affiliation, grants or funding, consultancies, honoraria or payment, speakers’ bureaus, stock ownership or options, expert testimony, royalties, donation of medical equipment, or patents planned, pending, or issued. If all authors have none, check "No potential conflicts or relevant financial interests" in the box below. Please also indicate any funding received in support of this work. The information will be posted with your response.
Register and get free email Table of Contents alerts, saved searches, PowerPoint downloads, CME quizzes, and more
Subscribe for full-text access to content from 1998 forward and a host of useful features
Activate your current subscription (AMA members and current subscribers)
Some tools below are only available to our subscribers or users with an online account.
Download citation file:
Customize your page view by dragging & repositioning the boxes below.
Genome-wide association (GWA) study
All results at JAMAevidence.com >
and access these and other features:
Register Now
Enter your username and email address. We'll send you a reminder to the email address on record.
Athens and Shibboleth are access management services that provide single sign-on to protected resources. They replace the multiple user names and passwords necessary to access subscription-based content with a single user name and password that can be entered once per session. It operates independently of a user's location or IP address. If your institution uses Athens or Shibboleth authentication, please contact your site administrator to receive your user name and password.