0
Commentary |

Statistical Association and Causation: Title and subTitle BreakContributions of Different Types of Evidence

Charles H. Hennekens, MD, DrPH; David DeMets, PhD
[+] Author Affiliations

Author Affiliations: Charles E. Schmidt College of Medicine, Florida Atlantic University, Nova Southeastern University, and University of Miami Miller School of Medicine, Boca Raton, Florida (Dr Hennekens); and Department of Biostatistics and Informatics, University of Wisconsin School of Medicine, Madison (Dr DeMets).


JAMA. 2011;305(11):1134-1135. doi:10.1001/jama.2011.322
Text Size: A A A
Published online

Advances in medical knowledge proceed on several fronts, optimally simultaneously. Each discipline provides unique, relevant, and complementary information to a totality of evidence. When the totality of evidence is sufficient, health care professionals can make the most rational decisions for individual patients and policy makers can make the most rational decisions for the health of the general public.1 When the totality of evidence is incomplete, it is appropriate to remain uncertain.2 Nonetheless, health care professionals and policy makers are always faced with decision making. Although medical researchers are likely to be familiar with these concepts, this commentary is primarily for clinicians and policy makers to increase their knowledge and understanding of the unique contributions of different types of evidence to the conclusion of a valid statistical association as well as the need to evaluate the totality of evidence to judge causality.

Basic Research. Basic research provides cogent biological mechanisms to answer the crucial question of why an agent or intervention affects morbidity or mortality. Basic research has the unique strength of precision and can control virtually all genetic and environmental exposures to provide important directions to guide as well as explain human research. Basic research also has the unique disadvantage of questionable relevance to humans.

Clinicians and Clinical Research. Clinicians provide enormous benefits to patients by applications of advances in diagnosis and treatment and formulate hypotheses based on their clinical experiences, specifically case reports and case series. Clinical investigators test the relevance of basic research to healthy and sick individuals.

Epidemiology and Biostatistics. Epidemiologists and biostatisticians, optimally collaborating with clinicians, formulate hypotheses from descriptive studies and test those hypotheses in analytic studies. Analytic epidemiological studies designed a priori to test hypotheses address the complementary question of whether there is a valid statistical association between an exposure, such as a drug or intervention, and an outcome, such as morbidity or mortality. Epidemiology is relevant to free-living humans but has the unique disadvantage of being crude and inexact as observations on free-living humans can never take place under the controlled conditions of basic research.

Descriptive Studies. Descriptive studies report patterns of disease in relation to person, place, and time. Data from descriptive studies are essential for public health administrators to allocate resources efficiently and to plan effective prevention or education programs, and for researchers to formulate hypotheses that can be tested subsequently using an analytic study design. The 3 main types of descriptive studies are correlational or ecologic, which consider patterns of disease among populations, case reports or case series, and cross-sectional surveys, both of which involve individuals.

Observational Analytic Studies. Observational analytic studies can be either case-control studies, especially useful for rare diseases, or cohort studies, especially useful for rare exposures. These are hypothesis testing for moderate to large effect sizes only if designed a priori to do so. If not, the results should instead be considered hypothesis formulating. In particular, case-control or cohort study data collected for administrative purposes, no matter how large in sample size, should be considered hypothesis formulating. For small to moderate effect sizes, however, all observational analytic studies, no matter how well designed, conducted, analyzed, and interpreted, are hypothesis formulating as their inherent uncontrolled and uncontrollable confounding can be as big as the effect sizes.

Meta-analyses of Randomized Trials. Meta-analyses of randomized trials that were not designed a priori to test hypotheses about clinical events and, as a result, were not large enough to do so, should be considered hypothesis formulating. Meta-analysis of individual randomized trials reduce the role of chance but may introduce bias and confounding. For example, individual trials may have different design features, different exposures being studied, including the nature and dose of drugs, and different rates of adherence and follow-up. The quality and usefulness of any meta-analyses depend on the quality and comparability of the data from the component trials. In particular, to be considered hypothesis testing the component trials should have been designed and conducted to evaluate the end point of interest, should have high adherence and follow-up rates, and should have reasonably comparable exposures and outcomes.

Large-Scale Randomized Trials. Large-scale randomized trials designed a priori to test hypotheses provide the most reliable evidence concerning the most plausible small to moderate effect sizes. In large samples, randomization provides a degree of control of confounding that is not possible to achieve in any observational analytic study.3

The results of any analytic epidemiological study designed a priori to test a hypothesis may have 3 plausible alternative explanations. The first is the play of chance or the “luck of the draw,” which can occur any time a sample of a population is examined. The second is bias or systematic error in the way individuals were selected or observed in the study. The third is confounding or variables associated with the exposure and independent risk factors for the outcome. All 3 explanations must be carefully considered before concluding that there is a valid statistical association from any analytic study designed a priori to test a hypothesis.

Judgments about causation can be safely made only on a sufficient totality of evidence. Several positive criteria support a judgment of causality, including strength of association, biological credibility, consistency, temporal sequence, and dose-response relationship.3

A greater strength of association implies that plausible alternative explanations are less likely. A known or postulated biological mechanism also enhances the belief in causality. In addition, the belief in causality is enhanced by the consistency of findings from independent studies designed a priori to test a hypothesis. However, temporal sequence can sometimes be difficult to establish. Many lifestyle variables such as physical activity, smoking, or consumption of particular foods or beverages may be altered after the first symptoms appear, either deliberately or as a biological consequence of the disease. In addition, a dose-response relationship may merely reflect the effect of an uncontrolled confounding factor, as is likely to be the case with cigarette smoking and cirrhosis of the liver, a statistical association most likely due to confounding by heavy alcohol consumption.4 Finally, a dose-response relationship can be missed by an inability to precisely measure exposure.

The conclusion of a valid statistical association can be safely made from an analytic study designed a priori to test a hypothesis after exclusion of chance, bias, and confounding as plausible alternative explanations. For small to moderate effect sizes, large-scale randomized trials are a necessary component of a sufficient totality of evidence. A valid statistical association may or may not be causal. Causation is a judgment based on the totality of evidence that includes several positive criteria. As Hill aptly stated in 1965: “All scientific work is incomplete—whether it be observational or experimental. All scientific work is liable to be upset or modified by advancing knowledge. That does not confer upon us a freedom to ignore the knowledge we already have, or to postpone the action that it appears to demand at a given time. Who knows, asked Robert Browning, but the world may end tonight? True, but on available evidence most of us make ready to commute on the 8:30 next day.”5

Corresponding Author: Charles H. Hennekens, MD, DrPH, Florida Atlantic University, 2800 S Ocean Blvd, PHA, Boca Raton, FL 33432 (chenneke@fau.edu).

Conflict of Interest Disclosures: Both authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Dr Hennekens reported being funded by the Charles E. Schmidt College of Medicine, Department of Clinical Science, and Medical Education and Center of Excellence, Florida Atlantic University (FAU), as principal investigator on 2 investigator-initiated research grants funded to FAU by Bayer; serving in an advisory role to investigators and sponsors as chair of the data and safety monitoring boards of randomized trials funded by Actelion, Amgen, Anthera, Bristol-Myers Squibb, and Sunovion, and as a member of the data and safety monitoring boards of randomized trials funded by Bayer and the Canadian Institutes of Health Research; serving in an advisory role to the US Food and Drug Administration, US National Institutes of Health, and UpToDate; serving as an independent scientist in an advisory role to legal counsel for GlaxoSmithKline and Stryker; serving as a speaker for the Association of Research in Vision and Ophthalmology, National Association for Continuing Education, PriMed, the International Atherosclerosis Society, AstraZeneca, and Pfizer; receiving royalties for authorship or editorship of 3 textbooks and as coinventor on patents for inflammatory markers and cardiovascular disease that are held by Brigham and Women's Hospital; and having an investment-management relationship with the West-Bacon Group within SunTrust Investment Services, which has discretionary investment authority. Dr DeMets reported being partially supported by a US National Institutes of Health grant to the University of Wisconsin for the Clinical Translational Science Award for statistical consultation and collaboration and administrative leadership, as a leader of the Data Management and Biostatistics Core (Core C) of the Wisconsin Alzheimer's Disease Research Center grant, by serving as a principal investigator on University of Wisconsin contracts with industry for statistical analysis center activity for multicenter trials, which are currently sponsored by Amgen, AstraZeneca, and Bristol-Myers Squibb; serving as an independent biostatistician in an advisory role to investigators and sponsors as a member of the data and safety monitoring boards of randomized trials funded by Actelion, Amgen, Astellas, AstraZeneca, Biotronik, Boehringer-Ingelheim, CVRx, Genentech, GlaxoSmithKline, Merck, Novartis, Pfizer, Roche, Sanofi Aventis, Takeda, Duke Clinical Research Institute, and the Population Health Research Institute of McMaster University, Canadian Institutes of Health Research, Harvard Clinical Research Institute, and Hamilton Clinical Research Institute; receiving royalties from publishers of the 3 textbooks that he has coauthored and edited; having tax-sheltered retirement accounts in mutual funds with Fidelity and UBS; and having 2 small accounts of stock with Sun and Intel.

Additional Information: This commentary is dedicated to the memories of Richard Doll, FRS, and Max Halperin, PhD.

Additional Contributions: Richard Peto, FRS (University of Oxford, Oxford, England), provided advice and help with the manuscript.

Hennekens CH, Buring JE. Epidemiology in Medicine. Boston, MA: Little Brown & Co; 1987
Hennekens CH, Demets D. The need for large-scale randomized evidence without undue emphasis on small trials, meta-analyses, or subgroup analyses.  JAMA. 2009;302(21):2361-2362
PubMedCrossRef
Friedman LM, Furberg CD, DeMets DL. Fundamentals of Clinical Trials. New York, NY: Springer Verlag; 1998
Doll R, Peto R, Wheatley K, Gray R, Sutherland I. Mortality in relation to smoking: 40 years' observations on male British doctors.  BMJ. 1994;309(6959):901-911
PubMedCrossRef
Hill AB. The environment and disease: association or causation?  Proc R Soc Med. 1965;58295-300
PubMed

First Page Preview

First page PDF preview

Figures

Tables

Interactive Graphics

Video

Country-Specific Mortality and Growth Failure in Infancy and Yound Children and Association With Material Stature

Use interactive graphics and maps to view and sort country-specific infant and early dhildhood mortality and growth failure data and their association with maternal

Hennekens CH, Buring JE. Epidemiology in Medicine. Boston, MA: Little Brown & Co; 1987
Hennekens CH, Demets D. The need for large-scale randomized evidence without undue emphasis on small trials, meta-analyses, or subgroup analyses.  JAMA. 2009;302(21):2361-2362
PubMedCrossRef
Friedman LM, Furberg CD, DeMets DL. Fundamentals of Clinical Trials. New York, NY: Springer Verlag; 1998
Doll R, Peto R, Wheatley K, Gray R, Sutherland I. Mortality in relation to smoking: 40 years' observations on male British doctors.  BMJ. 1994;309(6959):901-911
PubMedCrossRef
Hill AB. The environment and disease: association or causation?  Proc R Soc Med. 1965;58295-300
PubMed
CME Course for:


You need to register in order to view this quiz.


To understand the clinical management of acute heart failure syndromes.
Accreditation Information The American Medical Association is accredited by the Accreditation Council for Continuing Medical Education to provide continuing medical education for physicians.
The AMA designates this journal-based CME activity for a maximum of 1 AMA PRA Category 1 CreditTM per course. Physicians should claim only the credit commensurate with the extent of their participation in the activity.
Physicians who complete the CME course and score at least 80% correct on the quiz are eligible for AMA PRA Category 1 CreditTM.
Note: You must get at least of the answers correct to pass this quiz.
Note: You must get at least of the answers correct to pass this quiz.
You have not filled in all the answers to complete this quiz
The following questions were not answered:
Sorry, you have unsuccessfully completed this CME quiz with a score of
The following questions were not answered correctly:
For CME Course: A Proposed Model for Initial Assessment and Management of Acute Heart Failure Syndromes
Indicate what changes(s) you will implement in your practice, if any, based on this CME course.
To view and print your certificate and access a summary of your CME courses go to My CME.
NOTE:
Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s “Cited By” API will populate this tab (http://www.crossref.org/citedby.html).
Submit a Response

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Articles Related By Topic
Related Topics
JAMAevidence.com


Exposure