0
Editorial |

The Hazards of Evidence-Based Medicine: Title and subTitle BreakAssessing Variations in Care

Edward H. Livingston, MD; Robert A. McNutt, MD
[+] Author Affiliations

Author Affiliations: Division of Gastrointestinal and Endocrine Surgery, University of Texas Southwestern Medical Center, Dallas (Dr Livingston); and Department of Medicine, Rush University School of Medicine, Chicago, Illinois (Dr McNutt). Drs Livingston and McNutt are Contributing Editors, JAMA.


JAMA. 2011;306(7):762-763. doi:10.1001/jama.2011.1181
Text Size: A A A
Published online

Assessing quality of care frequently involves using measures of processes of care, such as Medicare's 25 quality metrics.1 Adherence to these processes is thought to lead to improved outcomes. For example, the Surgical Care Improvement Project was introduced in 2006, with the goal of reducing surgical complications by 25% by 2010.2 Based on observational studies demonstrating associations between process and outcomes, experts concluded that adherence to this series of process measures would result in better care. Medicare adopted these and published them on its Hospital Compare Web site1 as measures of hospital quality. However, for some process measures, studies have shown that adherence to these measures is not necessarily associated with improved outcomes. This has been the case for perioperative antibiotic use and postoperative wound infection3 and for acute myocardial infarction, heart failure, and pneumonia.4 More worrisome is that in some cases, adherence to the prescribed process measure may be associated with considerable harm, such as with tight glucose control in critically ill patients.5

Hence, rewarding institutions with high quality of care as determined by process measures and penalizing systems such as those with poorer performance on these metrics might pose potential harm. For instance, an analysis of geographic variation in health care delivery showed that regional variation in chronic conditions among Medicare beneficiaries was inversely related to their case-fatality rate.6 In that analysis, patients in some areas, such as McAllen, Texas, were among those with the highest mean number of chronic conditions, whereas patients in other areas, such as Grand Junction, Colorado, were among those with the fewest. McAllen has been associated with very high Medicare expenditures and poor quality of care based on Hospital Compare, whereas the opposite association has been noted for Grand Junction.7 However, although the Medicare expenditures in McAllen are relatively high compared with Grand Junction, the case-fatality rate is lower for a group of patients with more chronic conditions.6 The definitive outcome of a lower case-fatality rate in McAllen suggests that the higher expenditures were appropriate, yet the intermediate outcomes from process measures as defined on Hospital Compare1 suggest the opposite.

Provision of more or fewer health care services relative to the average may be either beneficial or harmful. For individual patients cared for by individual physicians, variation in care is sometimes desirable. One patient's chronic illness is not another’s, and treating all patients the same would be clinical nonsense. However, maximizing variation for individuals does not extend to groups of patients cared for by groups of physicians. Physicians and patients arriving at the same decisions in reference to treatment risks and benefits should average out when outcomes are compared among large groups. Thus, studies of variation in care seek to find similarity between large regions. These studies are limited because the outcomes measured and information available for such comparisons are not always reliable. Before concluding that variation is undesirable, caution is needed when assessing data to ensure that such variation accurately reflects the entire scope of medical decision making and how these decisions affect outcomes.

In this issue of JAMA, Haymart and colleagues8 report on variations in the administration of radioactive iodine (RAI) following operations for thyroid cancer. The incidence of thyroid cancer is increasing,8 but the reasons for the increase are unclear. Along with the increase in incidence, there has been a concomitant increase in thyroidectomy rates9 and an increase in the proportion of patients treated with RAI. Using data from a cohort of 189 219 patients with well-differentiated thyroid cancer treated at 981 hospitals associated with the National Cancer Database, Haymart et al8 reported a significant increase in the proportion of patients treated with RAI between 1990 and 2008, although there was wide variation in RAI use, with variation occurring in both directions. On one hand, approximately one-fourth to one-third of patients with advanced-stage disease or large tumors were not treated with RAI despite general agreement that RAI is necessary for these patients.10 On the other hand, a nearly equal proportion of individuals with low-risk thyroid cancer who probably should not have been treated with RAI did receive this therapy. After adjustment for available patient, tumor, and hospital factors, much of the variance was attributable to unexplained hospital characteristics. The authors conclude that the substantial between-hospital variation in RAI use suggests clinical uncertainty and perhaps inappropriate use of RAI in thyroid cancer management.

In this line of reasoning, conclusions are drawn by assessing variations in care from one hospital or region to another and making assumptions about the appropriateness of that care. However, there is incomplete knowledge about how and why care was delivered in hospitals showing variation. If RAI was not given to high-risk patients, the reasons it was not administered (such as patient preferences) are not captured in the database. If RAI was given to low-risk patients, subtle information regarding a clinician's decision to administer RAI is not captured in these databases. For example, during total thyroidectomy, some surgeons leave a rim of thyroid tissue adjacent to nerves to minimize the risk of nerve injury and rely on RAI to ablate the residual thyroid tissue. Such surgeons have made a conscious risk-benefit decision balancing nerve injury with the risks of RAI. This procedure is coded as a total thyroidectomy in an administrative database and appears in an analysis of that database to be associated with inappropriate administration of RAI. To date, no guideline or clinical trial refutes individual variation for a variety of clinical scenarios in treating thyroid cancer; thus, these scenarios fall into the realm of individual clinician judgment and decision making. Consequently, variation is to be expected. This study highlights the limits of using administrative data, which lack granular information about why decisions are made, to draw conclusions about the appropriateness of care.

Without an assessment of hard, irrefutable measures of clinical decision making that include individual preferences for treatment, decisions about the appropriateness of clinical treatments and variations of care cannot be made. For instance, when comparing McAllen, Texas, with Grand Junction, Colorado, the important outcome of mortality demonstrated that the greater per capita expenditures in McAllen7 could be justified because those patients were sicker yet, paradoxically, had lower case-fatality rates than their healthier counterparts in Grand Junction.6 An opposite conclusion was drawn when the outcome was quality of care as defined by measures that were not necessarily related to important clinical outcomes; ie, those listed on the Hospital Compare Web site.1 Similarly, in the report by Haymart et al8 on RAI use, without knowing if patients receiving RAI derived benefit or harm, it is difficult to conclude that RAI administration was appropriate or not. In some cases, evaluating intermediate outcomes such as variation in care or adherence to treatment guidelines without linking processes of care to meaningful clinical outcomes may be uninformative or, worse, misleading. The link between processes of care and outcomes must be convincingly demonstrated for proxy measures to be used in assessing quality.

Ultimately, treatment decisions are made by patients, not practitioners and not policy makers. Patients rely on physicians and other health care professionals to deliver the most amount of information possible about treatment risks and benefits. Patients then can consider these issues in the context of their own individual preferences. Policy makers should not interfere with this process unless the evidence they use to mandate care is substantial. Because of uncertainty in the integrity of most administrative databases and registries and the inherent limitation in the amount of information they contain about patient care, policy should only rarely be made based on findings from these sources.

AUTHOR INFORMATION

Corresponding Author: Edward H. Livingston, MD, Division of Gastrointestinal and Endocrine Surgery, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Room E7-126, Dallas, TX 75390-9156 (edward.livingston@utsouthwestern.edu).

Conflict of Interest Disclosures: The authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest and none were reported.

Editorials represent the opinions of the authors and JAMA and not those of the American Medical Association.

US Department of Health and Human Services.  Hospital Compare. http://www.hospitalcompare.hhs.gov. Accessed July 25, 2011
Hawn MT. Surgical care improvement: should performance measures have performance measures?  JAMA. 2010;303(24):2527-2528
PubMed
Stulberg JJ, Delaney CP, Neuhauser DV, Aron DC, Fu P, Koroukian SM. Adherence to surgical care improvement project measures and the association with postoperative infections.  JAMA. 2010;303(24):2479-2485
PubMed
Werner RM, Bradlow ET. Relationship between Medicare's hospital compare performance measures and mortality rates.  JAMA. 2006;296(22):2694-2702
PubMed
Finfer S, Chittock DR, Su SY,  et al; NICE-SUGAR Study Investigators.  Intensive vs conventional glucose control in critically ill patients.  N Engl J Med. 2009;360(13):1283-1297
PubMed
Welch HG, Sharp SM, Gottlieb DJ, Skinner JS, Wennberg JE. Geographic variation in diagnosis frequency and risk of death among Medicare beneficiaries.  JAMA. 2011;305(11):1113-1118
PubMed
Gawande A. The cost conundrum: what a Texas town can teach us about health care. New Yorker. June 1, 2009. http://www.newyorker.com/reporting/2009/06/01/090601fa_fact_gawande. Accessed July 25, 2011
Haymart MR, Banerjee M, Stewart AK, Koenig RJ, Birkmeyer JD, Griggs JJ. Use of radioactive iodine for thyroid cancer.  JAMA. 2011;306(7):721-728
Mitchell I, Livingston EH, Chang AY,  et al.  Trends in thyroid cancer demographics and surgical therapy in the United States.  Surgery. 2007;142(6):823-828
PubMed
Carty SE, Cooper DS, Doherty GM,  et al; American Thyroid Association Surgery Working Group; American Association of Endocrine Surgeons; American Academy of Otolaryngology-Head and Neck Surgery; American Head and Neck Society.  Consensus statement on the terminology and classification of central neck dissection for thyroid cancer.  Thyroid. 2009;19(11):1153-1158
PubMed

First Page Preview

First page PDF preview

Figures

Tables

Interactive Graphics

Video

Country-Specific Mortality and Growth Failure in Infancy and Yound Children and Association With Material Stature

Use interactive graphics and maps to view and sort country-specific infant and early dhildhood mortality and growth failure data and their association with maternal

US Department of Health and Human Services.  Hospital Compare. http://www.hospitalcompare.hhs.gov. Accessed July 25, 2011
Hawn MT. Surgical care improvement: should performance measures have performance measures?  JAMA. 2010;303(24):2527-2528
PubMed
Stulberg JJ, Delaney CP, Neuhauser DV, Aron DC, Fu P, Koroukian SM. Adherence to surgical care improvement project measures and the association with postoperative infections.  JAMA. 2010;303(24):2479-2485
PubMed
Werner RM, Bradlow ET. Relationship between Medicare's hospital compare performance measures and mortality rates.  JAMA. 2006;296(22):2694-2702
PubMed
Finfer S, Chittock DR, Su SY,  et al; NICE-SUGAR Study Investigators.  Intensive vs conventional glucose control in critically ill patients.  N Engl J Med. 2009;360(13):1283-1297
PubMed
Welch HG, Sharp SM, Gottlieb DJ, Skinner JS, Wennberg JE. Geographic variation in diagnosis frequency and risk of death among Medicare beneficiaries.  JAMA. 2011;305(11):1113-1118
PubMed
Gawande A. The cost conundrum: what a Texas town can teach us about health care. New Yorker. June 1, 2009. http://www.newyorker.com/reporting/2009/06/01/090601fa_fact_gawande. Accessed July 25, 2011
Haymart MR, Banerjee M, Stewart AK, Koenig RJ, Birkmeyer JD, Griggs JJ. Use of radioactive iodine for thyroid cancer.  JAMA. 2011;306(7):721-728
Mitchell I, Livingston EH, Chang AY,  et al.  Trends in thyroid cancer demographics and surgical therapy in the United States.  Surgery. 2007;142(6):823-828
PubMed
Carty SE, Cooper DS, Doherty GM,  et al; American Thyroid Association Surgery Working Group; American Association of Endocrine Surgeons; American Academy of Otolaryngology-Head and Neck Surgery; American Head and Neck Society.  Consensus statement on the terminology and classification of central neck dissection for thyroid cancer.  Thyroid. 2009;19(11):1153-1158
PubMed
CME Course for:


You need to register in order to view this quiz.


To understand the clinical management of acute heart failure syndromes.
Accreditation Information The American Medical Association is accredited by the Accreditation Council for Continuing Medical Education to provide continuing medical education for physicians.
The AMA designates this journal-based CME activity for a maximum of 1 AMA PRA Category 1 CreditTM per course. Physicians should claim only the credit commensurate with the extent of their participation in the activity.
Physicians who complete the CME course and score at least 80% correct on the quiz are eligible for AMA PRA Category 1 CreditTM.
Note: You must get at least of the answers correct to pass this quiz.
Note: You must get at least of the answers correct to pass this quiz.
You have not filled in all the answers to complete this quiz
The following questions were not answered:
Sorry, you have unsuccessfully completed this CME quiz with a score of
The following questions were not answered correctly:
For CME Course: A Proposed Model for Initial Assessment and Management of Acute Heart Failure Syndromes
Indicate what changes(s) you will implement in your practice, if any, based on this CME course.
To view and print your certificate and access a summary of your CME courses go to My CME.
NOTE:
Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s “Cited By” API will populate this tab (http://www.crossref.org/citedby.html).
Submit a Response

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

See Also...
Articles Related By Topic
Related Topics
PubMed Articles
JAMAevidence.com