0
Editorial |

The Trials and Tribulations of Clinical Practice Guidelines

Deborah Cook, MD, MSc; Mita Giacomini, PhD
JAMA. 1999;281(20):1950-1951. doi:10.1001/jama.281.20.1950
Text Size: A A A
Published online

Reports critiquing the validity of randomized trials,1 meta-analyses,2 diagnostic test studies,3 and economic evaluations4 have challenged researchers to improve the conduct of their studies and have encouraged readers to interpret them carefully. The study by Shaneyfelt and colleagues5 in this issue of THE JOURNAL tackles the clinical practice guideline industry, asking fundamental questions about how well they measure up to contemporary standards. Most guidelines outlined their rationale, specified the benefits and harms of health practices, and cited their evidentiary sources. However, guidelines much less often described how the evidence was found, how its worth was judged, and how diverse sources of evidence were combined to formulate recommendations.

The metric against which clinicians might judge guidelines is complex and multidimensional. Guideline creators, consumers, and critics will recognize the challenge of developing a credible instrument for this purpose. Shaneyfelt et al have captured key features of guideline development and reporting in 25 items generated by both literature review and dialogue with representatives from organizations that develop, implement, evaluate, or use guidelines. These items were piloted at workshops and pretested by the authors, and were endorsed as comprehensive and valid by guideline experts. The final items discriminated between poorly and well-developed guidelines that were ranked by an undisclosed reference standard. Although more sensitive scaling and reliability testing would further increase our confidence in this instrument, the majority of items it encompasses are those that would be important to most guideline users.

Discerning guideline readers want to determine whether appropriate methods were used to adduce the research evidence and whether the evidence supports the recommendations. However, users cannot help but notice that guideline developers must often reckon with research that is modest in rigor, discordant, or nonexistent. Although most guidelines are an amalgam of evidence and expert opinion, methods of integrating knowledge and experience into guidelines, particularly when data are sparse, are neither as mature nor as transparent as methods of incorporating research results. Only 7.5% of 279 guidelines evaluated by Shaneyfelt et al described formal methods to combine evidence and expert opinion. One exemplary exception to this generalization is a guideline on the diagnosis and treatment of idiopathic thrombocytopenic purpura from the American Society of Hematology.6 This guideline describes the dearth of literature, acknowledges the limitations of opinion-based directives, then issues pragmatic recommendations based on both appropriateness scores and consensus ratings.

The broad mandate of most guidelines ensures that guideline documents tend to be longer and less formulaic than other articles. Shaneyfelt and colleagues found that longer guidelines adhered to more standards than shorter guidelines. At the same time, lengthy guidelines are often ignored in practice. Thus, developers are caught in a fix, since surveys reveal that clinicians prefer pocket cards, concise pamphlets, and journal article summaries7 to more discursive formats. Because guideline appraisal is invariably limited by the format in which guidelines are expressed, more attention to reporting standards is sorely needed, akin to those proposed for randomized trials,8 meta-analyses,9 and economic evaluations.10

Clinical guidelines appear to be more palatable if they have the imprimatur of a professional organization.11 Nevertheless, most guidelines require adaptation to the local environment. Different policies (eg, universal health care or 2-tier medicine) also may influence guideline sanctioning (eg, predominant use of streptokinase vs tissue plasminogen activator for acute myocardial infarction). Thus, homegrown guidelines ultimately may have greater currency than guidelines that cannot be applied in different settings. Portability, or "flexibility of the clinical recommendations," was addressed by the majority of guidelines in this study. Guideline compendia that provide a menu of preventive care aids12 may allow clinicians to choose the best and most relevant among them.

Perhaps the greatest challenge to guideline developers is making explicit the values used in formulating recommendations, which Shaneyfelt and colleagues found in 6.1% of the guidelines they evaluated. Relevant questions about values include the following: from whose perspective are the outcomes considered—patients, practitioners, administrators, society, or a combination of these? How are competing values arbitrated? Although the goal of guidelines to reduce inappropriate expenditures may be laudable, this goal may be viewed as nefarious if "inappropriate" is supplanted with "unwanted" from a nonclinical perspective. While most clinicians value optimization of individual health outcomes, delivering the most cost-effective care for groups of patients via practice guidelines may require very different management strategies. Furthermore, guidelines designed to promote cost-effectiveness at the patient level may not maximize cost-effectiveness at the population level.13

Given the sensitivity of cost measures to different health care environments and different economic perspectives, 2 items in this instrument are particularly ambitious. These are specification of health care costs and quantification of these costs. Only when more rigorous economic evaluations become available will financial forecasts immured in practice guidelines be more credible. Until then, inclusion of costs or cost-effectiveness is a tall order for most practice guidelines.

While it is important to incorporate cost and benefit data into guidelines when possible, it is even more important to obtain empirical data on the clinical and economic sequelae of guideline implementation in the target setting. Systematic reviews of guideline implementation have found significant improvements in the processes of care in 55 of 59 studies14 and significant improvement in patient outcomes in 12 of 17 studies15 ; however, costs have not been summarized as carefully. Thus, a key additional item that should be considered for the checklist used in this study is whether the impact of the guideline has been evaluated in practice. Few guidelines fulfill this criteria before they are disseminated, but an increasing number of publications report the influence of guidelines on clinical and economic outcomes. For example, Hay and colleagues16 developed, implemented, and evaluated an evidence-based practice guideline for patients with acute upper gastrointestinal tract hemorrhage and found a reduced hospital stay, yet no difference in rebleeding, readmission, or mortality rates.

No guideline in the study by Shaneyfelt et al met all 25 criteria. However, clinicians should heed the universal caveat that "guidelines are only guidelines," intended to inform, rather than to tell what to do with pious certitude. Contrast guidelines on (1) the pharmacologic management of myocardial infarction with (2) hormone replacement therapy for postmenopausal women and (3) withdrawal of life-support decisions. The first guideline can be evidence-based, quantitative, and directive; the second may summarize risks and benefits of alternatives with context-specific provisos; and the third document may communicate only "guiding principles," since such decisions are quintessentially personal and value-specific. Thus, assessment of whether it is legitimate for a guideline to deviate from these standards requires more intimate familiarity with each guideline, with the intentions of the developers, and with the raw data used to generate the summary conclusions.

Possible explanations for the sobering findings of the study by Shaneyfelt et al include dichotomous ratings that do not grant partial credit, the fact that guideline developers do not agree on methods perceived by others to be optimal, and that optimal methods are too onerous or costly for the real world. This instrument may not capture well other sources of wisdom besides research evidence that make guidelines innately meritorious or clinically useful. Finally, this study was not designed to measure additional influences on guideline documents such as scientific and financial motives and editorial decisions.

The study by Shaneyfelt and colleagues summarizes the trials and tribulations of modern clinical practice guidelines. It exposes the diversity of guideline methodologies, raises fundamental questions about whether the proposed "quality" criteria are necessary or equally appropriate for all guideline topics, and is a clarion call for greater transparency of guideline reporting and more rigorous peer review. Careful evaluation of the real consequences of both developing and implementing practice guidelines is a health services research priority for the new millennium.

REFERENCES

Schulz KF, Chalmers I, Grimes DA, Altman DG. Assessing the quality of randomization from reports of controlled trials published in obstetrics and gynecology journals.  JAMA.1994;272:125-128.
Sacks HS, Berrier J, Reitman D, Ancona-Berk VA, Chalmers TC. Meta-analysis of randomized controlled trials.  N Engl J Med.1987;316:450-455.
Heffner JE, Feinstein D, Barbieri C. Methodologic standards for diagnostic test research in pulmonary medicine.  Chest.1998;114:877-885.
Heyland DK, Gafni A, Kernerman P, Cook DJ. Economic evaluation of the critical care literature: do they help us improve the efficiency of our unit?  Crit Care Med.1996;24:1591-1598.
Shaneyfelt TM, Mayo-Smith MF, Rothwangl J. Are guidelines following guidelines? the methodological quality of clinical practice guidelines in the peer-reviewed medical literature.  JAMA.1999;281:1900-1905.
American Society of Hematology ITP Practice Guideline Panel.  Diagnosis and treatment of idiopathic thrombocytopenic purpura: recommendations from the American Society of Hematology.  Ann Intern Med.1997;126:319-326.
Hayward RSA, Guyatt GH, Moore KA, McKibbon A, Carter AO. Canadian physicians' attitudes about and preferences regarding clinical practice guidelines.  CMAJ.1997;156:1715-1723.
Begg C, Cho M, Eastwood S.  et al.  Improving the quality of reporting of randomized controlled trials: the CONSORT statement.  JAMA.1996;276:637-639.
Cook DJ, Sackett DL, Spitzer WO. Methodological guidelines for systematic reviews of randomized control trials in health care from the Potsdam Consultation on Meta-analysis.  J Clin Epidemiol.1995;48:167-171.
Drummond MF, Jefferson TO.on behalf of the BMJ Economic Evaluation Working Party.  Guidelines for authors and peer reviewers of economic submissions to the BMJ.  BMJ.1996;313:275-283.
Tunis SR, Hayward RSA, Wilson MC.  et al.  Internists' attitudes about clinical practice guidelines.  Ann Intern Med.1994;120:956-963.
Weingarten S. Using practice guideline compendiums to provide better preventive care.  Ann Intern Med.1999;130:454-458.
Granata AV, Hillman AL. Competing practice guidelines: using cost-effectiveness analysis to make optimal decisions.  Ann Intern Med.1998;128:56-63.
Grimshaw JM, Russell IT. Effect of clinical guidelines on medical practice: a systematic review of rigorous evaluations.  Lancet.1993;342:1317-1322.
Grimshaw JM, Freemantle N, Wallace S.  et al.  Developing and implementing clinical practice guidelines.  Qual Health Care.1995;4:55-64.
Hay JA, Maldonado L, Weingarten SR, Ellrodt AG. Prospective evaluation of a clinical guideline recommending hospital length of stay in upper gastrointestinal hemorrhage.  JAMA.1997;278:2151-2156.

First Page Preview

First page PDF preview

Figures

Tables

Interactive Graphics

Video

Country-Specific Mortality and Growth Failure in Infancy and Yound Children and Association With Material Stature

Use interactive graphics and maps to view and sort country-specific infant and early dhildhood mortality and growth failure data and their association with maternal

Schulz KF, Chalmers I, Grimes DA, Altman DG. Assessing the quality of randomization from reports of controlled trials published in obstetrics and gynecology journals.  JAMA.1994;272:125-128.
Sacks HS, Berrier J, Reitman D, Ancona-Berk VA, Chalmers TC. Meta-analysis of randomized controlled trials.  N Engl J Med.1987;316:450-455.
Heffner JE, Feinstein D, Barbieri C. Methodologic standards for diagnostic test research in pulmonary medicine.  Chest.1998;114:877-885.
Heyland DK, Gafni A, Kernerman P, Cook DJ. Economic evaluation of the critical care literature: do they help us improve the efficiency of our unit?  Crit Care Med.1996;24:1591-1598.
Shaneyfelt TM, Mayo-Smith MF, Rothwangl J. Are guidelines following guidelines? the methodological quality of clinical practice guidelines in the peer-reviewed medical literature.  JAMA.1999;281:1900-1905.
American Society of Hematology ITP Practice Guideline Panel.  Diagnosis and treatment of idiopathic thrombocytopenic purpura: recommendations from the American Society of Hematology.  Ann Intern Med.1997;126:319-326.
Hayward RSA, Guyatt GH, Moore KA, McKibbon A, Carter AO. Canadian physicians' attitudes about and preferences regarding clinical practice guidelines.  CMAJ.1997;156:1715-1723.
Begg C, Cho M, Eastwood S.  et al.  Improving the quality of reporting of randomized controlled trials: the CONSORT statement.  JAMA.1996;276:637-639.
Cook DJ, Sackett DL, Spitzer WO. Methodological guidelines for systematic reviews of randomized control trials in health care from the Potsdam Consultation on Meta-analysis.  J Clin Epidemiol.1995;48:167-171.
Drummond MF, Jefferson TO.on behalf of the BMJ Economic Evaluation Working Party.  Guidelines for authors and peer reviewers of economic submissions to the BMJ.  BMJ.1996;313:275-283.
Tunis SR, Hayward RSA, Wilson MC.  et al.  Internists' attitudes about clinical practice guidelines.  Ann Intern Med.1994;120:956-963.
Weingarten S. Using practice guideline compendiums to provide better preventive care.  Ann Intern Med.1999;130:454-458.
Granata AV, Hillman AL. Competing practice guidelines: using cost-effectiveness analysis to make optimal decisions.  Ann Intern Med.1998;128:56-63.
Grimshaw JM, Russell IT. Effect of clinical guidelines on medical practice: a systematic review of rigorous evaluations.  Lancet.1993;342:1317-1322.
Grimshaw JM, Freemantle N, Wallace S.  et al.  Developing and implementing clinical practice guidelines.  Qual Health Care.1995;4:55-64.
Hay JA, Maldonado L, Weingarten SR, Ellrodt AG. Prospective evaluation of a clinical guideline recommending hospital length of stay in upper gastrointestinal hemorrhage.  JAMA.1997;278:2151-2156.
CME Course for:


You need to register in order to view this quiz.


To understand the clinical management of acute heart failure syndromes.
Accreditation Information The American Medical Association is accredited by the Accreditation Council for Continuing Medical Education to provide continuing medical education for physicians.
The AMA designates this journal-based CME activity for a maximum of 1 AMA PRA Category 1 CreditTM per course. Physicians should claim only the credit commensurate with the extent of their participation in the activity.
Physicians who complete the CME course and score at least 80% correct on the quiz are eligible for AMA PRA Category 1 CreditTM.
Note: You must get at least of the answers correct to pass this quiz.
Note: You must get at least of the answers correct to pass this quiz.
You have not filled in all the answers to complete this quiz
The following questions were not answered:
Sorry, you have unsuccessfully completed this CME quiz with a score of
The following questions were not answered correctly:
For CME Course: A Proposed Model for Initial Assessment and Management of Acute Heart Failure Syndromes
Indicate what changes(s) you will implement in your practice, if any, based on this CME course.
To view and print your certificate and access a summary of your CME courses go to My CME.
NOTE:
Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s “Cited By” API will populate this tab (http://www.crossref.org/citedby.html).
Submit a Comment

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Articles Related By Topic
Related Topics
PubMed Articles
Management of group B streptococcal bacteriuria in pregnancy.
Journal of obstetrics and gynaecology Canada : JOGC = Journal d'obstétrique et gynécologie du Canada : JOGC. 2012 May