J Am Coll Cardiol, 2002; 40:1895-1901 © 2002 by the American College of Cardiology Foundation



* Duke Clinical Research Institute, Durham, North Carolina, USA
Mayo Clinic Foundation, Rochester, Minnesota, USA
Baylor College of Medicine, Houston, Texas, USA
Kaiser-Permanente San Francisco Medical Center, San Francisco, California, USA
|| University of Virginia Health System, Charlottesville, Virginia, USA
¶ University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Supported by the Agency for Healthcare Research and Quality (AHRQ) Centers for Education and Research on Therapeutics (CERTs) cooperative agreement grant #U18HS10548.
Manuscript received June 6, 2002; revised manuscript received June 27, 2002, accepted August 9, 2002.
* Reprint requests and correspondence: Dr. Robert M. Califf, Duke Clinical Research Institute, P.O. Box 17969, Durham, North Carolina 27710, USA.
calif001{at}mc.duke.edu
Definition of quality and the role of clinical trials
Randomized trials and outcome studies have provided a basis for informed decisions about the use of medical technologies. Randomized trials cannot answer all questions, however, and many decisions in practice must be made based on an understanding of physiology, intuition, and experience when treating individuals. Questions have also arisen about the value of applying randomized trials in clinical practice, with skepticism about whether advocated measures of clinical effectiveness truly reflect a worthwhile approach to improving medical practice.
We provide a perspective on this issue by presenting a model that integrates quantitative measurements of quality and performance into the development cycle for therapeutics. Such a model could serve as a basic approach to cardiovascular medicine that is necessary, but not sufficient, for those wishing to provide the best care for their patients. These concepts have evolved largely through ongoing efforts of the American College of Cardiology (ACC) and American Heart Association (AHA) to develop clinical practice guidelines (CPGs) and performance indicators for cardiology, but the issues raised in this review pertain to all areas of medicine.
Integration of quality in the development cycle
Recommendations about diagnosis and treatment contained in a given guideline can be synthesized into algorithms, which then can be used as Quality Indicators, specifying the clinical circumstances under which to use a technology. By determining how well a provider or institution meets these quality indicators, actual Performance Measures can be assessed. The final stage in the cycle then links measured performance with the ultimate goal of healthcare, better Outcomes. For all of these elements to contribute optimally to the overall system, continuous education and feedback about findings and concepts are needed; thus, these aspects are in the center of the cycle.
As an example, suppose the preponderance of basic and clinical evidence leads a CPG to recommend that all eligible patients receive a beta-adrenergic blocking agent after acute myocardial infarction (MI). This recommendation could translate into the quality indicator "prescription for beta-blocker at discharge after MI." The corresponding performance measure then would be "proportion of eligible patients prescribed a beta-blocker at discharge after MI." Stated simply, the guideline generates a criterion (the quality indicator), and how well it is met by providers or institutions is the performance measure.
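The distinction between the quality indicator and the performance measure can be made concrete with a short calculation. The following is a minimal sketch in Python, assuming a hypothetical record layout (`Discharge`, `eligible`, `beta_blocker_rx`) and made-up patient data; it is not drawn from any registry or from the ACC/AHA specifications.

```python
from dataclasses import dataclass


@dataclass
class Discharge:
    """One hospital discharge after acute MI (hypothetical record layout)."""
    eligible: bool         # patient meets the indicator's inclusion criteria
    beta_blocker_rx: bool  # quality indicator met: beta-blocker prescribed at discharge


def performance_measure(discharges):
    """Proportion of eligible patients for whom the quality indicator was met."""
    eligible = [d for d in discharges if d.eligible]
    if not eligible:
        return None  # the measure is undefined when nobody is eligible
    return sum(d.beta_blocker_rx for d in eligible) / len(eligible)


# Illustrative data only: four eligible patients, three of them treated.
records = [
    Discharge(eligible=True, beta_blocker_rx=True),
    Discharge(eligible=True, beta_blocker_rx=True),
    Discharge(eligible=True, beta_blocker_rx=False),
    Discharge(eligible=False, beta_blocker_rx=False),  # excluded from the denominator
    Discharge(eligible=True, beta_blocker_rx=True),
]
rate = performance_measure(records)
print(f"{rate:.0%} of eligible patients were prescribed a beta-blocker")
```

Ineligible patients are left out of the denominator rather than counted as failures, which mirrors the wording "proportion of eligible patients" in the example above.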
By studying the links between and among cycle elements, we might begin to develop ideas for building a quality system. Of note, this approach deals with only one component of quality, although this quantitative component relating to outcomes may distinguish the medical environment from other elements affecting healthcare quality. Without excellence in the subjective element, these quantitative elements are moot. With this idea of giving care to the individual patient in mind, several attributes that would enhance each cycle element become evident, and many research questions can be posed.
Concepts: biological insights and the treachery of surrogates
Insight into disease mechanisms is essential to develop concepts for diagnostic and therapeutic products. Given the accelerating insights from genomics and proteomics, a wealth of biological targets seems probable. Of more immediate relevance, bioengineering progress is making devices and their combinations with drugs or biologics an increasingly routine part of medicine, as evidenced in cardiology by the advent of coated stents (6), wider indications for defibrillators (7,8), and mechanical assist devices for heart failure (9). Therapies based on genomic and proteomic technology will soon follow, and diagnostic tests will eventually use analysis of genetic variations to identify patients more or less likely to respond to given therapies.
Pressure continues to increase to develop ways to assess the efficacy and safety of new technologies. Many have advocated the use of biological "surrogates," known as biomarkers, to substitute for clinical outcomes during development. Although biomarker results should be considered when deciding which theories to pursue in trials (10,11), they provide only an entry point for medical products. Even therapies that produce substantial benefits for a respected surrogate may fail because of other safety problems (12,13). Perhaps most important, although biomarkers may identify particular benefits of therapies, they cannot reliably reflect the balance between the risks and benefits of therapies, information critical to determining their value (14).
The treachery of surrogates has caused cardiovascular specialists to require large outcomes trials as a basis for the highest level recommendations in CPGs. Arrays of biomarkers are urgently needed, however, to determine when it is reasonable to invest in such trials. Of note, single biomarkers are insufficient for this purpose. Antithrombotic drugs, for example, can affect markers of thrombin, platelet activation, and inflammation differently, precluding translation into a cohesive, quantitative estimate. Imaging methods provide a promising approach to biomarker evaluation, by integrating structure and function into a common measurement.
Clinical research: the standard of evidence
Clinical trials are preferable as the source of evidence whenever possible. From a regulatory standpoint, a definitive clinical trial must be "adequate and well controlled" and must assess the safety and efficacy of a product when used in the intended population. To be helpful in the qualitative component of quality, however, a trial also must address the issues of practicality, applicability, and effectiveness. This means that a trial should measure clinically relevant outcomes in a representative population given the treatment in practice for a clinically relevant duration. "Large, simple trials" with minimal data collection, and "harnessing" data from electronic medical records, often can accomplish this goal (15).
Empirical experience has now provided cardiovascular practitioners with general principles to consider when designing trials to inform the cycle of quantitative evidence (Table 1) (16). For example, the modest nature of most treatment effects mandates large trials, so that effects can be detected or excluded with certainty. Large trials also can and should enroll a wide variety of patients, so that quantitative and qualitative interactions can be estimated for policy reasons. Large trials also maximize the possibility of detecting unanticipated effects of therapies, alone and combined with other technologies.
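The link between modest treatment effects and large trials can be illustrated with the standard normal-approximation sample-size formula for comparing two event proportions. The sketch below is a textbook calculation, not the method of any trial cited here; the event rates, significance level, and power are assumptions chosen only for illustration.

```python
import math
from statistics import NormalDist


def patients_per_arm(p_control, p_treated, alpha=0.05, power=0.90):
    """Approximate patients needed per arm to detect p_control vs. p_treated.

    Uses the usual normal approximation for two independent proportions
    with a two-sided significance level `alpha`.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance = p_control * (1 - p_control) + p_treated * (1 - p_treated)
    effect = p_control - p_treated
    return math.ceil((z_alpha + z_beta) ** 2 * variance / effect ** 2)


# Hypothetical scenarios with a 10% event rate in the control arm.
large_effect = patients_per_arm(0.10, 0.05)    # 50% relative risk reduction
modest_effect = patients_per_arm(0.10, 0.085)  # 15% relative risk reduction
print(f"50% reduction: {large_effect} patients per arm")
print(f"15% reduction: {modest_effect} patients per arm")
```

Because the required sample size grows with the inverse square of the absolute difference in event rates, shrinking the expected effect from a 50% to a 15% relative reduction raises the required enrollment by more than an order of magnitude under these assumptions.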
Clinicians feel increasingly pressed for time, and financial constraints leave little room for altruistic efforts to engage patients in discussions about trials. Questions about the professional responsibilities of physicians (duty to individual patients) versus trialists (duty to answer questions), and the constraints of institutional review boards and consent mechanisms, also make the conduct of research in clinical practice more daunting. The new Health Insurance Portability and Accountability Act (HIPAA), which places criminal penalties on the misuse of medical information, has exacerbated these concerns. Nevertheless, cardiovascular medicine has led the way by engaging many practices in addressing important questions that cannot be answered by a separate trials infrastructure (15). Highly organized cardiology practices have been the cornerstone of successful trials (20,21), and the federal government now reimburses physicians for the routine costs of trials in patients covered by Medicare (22).
Finally, in many cases randomized trials cannot be performed because: 1) they would be impractical, 2) they would be unethical, or 3) the follow-up needed would exceed society's "willingness to wait." However, standards for observational comparisons have not evolved to the same level as for clinical trials (23). Although newer statistical techniques allow better control of treatment selection in observational research, unmeasured sources of bias will always be a concern. Thus, observational studies will continue to support randomized studies through hypothesis generation, testing (if randomized trials are impossible), and confirmation that the results of randomized trials can be generalized to other providers and patient populations.
Clinical practice guidelines: synthesis of evidence
A hierarchy of evidence forms the basis for formulation of CPGs. The highest level of evidence reflects large trials addressing the specific question of interest or several smaller trials with consistent results. The ACC/AHA guideline for unstable angina and non-ST-segment elevation MI provides a framework for considering evidence (24). As shown in Table 2, evidence can be generated in two vectors, roughly representing the quantitative and qualitative perspectives. A level of evidence of "A" for a recommendation is derived from multiple, consistent randomized trials or a single large, definitive trial. A "C" level represents expert opinion and no definitive data, and a "B" level encompasses various intermediate-quality data. In the other vector, a class I recommendation reflects consensus that the practice should be done, whereas a class III recommendation reflects consensus that the practice should be avoided. A class IIa recommendation denotes a situation in which consensus does not exist but the practice is generally reasonable, and a class IIb recommendation does not endorse the practice but does not definitively recommend against it.
In addition to committee composition, review of the guidelines before finalization is a major issue. Ideally, all of the affected constituencies should agree with the draft CPGs. When appropriate, joint ACC/AHA guidelines typically also are reviewed by the American Academy of Family Physicians, the American College of Physicians-American Society of Internal Medicine, and other major professional organizations caring for cardiovascular patients. A recent experimental guideline for atrial fibrillation also has attempted to reach consensus between the American organizations and the European Society of Cardiology (ESC) (26).
Updating the guidelines is another significant component of the process. In many areas of medicine, trials are being performed at such a pace that major findings relevant to clinical practice are common. For example, the Global Use of Strategies To Open occluded arteries (GUSTO)-IV trial of acute coronary syndromes (27) was reported within days of publication of both the ACC/AHA guidelines (24) and ESC guidelines on unstable angina and non-ST-segment MI (28). Within months of the publication of the ACC/AHA guidelines on heart failure (29), a randomized trial (9) showed a survival advantage for left ventricular assist devices, and initial data from another randomized study (7) appeared to expand the indications for implantable defibrillators. As other areas of clinical practice change, existing therapies must be reevaluated.
Most current CPGs do not emphasize costs, partly because information about the cost-effectiveness of therapies is often scant and based on models rather than empirical data. In theory, best practice is defined by the quality of evidence for the intervention rather than its price. The fact that cost does affect therapeutic choice has produced a crisis, however, as illustrated by the escalation of healthcare costs versus the increasing imperatives to implant defibrillators (7,8), coated stents (6), or left ventricular assist devices (9). These examples suggest that CPGs might need to be country-specific, depending on national resources. In the U.S., an incremental cost of $50,000 to $70,000 per year of life saved has become a de facto standard based on the national right to dialysis (30), but in many countries, even when a therapy clearly is beneficial, it simply is unaffordable given competing demands for financial resources.
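The "cost per year of life saved" benchmark is an incremental ratio: the extra cost of a therapy divided by the extra life-years it yields relative to standard care. A minimal sketch of that arithmetic follows, assuming entirely hypothetical costs and survival figures; only the $50,000 to $70,000 range comes from the text above.

```python
def incremental_cost_per_life_year(cost_new, cost_std, life_years_new, life_years_std):
    """Incremental cost divided by incremental life-years gained."""
    gained = life_years_new - life_years_std
    if gained <= 0:
        raise ValueError("new therapy adds no life-years; the ratio is not meaningful")
    return (cost_new - cost_std) / gained


# De facto U.S. range cited in the text (dollars per year of life saved).
LOWER, UPPER = 50_000, 70_000

# Hypothetical therapies: same added cost, different survival gains.
therapy_a = incremental_cost_per_life_year(60_000, 20_000, 6.0, 5.0)
therapy_b = incremental_cost_per_life_year(60_000, 20_000, 5.5, 5.0)

for name, ratio in (("A", therapy_a), ("B", therapy_b)):
    if ratio <= LOWER:
        verdict = "below the cited range"
    elif ratio <= UPPER:
        verdict = "within the cited range"
    else:
        verdict = "above the cited range"
    print(f"Therapy {name}: ${ratio:,.0f} per life-year, {verdict}")
```

The same added cost can fall on either side of the range depending on the survival gain, which is one reason scant or modeled cost-effectiveness data make such judgments uncertain.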
Quality indicators
Once a CPG has been developed, its recommendations must be translated into a series of variables, or indicators, that reflect the quality of care (or lack thereof). In an ideal clinical world, for every clinical decision there would be an indicator based on a guideline based on evidence from randomized trials, such that a standard of care could be defined for each situation. Such data exist for few clinical decisions, however. Table 3 lists the class I, level A recommendations ("almost always do it") from the ACC/AHA guidelines for unstable angina/non-ST-segment elevation MI (24) and heart failure (29). Table 4 lists the class III, level A recommendations ("never do it") from these same guidelines.
Several organizations are implementing quality indicators and performance measures for cardiovascular care. Many of the same issues that pertain to construction of CPGs also apply to development of the resulting quality indicators. Who should be on the committees and how are conflicts of interest managed? What level of evidence in a guideline should merit a quality indicator? How should those who devise guidelines react when a quality indicator is advocated that is inconsistent with existing guidelines? The ACC and AHA have a task force considering this very issue, and a report is expected very soon.
A particularly vexing problem for quality indicators emerges when attempting to define which patients qualify for a particular indicator. For example, when the Cooperative Cardiovascular Project investigators measured the use of fibrinolytic therapy in patients covered by Medicare, less than half of the patients with acute MI actually qualified for measurement after a long list of potential exclusions was applied (31). Care among such filtered subgroups may not reflect providers' general practices.
Performance measures
If the quality indicator is the variable, the performance measure is the threshold. The concept is straightforward: in its simplest form, a quality indicator reflects either a class I, level A recommendation (use beta-blockers after MI) or a class III, level A recommendation (do not perform routine angioplasty of the infarct-related artery immediately after fibrinolysis). Each encounter with a patient who meets the circumstances of such recommendations provides evidence to assess the performance of providers or systems. The proportion of eligible patients with MI who receive beta-blockers, say, would be compared against some threshold level, or performance measure, in this case, perhaps 95%.
This raises the logical question of how thresholds are set. One attractive approach has been to develop "achievable benchmarks of care" (32). Instead of attempting to define purified populations for whom process indicators should approach 100%, one simply compares a provider's performance to that seen among the top 10% of practitioners, hospitals, or practices. Thus, if leading centers can prescribe beta-blockers at discharge for 90% of their patients with MI, then this would be a reasonable and achievable performance goal for the rest of the nation. The Achievable Benchmark method provides one reasonable approach and avoids striving for unrealistic goals, which could provoke practitioners into inappropriate treatment for patients who do not meet criteria or lead them to become cynical about efforts to measure quality.
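The benchmarking idea can be sketched as follows. This is a deliberately simplified illustration that averages the rates of the best-performing 10% of providers; it is an assumption made for clarity and is not the published Achievable Benchmark method (32), whose details are not reproduced here. The provider rates are invented.

```python
def top_decile_benchmark(rates):
    """Mean performance rate among the top 10% of providers (at least one)."""
    if not rates:
        raise ValueError("need at least one provider rate")
    k = max(1, (len(rates) + 9) // 10)  # ceiling of 10% of the provider count
    best = sorted(rates, reverse=True)[:k]
    return sum(best) / k


# Hypothetical beta-blocker prescription rates for 20 hospitals.
hospital_rates = [
    0.92, 0.88, 0.85, 0.84, 0.81, 0.80, 0.78, 0.77, 0.75, 0.74,
    0.72, 0.70, 0.69, 0.66, 0.65, 0.62, 0.60, 0.58, 0.55, 0.50,
]
benchmark = top_decile_benchmark(hospital_rates)

# Gap between one (hypothetical) hospital's rate and the benchmark.
my_rate = 0.72
gap = benchmark - my_rate
print(f"benchmark {benchmark:.0%}, own rate {my_rate:.0%}, gap {gap:.0%}")
```

A benchmark derived this way is achievable by construction, because some providers already meet it, and it sidesteps the need to define a purified population for which 100% would be the right target.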
A related issue concerns the number of quality indicators. Specifically, as the number of quality indicators increases, the number of performance measures also will rise, creating a signal-to-noise issue when reviewing these results. Fortunately, process performance is likely to correlate, and centers that adhere closely to national guidelines on one measure might tend to do similarly well in other care areas. One also could develop overall "composite quality indicators" for given conditions, which combine and average provider performance on individual measures. Taking into account multiple measures per patient, composite quality indicators also increase the power for a given sample size and, thus, provide more stable estimates of performance.
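A composite quality indicator of the kind described above could be computed in more than one way. The sketch below shows two common options, a plain average of per-measure rates and a pooled ratio of all indicator "opportunities" met. The measure names and counts are hypothetical, and neither formula is claimed to be the one used by the ACC/AHA.

```python
def average_composite(measures):
    """Unweighted mean of per-measure performance rates.

    `measures` maps a measure name to (patients_meeting_indicator, eligible_patients).
    """
    rates = [met / eligible for met, eligible in measures.values() if eligible > 0]
    return sum(rates) / len(rates) if rates else None


def pooled_composite(measures):
    """All indicator opportunities met, divided by all opportunities."""
    met = sum(m for m, _ in measures.values())
    eligible = sum(e for _, e in measures.values())
    return met / eligible if eligible else None


# Hypothetical counts for one hospital's MI care.
mi_measures = {
    "beta_blocker_at_discharge": (45, 50),
    "aspirin_at_discharge": (48, 50),
    "ace_inhibitor_for_lv_dysfunction": (12, 20),
}
avg = average_composite(mi_measures)
pooled = pooled_composite(mi_measures)
print(f"average of rates: {avg:.1%}, pooled opportunities: {pooled:.1%}")
```

The pooled version draws on 120 opportunities rather than 50 or 20 per measure, which is the sense in which composites increase power for a given sample size; the plain average instead weights each measure equally regardless of how many patients were eligible.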
Given the large number of decisions that providers and systems make that are not subject to advanced levels of evidence, it is tempting to define performance in terms of higher level, complex decision-making. Indeed, most would agree that healthcare systems must be able to provide integrated care for patients with multiple comorbidities requiring complicated procedures and regimens to be considered "high-quality" systems (i.e., excellent performance). The argument is equally strong, however, that practitioners and systems who do not adhere to relatively simple guidelines based on clear evidence cannot claim to have even basal levels of quality (i.e., poor performance).
In developing performance measures, it can be useful to focus on the specific environment (microenvironment) for healthcare delivery (31,33). The inpatient cardiovascular arena, for example, contains the various microenvironments of the emergency department; cardiac care unit; inpatient service; cardiac catheterization, interventional, and electrophysiology laboratories; surgical suite and postoperative intensive care unit; and noninvasive testing and imaging systems. The cardiovascular specialist's and primary care practitioner's microenvironments include the practice structure, office staff, nonphysician practitioners, and systems. The outpatient arena is even more vast, encompassing the patient's home, workplace, various healthcare providers, social outlets, religious community, and insurance coverage. By measuring actual outcomes in populations, deviations from expected results should spur scientific insights, refined trial designs, and development of more appropriate quality indicators and performance measures for various settings.
Outcomes
From the perspective of the quality cycle, the ultimate goal is the best possible outcomes. Fortunately for cardiovascular practitioners, there is consensus about the outcome domains that are generally most important to the field, and these have been validated in many trials and observational studies. For most cardiovascular problems, survival, freedom from major cardiovascular events (stroke, MI, major arrhythmias, heart failure), and improved symptoms are the cornerstones of outcomes measurement. Much research also has focused on measurement of functional outcomes and quality of life in cardiovascular patients. Ideally, outcome measures would assess both the acute success of an episode of medical care and its long-term effects.
Outcomes measurement also faces particular challenges, however. These include how to adjust for patient risk (disease severity, comorbidity, and educational and financial status) when comparing outcomes among providers and the instability of outcome measures at the provider level. Because of these limitations, the quality-assessment field has moved from direct measurement of outcomes to measurement of performance in most situations. As mentioned, because performance measures are essentially surrogates, the quality cycle calls for studies of the broader measurement of outcome as a function of performance in populations of patients or practitioners, to validate that the performance measures are important and that greater adherence to them improves outcome.
Recommendations
Clinical research networks and practice databases provide a convenient mechanism to tie together the quality cycle (Fig. 1). After a concept has been developed and undergone basic testing, a network could conduct clinical trials and measure incorporation of the findings (in the form of recommendations) into practice. Multiple practice registries can provide feedback about performance for individual practices while also validating the relation between greater adherence to guidelines (in the form of performance measures) and improved patient outcomes in the registry as a whole. In the process, new concepts would be developed that can then be tested, beginning the cycle anew. Finally, education and feedback, though insufficient by themselves to improve processes and outcomes, remain a necessary foundation for all elements of the cycle.