CLINICAL STUDY: BYPASS SURGERY
Challenges in comparing risk-adjusted bypass surgery mortality results
Results from the Cooperative Cardiovascular Project
Eric D. Peterson, MD, MPH, FACC*,
Elizabeth R. DeLong, PhD*,
Lawrence H. Muhlbaier, PhD*,
Allison B. Rosen, MD, MPH*,
Hope E. Buell, MS*,
Catarina I. Kiefe, MD, PhD and
Timothy F. Kresowik, MD, MPH
* The Duke Outcomes Research and Assessment Group, Duke University Medical Center, Durham, North Carolina, USA
The Alabama Quality Assurance Foundation, and the University of Alabama at Birmingham Center for Outcomes and Effectiveness Research and Education, Birmingham, Alabama, USA
The Iowa Foundation for Medical Care, West Des Moines, Iowa, USA
Manuscript received July 26, 1999;
revised manuscript received June 1, 2000,
accepted July 14, 2000.
Reprint requests and correspondence: Dr. Eric D. Peterson, Box 3236, Duke University Medical Center, Durham, North Carolina 27710
 |
Abstract
|
|---|
OBJECTIVES
We sought to evaluate the predictive accuracy of four bypass surgery mortality clinical risk models and to examine the extent to which hospitals risk-adjusted surgical outcomes vary depending on which risk-adjustment method is applied.
BACKGROUND
Cardiovascular "report cards" often compare risk-adjusted surgical outcomes; however, it is unclear to what extent the risk-adjustment process itself may affect these metrics.
METHODS
As part of the Cooperative Cardiovascular Projects Pilot Revascularization Study, we compared the predictive accuracy of four bypass clinical risk models among 3,654 Medicare patients undergoing surgery at 28 hospitals in Alabama and Iowa. We also compared the agreement in hospital-level risk-adjusted bypass outcome performance ratings depending on which of the four risk models was applied.
RESULTS
Although the four risk models had similar discriminatory abilities (C-index, 0.71 to 0.74), certain models tended to overpredict mortality in higher-risk patients. There was high correlation between a hospitals risk-adjusted mortality rates regardless of which of the four models was used (correlation between risk-adjusted rating, 0.93 to 0.97). In contrast, there was limited agreement in which hospitals were identified as "performance outliers" depending on which risk-adjustment model was used and how outlier status was defined.
CONCLUSIONS
A hospitals risk-adjusted bypass surgery mortality rating, relative to its peers, was consistent regardless of the risk-adjustment model applied, supporting their use as a means of provider performance feedback. Designation of performance outliers, however, can vary significantly depending on the benchmark and methods used for this determination.
|
Abbreviations and Acronyms
| | CABG | = coronary artery bypass surgery | | CCP | = Cooperative Cardiovascular Project | | O/E | = ratio of observed mortality to expected mortality | | RS | = risk score |
|
As early as the nineteenth century, Florence Nightingale recognized the value of comparing hospital mortality rates as a means of assessing quality of care (1). Since then, others have reinforced the importance of providing caregivers with outcomes feedback as a necessary step toward continual quality improvement (26). Although comparing patient outcomes is important, it is clear that these results need to be adjusted for potential differences in type, or "case-mix" of patients cared for by various caregivers. To allow for such comparisons on a leveled playing field, researchers use a statistical approach known as risk-adjustment (7,8). One common risk-adjustment mechanism uses a statistical model that adjusts for individual patient risk factors while predicting the event of interest. With such a "risk-prediction" model, one can calculate a providers expected clinical event rate (based on their patients summated estimated risk) and compare this expected rate with observed results.
Many of the prototypic provider-level comparisons of risk-adjusted outcomes have examined mortality rates following coronary artery bypass surgery. New York State and Pennsylvania routinely compare and publish hospital- and surgeon-specific risk-adjusted bypass surgery mortality results as a means of increasing consumer awareness (913). Other voluntary groups of health-care providers internally share bypass surgery mortality data as a means of "benchmarking" outcomes performance results across centers and promoting quality improvement efforts (1420).
Commonly, bypass surgery outcomes performance measures are "risk-adjusted" using one of several published surgical mortality models (9,2123). These surgical models were developed in separate patient populations with significantly different event rates (Table 1). In part, because of these differences, individual risk factors and the "weighting" of these factors vary among models (Appendix 1). To date, few have attempted to assess and compare the predictive accuracy of these models when applied outside of the database in which they were developed (2428). Furthermore, the impact of different risk-adjustment models on a providers bypass surgery performance rating has not been assessed. If a providers performance rating shifted from superior to inferior depending on which risk-adjustment model was used, then the face validity of the risk-adjustment process would be in question. Lacking this information, clinicians have generally been skeptical of the risk-adjusted outcomes profiling efforts (2931).
We evaluated the predictive accuracy of four commonly used bypass surgery-specific risk-adjustment tools in a large, community-based elderly population. We also examined the extent to which a hospital-level risk-adjusted surgical outcome rating varied depending on which risk-adjustment model was applied. We then repeated the process above after the risk models were adjusted (recalibrated) to reflect the mortality rates in our elderly study population. Finally, we assessed whether "outlier hospitals" (providers identified as having significantly superior or inferior outcomes) changed depending on which risk model was used and how outlier performance was defined.
 |
Methods
|
|---|
Bypass surgery risk models.
We considered four nonproprietary models that estimated short-term mortality risk following bypass surgery or open heart surgery. These four models will be referred to in this article by their first authors last name, including the Parsonnet, OConnor, Higgins, and Hannan models (9,2123). The Parsonnet model was developed on 3,500 patients undergoing coronary bypass and/or valve surgery in New Jersey between 1982 and 1987 (21). The OConnor model was developed on data from 3,055 patients receiving isolated bypass surgery procedures at five northern New England hospitals between 1987 and 1989 (22). The Higgins model developed a risk prediction model using data from 5,051 patients undergoing bypass surgery at the Cleveland Clinic between 1986 and 1988 (23). Hannan and colleagues developed a bypass surgery risk model for New York State using a population of 57,187 patients operated on between 1989 and 1992 (9). The clinical risk factors included in each model are displayed in Appendix 1.
Patient population.
The Cooperative Cardiovascular Project (CCP) Pilot Revascularization Study was a joint quality improvement effort between the Health Care Financing Administration, state peer review organizations, and several national medical societies (including the American Medical Association, American College of Physicians, American College of Cardiology and the American Academy of Family Physicians) (32,33). The study population included all patients aged 65 years or older covered by Medicare who underwent isolated bypass surgery procedures in Alabama and Iowa between June 1, 1992, and February 28, 1993. To avoid double counting patients, those who underwent more than one bypass surgery procedure during the study period were included only once as defined by their initial procedure. We also excluded those who received a procedure at an institution that performed in total less than 50 Medicare surgical procedures during the study period. Each of the models was developed using logistic regression, in which the risk of mortality for a patient with a vector X of risk factors is given as
 | Here ß is a vector of coefficients associated with the risk factors and the linear combination ß is called the risk score (RS).
Data collection and mortality end points.
Patients were identified using Medicare claims data (ICD-CM Codes 36.1019), and the medical records of eligible patients were reviewed by trained nurse clinicians. Detailed clinical and demographic data were collected via chart abstraction using standardized definitions. This abstraction tool was designed prospectively to contain the main data elements used in published surgical mortality prediction models. The CCP data definitions were matched to the extent possible to those used in the prior model populations. In-hospital mortality rate was chosen as the end point of interest as most community hospitals lack the ability to track postdischarge events. Of note, while the Parsonnet and Higgins models were initially developed to predict the risk of mortality within 30 days of surgery, their predictive accuracy was higher for predicting in-hospital mortality events when tested in CCP data.
Analysis: model validation.
We used two standard measures of a models performance: discrimination and calibration. Discrimination is the ability of a mortality model to correctly distinguish those patients who will die from those who will survive. An overall measure of model discrimination can be summarized by its area under a receiver operator characteristic (ROC) curve or C-index (34,35). A models C-index can range from 0.5 (no predictive ability) to 1.0 (perfect predictive accuracy). A second measure, calibration, examines how closely the models predicted mortality rates match observed mortality rates for various risk groups of patients (36). To assess this, patients were rank-ordered by their predicted mortality. Patients were then grouped into five similarly sized risk groups and the average expected mortality rate for each group was compared with that actually observed.
If a given model is not well calibrated (i.e., it significantly underpredicts or overpredicts mortality) when applied in a new population, one can consider recalibrating the model. There are several mechanisms for implementing such prevalence corrections (37). We used logistic regression to fit an intercept term ( ) and a multiplier term (ß*) to the original risk score (RS). The statistical formulation for this secondary logistic regression model can be summarized as follows: In [p*/1 p*)] = * + ß* RS, where p* is the revised predicted probability for mortality, RS is the original risk score and * and ß* are estimated when the model is applied in the current population.
Hospital-level risk-adjusted outcomes measures.
We calculated an "expected mortality rate" for each of the 28 hospitals by aggregating their patients individual estimated mortality risk and dividing by the total number of patients treated at that hospital. We then calculated a hospitals ratio of observed mortality rate to its expected (O/E). Hospitals with O/E ratios <1 were institutions with lower (better) observed bypass surgery mortality than predicted. Conversely, hospitals with O/E ratios >1 reflected higher mortality than predicted. We repeated this process using each of the four risk adjustment models, producing four O/E ratios for each hospital. We also repeated this process after each of the original risk models was revised and recalibrated in the CCP patient population. Confidence intervals surrounding hospitals O/E ratios were computed based on the normal approximation to the binomial distribution.
We also created risk-adjusted mortality rates from each model by multiplying the O/E ratios by the overall average CCP mortality rate (9). The correlation between each of these four risk-adjusted hospital mortality rates and the hospitals unadjusted mortality rate was assessed graphically and using Spearman correlation coefficients. A hospital was considered to have outlier performance if the 95% confidence interval around its O/E ratio excluded 1.0. As an alternative method for identifying outliers, we estimated the individual effect of hospital performance on outcome using a random effects regression model (37). With this, a "shrunken estimate" of a providers influence on outcome is determined relative to its peers and after adjusting for underlying risk (38).
 |
Results
|
|---|
The CCP revascularization database included 4,152 Medicare patients undergoing isolated bypass surgery at 32 hospitals in Alabama and Iowa between June 1, 1992 and February 28, 1993. From this cohort, we excluded 390 patients who were <65 years old. We also excluded 108 patients who received bypass surgery at any of the four institutions that performed fewer than 50 surgical procedures on Medicare patients during the study period. Thus, the final CCP analysis cohort consisted of 3,654 bypass surgery patients from 28 separate institutions. The mean and median number of Medicare bypass patients per hospital during this nine month period was 132 and 124, respectively. Given that patients aged 65 or older make up approximately half of an average hospitals case volume, the estimated mean yearly surgical volumes for all-aged patients at these hospitals would be 352 cases.
Baseline characteristics.
Baseline clinical characteristics for CCP patients were compared with those from the development populations for the Parsonnet, OConnor, Higgins and Hannan risk-adjustment models (Table 1). The CCP cohort contained older patients, more women, a higher percentage of those undergoing prior revascularization procedures and procedures under emergent conditions. Rates of most comorbid illnesses were similar across the four cohorts, as was the severity of underlying coronary stenoses, frequency of significant left main stenosis and degree of left ventricular dysfunction. The overall observed surgical mortality rates in these cohorts varied from 2.5% in the Higgins et al. (23) study to 8.9% in the Parsonnet et al. (21) study.
Model performance.
The discrimination abilities for each of the external bypass surgery models is displayed in Figure 1. The area under the ROC curve or C-index for these models was 0.72 for the Parsonnet model, 0.71 for Higgins, 0.72 for OConnor and 0.74 for Hannan. For comparison, the C-indexes for these models in their original populations were 0.74 for Higgins, 0.74 for the OConnor model, and 0.79 for Hannan (note: Parsonnets C-index was not published).

View larger version (17K):
[in this window]
[in a new window]
|
Figure 1 This figure the ROC curves for the four bypass surgery risk models. The C-index is equivalent to the area under each ROC curve.
|
|
Figure 2A demonstrates how well calibrated each original model was when applied in the CCP patient population. This figure displays observed versus expected in-hospital bypass surgery mortality results for each of the models by quintiles of patient risk. The diagonal line in this figure represents perfect agreement. The predicted mortality rates based on the Hannan and OConnor models were quite close to those actually observed for nearly all risk groups. For example, using the Hannan model, the lowest and highest risk groups had predicted versus observed mortality rates of (1.4% vs. 1.2%) and (13% vs. 14%), respectively. In contrast, the Parsonnet and Higgins models consistently overpredicted mortality rates, particularly in higher risk patients. Among the highest risk group, the Parsonnet model predicted mortality rate was nearly twice that actually observed (23% vs. 13%).

View larger version (14K):
[in this window]
[in a new window]
|
Figure 2 A, The observed to expected mortality rates for each quintile of patient risk. Each risk quintile contains approximately 750 patients. The diagonal line represents perfect agreement between observed and expected mortality estimates. B, The same information after the models have been internally recalibrated in the CCP database.
|
|
Figure 2B displays these same risk estimates after the models were individually internally recalibrated within the CCP population (see Methods section). After recalibration, each of the models was better able to accurately estimate surgical mortality rates across a wide range of patient risk categories.
Risk-adjusted outcomes.
Figure 3A displays the rank ordering of the 28 hospitals by their unadjusted bypass mortality rates (dash) versus their risk-adjusted mortality rates based on the Parsonnet (square) and Hannan risk models (circle). As noted, when certain risk models (e.g., Parsonnet) are used, the majority of hospitals "risk-adjusted" mortality rates appeared lower than those actually observed. However, the hospitals relative performance (compared with peers) were generally consistent regardless of which risk-adjustment model was used. This consistency between relative risk-adjusted hospital performance results becomes even more marked after the models are internally recalibrated (Figure 3B).

View larger version (16K):
[in this window]
[in a new window]
|
Figure 3 A, Each hospitals unadjusted mortality rates and their risk-adjusted mortality using the Parsonnet and Hannan risk models. Note: the 28 Hospitals are ordered on the x-axis by the unadjusted mortality rate. B, This same information after the Parsonnet and Hannan models have been internally recalibrated in the CCP database.
|
|
Table 2 displays the formal association between the various risk-adjusted mortality rates. Hospital risk-adjusted mortality rates using any of the four models were highly correlated, with Spearman correlation coefficients ranging from 0.96 for Higgins-Hannan comparison to 0.99 for Parsonnet-OConnor comparison (Table 2). Additionally, the correlation between any two risk-adjusted mortality rates was consistently greater than the correlation between these risk-adjusted mortality rates and unadjusted mortality outcomes.
Hospital outlier status.
Besides comparing relative performance, hospital-specific risk-adjusted outcomes are often used to identify "superior or inferior performers." Outlier performance, however, can be assessed by different metrics. Table 3 displays those hospitals for which the observed bypass mortality rates were significantly higher or lower than those predicted by each of the risk models (i.e., 95% confidence intervals for an O/E ratio excluding 1.0). Using this performance measure, the original Parsonnet model identified 10 significantly superior hospitals, but no hospitals as inferior performers. In contrast, the original Hannan model identified only one significantly superior hospital and four inferior hospitals. Thus, complete agreement on outlier status using this method occurred in only one of the 28 hospitals (ID No. 1), with this identifying a superior performer. Table 4 displays similar information, but now based on their risk-adjusted mortality using the internally recalibrated risk models. After recalibration of the models, agreement in outlier status was generally consistent.
As a final means, Table 5 displays which hospitals bypass performance was deemed significantly better or worse than its peers when assessed by a random-effects logistic model. This more conservative statistical method identified few high or low outliers regardless of whether or which risk-adjustment was used. However, using this method, there was complete agreement that one hospital (ID No. 28) had significantly worse bypass outcomes by all five methods.
 |
Discussion
|
|---|
The era of "scorecard medicine," in which provider-specific procedure outcomes results are openly compared, is here (39). State peer review boards, insurers, corporate employers and patients are all requesting this information as a means of assessing and comparing health-care quality (40). Whether clinicians agree or not with the basic tenets of provider comparisons, nearly all agree that, if outcomes are to be compared, it should be done only after appropriately risk-adjusting the results (41,42). We studied four risk models that were specifically developed to predict procedural mortality following bypass surgery. We found that these models retained most of their discriminatory ability when applied in a community-based, elderly patient population. Most importantly, we found that a hospitals risk-adjusted outcomes relative to its peers was remarkably consistent, regardless of the risk model used. However, we found that model calibration varied and may markedly affect which hospitals were deemed superior or inferior performers.
While many bypass surgery risk models have been published, their comparative predictive accuracy outside of the databases in which they were developed has been rare (2428). Iezzoni and colleagues (28) tested the predictive accuracy of five generic clinical and administrative severity of illness measures when applied in a population of bypass surgery patients. They found that the discrimination abilities of these generic risk tools were similar (C-index for the two clinical models 0.72 to 0.73 and 0.77 to 0.83 for the three administrative systems). The paradoxical better performance of the claims-based models over clinical-based ones was accounted for by the fact that administrative models often included postoperative complication data (e.g., cardiac arrest or heart failure) as preoperative risk predictors. Similar to our results, the authors also found substantial agreement in relative risk-adjusted hospital mortality rates, regardless of which risk model was applied (27).
In another study, Orr and colleagues (26) tested the predictive accuracy of four bypass-specific clinical risk models among 868 bypass patients in a single institution. Consistent with our findings, these authors also found that the discrimination abilities of the risk models in their hospital ranged from C-index 0.70 to 0.74. They also found that the Parsonnet model significantly overpredicted mortality while the Hannan model significantly underpredicted mortality rates. As this was a single-institution study, the authors were unable to examine the impact of the risk models on comparative provider performance.
Our study expands on this work by examining the impact of bypass surgery-specific risk models in a large multi-institutional study. Additionally, our elderly patient population provided a more stringent test of the models predictive accuracy as their risk profiles differed significantly from the patient samples used to originally create the models (Table 1). Despite this, we found that the discrimination ability of all four models was generally well preserved when applied in our higher risk elderly patients. Model calibration, however, was an issue for the Parsonnet and Higgins models, in which expected mortality differed significantly from observed, particularly in the high risk patient subgroups (Fig. 2). In contrast to the work of Orr et al. (26), in our data, expected mortality rates generated by the Hannan model tended to match closely those observed in all patient risk subgroups.
Impact of risk adjustment method on hospital performance measures.
There are numerous reasons for comparing risk-adjusted hospital outcomes data. First, a physician, payor or patient may want a general sense of how their hospitals bypass surgery outcomes compare with community peers. In this context, our data suggest that the application of various risk-adjustment models will result in similar hospital-level relative performance measures. In other words, if a hospital was generally a good performer (relative to its peers) using one risk-adjustment model, then it was likely to be a good performer no matter which risk-adjustment model was used. These conclusions are also consistent with the findings from other similar studies (27,37) and should decrease clinicians concerns that their performance outcomes are somehow an artifact of the specific risk-adjustment method applied.
It should be emphasized, however, that these results do not imply that risk-adjustment is unnecessary for outcomes comparisons. In fact, we found a much stronger correlation between any two risk-adjusted outcomes methods than any risk-adjustment method and unadjusted data, indicating that some form of risk-adjustment is required for appropriate comparison. Additionally, if comparisons are to be made among hospitals, the same risk-adjustment model should be applied to all centers. For example, it would be inappropriate to compare one hospitals risk-adjusted mortality rate based on Parsonnet with another centers result based on Hannan.
Beyond relative performance evaluation, risk-adjusted outcomes data are often used to identify the "good and bad apples" (e.g., those institutions with exceptional performance) (8). Often a hospitals risk-adjusted performance may be compared against some standard or benchmark (43,44). One of the most common methods employs a strictly external risk-adjustment model to compare O/E mortality ratios. If this method is used, the hospitals are actually being compared with an external performance benchmark (i.e., that observed in hospitals in the original risk model study population). For example, the Parsonnet model was developed on all patients undergoing open heart surgery (including higher risk valve cases) operated on in the early 1980s. As such, these patients had high bypass mortality rates relative to contemporary, bypass-only, outcomes. When applied in our population, the Parsonnet model significantly overpredicted mortality risks, making a third of CCP hospitals appear to be superior hospitals and none being inferior (Table 3). If, instead, the Hannan model, based on more contemporary bypass-only cases from New York (a state with the lowest US bypass mortality) (45), were applied as the external benchmark, only one hospital would have been identified as a superior performer while four would be significantly inferior. While the selection of an external benchmark is arbitrary, it seems reasonable to select a risk-adjustment model that is both clinically meaningful and with overall event rates similar to those found in the study population.
Alternatively, published models can be refit or recalibrated to match event rates in a new patient group. While various methods of model recalibration have been proposed, the process, however, is quite analogous to the developing of a "new" prediction model. As such, it requires a sufficiently large study population to assure stable model performance, as well as appropriate analytical oversight. When recalibration is achieved, outlier status will be based on internal (or peer), as opposed to external performance standards. Our study demonstrated that hospital performance metrics (including designation of outlier status) were quite consistent after recalibration regardless of which of the four published risk models was used as the start-point for this process.
A final alternate method of determining outlier status is also based on internal performance standards (Table 5). In contrast to the prior method, this technique does not require model recalibration to achieve this goal. Specifically, this random-effects statistical model will determine if a hospitals surgical mortality rate differs significantly from that seen in the other comparison hospitals, after adjusting for baseline risk (based on one of the published risk models). Although this technique has some advantages (37), it remains possible that an average or better hospital could be singled out as a "poor performer" if all its comparison centers were outstanding. Additionally, this method is conservative and is less likely to identify outliers at low volume centers. To gain the clearest idea of ones surgical outcomes, it is ideal to benchmark ones results both among ones regional and national peers.
Study limitations.
The performance of any surgical risk model depends in part on whether all of the variables used in the original model are collected, the degree to which variable definitions are congruent and how accurately these variables are collected. As noted, the CCP data definitions were prospectively designed to be consistent with variables needed, but slight variation between the definitions used by CCP and those used in the original patient population was unavoidable. Our study population was also limited to Medicare patients. The performance of these models may be different in all aged bypass patients. Additionally, the duration of data collection and number of cases studied per hospital was somewhat limited (average 132), which limited the power of our study to identify outlier performers.
 |
Conclusions
|
|---|
Comparing hospital-specific outcome data will remain a challenging exercise. This information has the potential to give both patients and clinicians important feedback concerning a centers quality of care. However, these data can also be confounded if the results do not take into account the surgical risks of the patients treated. Using bypass surgery as a test case, we found that published surgery risk-adjustment models varied in their ability to accurately predict mortality when applied in a community-based elderly population. Despite differences in model calibration, a hospitals risk-adjusted surgical outcomes results, relative to its peers, tended to be quite consistent regardless of which risk-adjustment model was applied. The identification of outliers (with superior or inferior surgical results) varied, however, depending on which performance benchmark was used.
These data support the concept of risk-adjusting outcomes comparisons, but re-emphasize the importance of the risk-adjustment process. To be meaningful, consumers of these new "risk-adjusted outcomes report cards" must understand clearly how their data were analyzed and to what benchmark their results were compared.
 |
Appendix 1
|
|---|
Comparison of covariates in risk-adjustment algorithm.
Risk Factor
|
Parsonnet Algorithm
|
Higgins Algorithm
|
OConnor Algorithm
|
Hannan Algorithm
|
|
| Age |
x |
x |
x |
x |
| Gender |
x |
|
x |
x |
| Smoking |
x |
|
|
|
| Hypertension |
x |
|
|
|
| Diabetes mellitus |
x |
|
|
x |
| Vascular disease |
|
x |
|
|
| Chronic pulmonary disease |
|
x |
|
x |
| Anemia |
|
x |
|
|
| Renal insufficiency |
|
x |
|
|
| Dialysis dependence |
x |
|
|
x |
| Obesity* |
x |
|
x |
x |
| Charlson comorbidity score |
|
|
x |
|
| Congestive heart failure |
|
|
|
x |
| Unstable angina |
|
|
|
x |
| Recent myocardial infarction |
|
|
|
x |
| LV ejection fraction |
x |
x |
x |
x |
| LV end diastolic pressure |
|
|
x |
|
| Left main stenoses |
|
|
|
x |
| Mitral valve disease |
x |
x |
|
|
| Aortic valve disease |
x |
|
|
|
| LV aneurysm |
x |
|
|
|
| Prior bypass surgery |
x |
x |
x |
x |
| Priority at surgery |
|
x |
x |
|
| Preoperative intra-aortic balloon pump |
x |
|
|
x |
Catastrophic states
|
x
|
|
|
x
|
|
LV = left ventricular.
* OConnor model uses the continuous variable body surface area (BSA).
 |
Footnotes
|
|---|
Supported in part by grant HS 06503-03 Supplement 2 from the Health Care Financing Administration through the Agency for Health Care Policy and Research; and R01 HS09940-01A1 from the Agency for Health Care Policy and Research.
 |
References
|
|---|
1. Nightingale F. Notes on Hospitals. 3rd ed. London: 1863.
2. Codman EA. A Study in Hospital Efficiency as Demonstrated by the Case Report of the First Five Years of a Private Hospital. Boston: Thomas Todd Company Printers; 1917.
3. Donabedian A. The Methods and Findings of Quality Assessment and Monitoring: An Illustrated Analysis. Ann Arbor, MI: Health Administration Press; 1985.
4. Donabedian A. The end results of health care: Ernest Codmans contribution to quality assessment and beyond. Milbank Q. 1989;67:233256[Medline]
5. Ellwood P. Shattuck lectureoutcomes management: a technology of patient experience. N Engl J Med. 1988;318:15491556[Medline]
6. Relman AS. Assessment and accountability: the third revolution in medical care. N Engl J Med. 1988;318:12201222
7. Iezzoni LI. Risk and outcomes. Iezzoni LI. Risk Adjustment for Measuring Healthcare Outcomes. Chicago: Health Administration Press; 1997. p. 141
8. Iezzoni LI. The risks of risk adjustment. JAMA. 1997;278:16001607[Abstract/Free Full Text]
9. Hannan EL, Kilburn H, Racz M, Shields E, Chassin MR. Improving the outcomes of coronary artery bypass surgery in New York State. JAMA. 1994;271:761766[Abstract/Free Full Text]
10. Hannan EL, Kilburn H Jr, ODonnell JF. Adult open heart surgery in New York State: an analysis of risk factors and hospital mortality rates. JAMA. 1990;264:27682774[Abstract/Free Full Text]
11. Chassin MR, Hannan EL, BeBuono BA. Benefits and hazards of reporting medical outcomes publicly. N Engl J Med. 1996;334:394398[Free Full Text]
12. Localio AR, Hamory BH, Fisher AC, TenHave TR. The public release of hospital and physician mortality data in Pennsylvania: a case study. Med Care. 1997;35:272286[CrossRef][Medline]
13. Bentley JM, Nash DB. How Pennsylvania hospitals have responded to publicly released reports on coronary artery bypass graft surgery. Joint Commission J Qual Improvement. 1998;24:4049
14. Grover FL, Hammermeister KE, Burchfiel C. Initial report of the Veterans Administration preoperative risk assessment study for cardiac surgery. Ann Thorac Surg. 1990;50:1226[Abstract]
15. Marshall G, Shroyer LW, Grover FL, Hammermeister KE. Time series monitors of outcomes: a new dimension for measuring quality of care. Med Care. 1998;36:348356[CrossRef][Medline]
16. OConnor GT, Plume SK, Olmstead EM, et al. A regional intervention to improve the hospital mortality associated with coronary artery bypass graft surgery. JAMA. 1996;275:841846[Abstract/Free Full Text]
17. Steering Committee Provincial Adult Cardiac Care Network OntarioTu JV, Naylor CD. Coronary artery bypass mortality rates in Ontario: a Canadian approach to quality assurance in cardiac surgery. Circulation. 1996;94:24292433[Abstract/Free Full Text]
18. Shroyer LW, Edwards FH, Grover FL. Updates to the data quality review program: the Society of Thoracic Surgeons adult cardiac national database. Ann Thorac Surg. 1998;65:14941497[Abstract/Free Full Text]
19. Leape LL, Hilborne LH, Schwartz JS, et al. The appropriateness of coronary artery bypass graft surgery in academic medical centers. Ann Intern Med. 1996;125:818[Abstract/Free Full Text]
20. Holman WL, Athanasuleas CL, Allman RM, Sherrill RG, for the Alabama Quality Assurance Foundation CABG Cooperative Project. Alabama CABG Cooperative Project: baseline data. Ann Thorac Surg. In Press.
21. Parsonnet V, Dean D, Bernstein AD. A method of uniform stratification of risk for evaluating the results of surgery in acquired adult heart disease. Circulation. 1989;79(Suppl I):I3I12
22. OConnor GT, Plume SK, Olmstead EM, et al. Multivariate prediction of in-hospital mortality associated with coronary artery bypass graft surgery. Circulation. 1992;85:21102118[Abstract/Free Full Text]
23. Higgins TL, Estafanous FG, Loop FD, Beck GJ, Blum JM, Paranandi L. Stratification of morbidity and mortality outcome by preoperative risk factors in coronary artery bypass patients. JAMA. 1992;267:23442348[Abstract/Free Full Text]
24. Nashef SAM, Carey F, Silcock MM, Oomen PK, Levy RD, Jones MT. Risk stratification for open heart surgery: trial of the Parsonnet system in a British hospital. BMJ. 1992;305:10661067[Free Full Text]
25. Junod FL, Harlan BJ, Payne J, et al. Preoperative risk assessment in cardiac surgery: comparison of predicted and observed results. Ann Thorac Surg. 1987;43:5964[Abstract]
26. Orr RK, Maini BS, Sottile FD, Dumas EM, OMara P. A comparison of four severity-adjusted models to predict mortality after coronary artery bypass graft surgery. Arch Surg. 1995;130:301306[Abstract/Free Full Text]
27. Landon B, Iezzoni LI, Ash AS, et al. Judging hospitals by severity-adjusted mortality rates: the case of CABG surgery. Inquiry. 1996;33:155166[Medline]
28. Iezzoni LI, Ash AS, Schwartz M, Landon B, Mackiernan YD. Predicting in-hospital deaths from coronary artery bypass graft surgery: do different severity measures give different predictions? Med Care. 1998;36:2839[CrossRef][Medline]
29. Kassirer JP. The use and abuse of practice profiles. N Engl J Med. 1994;330:634636[Free Full Text]
30. Schneider EC, Epstein AM. Influence of cardiac-surgery performance reports on referral practices and access to care. N Engl J Med. 1996;335:251256[Abstract/Free Full Text]
31. Green J, Wintfeld N. Report cards on cardiac surgeons: assessing New York States approach. N Engl J Med. 1995;332:12291232[Free Full Text]
32. Jencks SF, Wilensky GR. The health care quality improvement initiative: a new approach to quality assurance in medicare. JAMA. 1992;268:900903[Abstract/Free Full Text]
33. Vogel RA. HCFAs cooperative cardiovascular project: a nationwide quality assessment of acute myocardial infarction. Clin Cardiol. 1994;17:354356[Medline]
34. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1983;143:2936
35. Harrell FE Jr, Lee KL, Califf RM, Pryor DB, Rosati RA. Regression modelling strategies for improved prognostic prediction. Stat Med. 1984;3:142152
36. Lemeshow S, Hosmer DW Jr. A review of goodness of fit statistics for use in the development of logistic regression models. Am J Epidemiol. 1982;115:92106[Abstract/Free Full Text]
37. DeLong ER, Peterson ED, DeLong DM, Muhlbaier LH, Hackett S, Mark DB. Comparing risk-adjustment methods for provider profiling. Stat Med. 1997;16:26452664[CrossRef][Medline]
38. Efron B, Morris C. Steins paradox in statistics. Sci Am. 1977;236:119127
39. Topol EJ, Califf RM. Scorecard cardiovascular medicine: its impact and future directions. Ann Intern Med. 1994;120:6570[Abstract/Free Full Text]
40. Cooley DA. Building shelters: safeguards in public disclosure of outcomes data. Circulation. 1996;93:13[Free Full Text]
41. Califf RM, Jollis JG, Peterson ED. Operator specific outcomes: a call for professional responsibility. Circulation. 1996;93:403406[Free Full Text]
42. Salem-Schatz S, Moore G, Rucker M, Pearson SD. The case for case-mix adjustment in practice profiling: when good apples look bad. JAMA. 1994;272:871874[Abstract/Free Full Text]
43. Lorence D. Benchmarking quality under US health care reform: the next generation. Qual Prog. 1994;27:103107
44. Kiefe C, Wooley TW, Allison JJ, Box JB, Craig AS. Determining benchmarks: a data-driven search for the best achievable performance. Clin Performance Qual Health Care. 1994;2:190194
45. Peterson ED, DeLong ER, Jollis JG, Muhlbaier LH, Mark DB. The effects of New Yorks bypass surgery provider profiling on access to care and patient outcomes in the elderly. J Am Coll Cardiol. 1998;32:993999[Abstract/Free Full Text]
This article has been cited by other articles:

|
 |

|
 |
 
L. W. Klein, P. Kolm, X. Xu, R. J. Krone, H. V. Anderson, J. S. Rumsfeld, R. G. Brindis, and W. S. Weintraub
A Longitudinal Assessment of Coronary Interventional Program Quality: A Report From the American College of Cardiology-National Cardiovascular Data Registry
J. Am. Coll. Cardiol. Intv.,
February 1, 2009;
2(2):
136 - 143.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Schneeweiss, J. D. Seeger, J. Landon, and A. M. Walker
Aprotinin during Coronary-Artery Bypass Grafting and Risk of Death
N. Engl. J. Med.,
February 21, 2008;
358(8):
771 - 783.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. D'Alessandro, P. Leprince, J. L. Golmard, A. Ouattara, S. Aubert, A. Pavie, I. Gandjbakhch, and N. Bonnet
Strict glycemic control reduces EuroSCORE expected mortality in diabetic patients undergoing myocardial revascularization
J. Thorac. Cardiovasc. Surg.,
July 1, 2007;
134(1):
29 - 37.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Jin and G. L. Grunkemeier
Does the logistic EuroSCORE offer an advantage over the additive model?
Interactive CardioVascular and Thoracic Surgery,
February 1, 2006;
5(1):
15 - 17.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. M. Shahian, D. F. Torchiana, R. J. Shemin, J. D. Rawn, and S.-L. T. Normand
Massachusetts Cardiac Surgery Report Card: Implications of Statistical Methodology
Ann. Thorac. Surg.,
December 1, 2005;
80(6):
2106 - 2113.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. L. Grunkemeier and A. P. Furnary
Mandatory Database Participation: Risky Business?
Ann. Thorac. Surg.,
September 1, 2005;
80(3):
799 - 801.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Jin, G. L. Grunkemeier, A. Starr, and Providence Health System Cardiovascular Study Grou
Validation and Refinement of Mortality Risk Models for Heart Valve Surgery
Ann. Thorac. Surg.,
August 1, 2005;
80(2):
471 - 479.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I-C. Huang, F. Dominici, C. Frangakis, G. B. Diette, C. L. Damberg, and A. W. Wu
Is Risk-Adjustor Selection More Important Than Statistical Approach for Provider Profiling? Asthma as an Example
Med Decis Making,
January 1, 2005;
25(1):
20 - 34.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
B. M. Rothenberg, T. Pearson, J. Zwanziger, and D. Mukamel
Explaining disparities in access to high-quality cardiac surgeons
Ann. Thorac. Surg.,
July 1, 2004;
78(1):
18 - 24.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. L. Krupp, G. Weinstein, A. Chalian, J. A. Berlin, P. Wolf, and R. S. Weber
Validation of a Transfusion Prediction Model in Head and Neck Cancer Surgery
Arch Otolaryngol Head Neck Surg,
December 1, 2003;
129(12):
1297 - 1302.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. A. Spertus, M. J. Radford, N. R. Every, E. F. Ellerbeck, E. D. Peterson, and H. M. Krumholz
Challenges and opportunities in quantifying the quality of care for acute myocardial infarction: Summary from the acute myocardial infarction working group of the American heart association/American college of cardiology first scientific forum on quality of care and outcomes research in cardiovascular disease and stroke
J. Am. Coll. Cardiol.,
May 7, 2003;
41(9):
1653 - 1663.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Agabiti, C. Ancona, F. Forastiere, M. Arca, and C. A. Perucci
Evaluating outcomes of hospital care following coronary artery bypass surgery in Rome, Italy
Eur. J. Cardiothorac. Surg.,
April 1, 2003;
23(4):
599 - 606.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. A. Spertus, M. J. Radford, N. R. Every, E. F. Ellerbeck, E. D. Peterson, and H. M. Krumholz
Challenges and Opportunities in Quantifying the Quality of Care for Acute Myocardial Infarction: Summary From the Acute Myocardial Infarction Working Group of the American Heart Association/American College of Cardiology First Scientific Forum on Quality of Care and Outcomes Research in Cardiovascular Disease and Stroke
Circulation,
April 1, 2003;
107(12):
1681 - 1691.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. B. Ferguson Jr, L. P. Coombs, E. D. Peterson, and for the Society of Thoracic Surgeons National Adul
Preoperative {beta}-Blocker Use and Mortality and Morbidity Following CABG Surgery in North America
JAMA,
May 1, 2002;
287(17):
2221 - 2227.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. E. Shaw, H. V. Anderson, R. G. Brindis, R. J. Krone, L. W. Klein, C. R. McKay, P. C. Block, L. J. Shaw, K. Hewitt, W. S. Weintraub, et al.
Development of a risk adjustment mortality model using the American College of Cardiology-National Cardiovascular Data Registry (ACC-NCDR) experience: 1998-2000
J. Am. Coll. Cardiol.,
April 3, 2002;
39(7):
1104 - 1112.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. B. Ferguson Jr, B. G. Hammill, E. D. Peterson, E. R. DeLong, and F. L. Grover
A decade of change--risk profiles and outcomes for isolated coronary artery bypass grafting procedures, 1990-1999: a report from the STS National Database Committee and the Duke Clinical Research Institute
Ann. Thorac. Surg.,
February 1, 2002;
73(2):
480 - 489.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. M. Shahian, S.-L. Normand, D. F. Torchiana, S. M. Lewis, J. O. Pastore, R. E. Kuntz, and P. I. Dreyer
Cardiac surgery report cards: comprehensive review and statistical critique
Ann. Thorac. Surg.,
December 1, 2001;
72(6):
2155 - 2168.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|