Skip Navigation
American College of Physicians Logo
  • Subscribe
  • Submit a Manuscript
  • Sign In
    Sign in below to access your subscription for full content
    INDIVIDUAL SIGN IN
    Sign In|Set Up Account
    You will be directed to acponline.org to register and create your Annals account
    INSTITUTIONAL SIGN IN
    Open Athens|Shibboleth|Log In
    Annals of Internal Medicine
    SUBSCRIBE
    Subscribe to Annals of Internal Medicine.
    You will be directed to acponline.org to complete your purchase.
Annals of Internal Medicine Logo Menu
  • Latest
  • Issues
  • Channels
  • CME/MOC
  • In the Clinic
  • Journal Club
  • Web Exclusives
  • Author Info
Advanced Search
  • ‹ PREV ARTICLE
  • This Issue
  • NEXT ARTICLE ›
Reviews |7 March 2017

The Effects of Pay-for-Performance Programs on Health, Health Care Use, and Processes of Care: A Systematic Review Free

Aaron Mendelson, BA; Karli Kondo, PhD; Cheryl Damberg, PhD; Allison Low, BA; Makalapua Motúapuaka, BA; Michele Freeman, MPH; Maya O'Neil, PhD; Rose Relevo, MLIS, MS; Devan Kansagara, MD, MCR

Aaron Mendelson, BA
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Karli Kondo, PhD
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Cheryl Damberg, PhD
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Allison Low, BA
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Makalapua Motúapuaka, BA
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Michele Freeman, MPH
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Maya O'Neil, PhD
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Rose Relevo, MLIS, MS
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Devan Kansagara, MD, MCR
From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

Article, Author, and Disclosure Information
Author, Article, and Disclosure Information
This article was published at Annals.org on 10 January 2017.
  • From VA Portland Health Care System, and Oregon Health & Science University, Portland, Oregon, and RAND Corporation, Santa Monica, California.

    Disclaimer: The views and conclusions expressed in this article are those of the authors, who are responsible for its content, and do not necessarily represent the views of the U.S. Department of Veterans Affairs or the U.S. government. Therefore, no statement in this article should be construed as an official position of the U.S. Department of Veterans Affairs.

    Financial Support: By the U.S. Department of Veterans Affairs, Veterans Health Administration, Office of Research and Development, Quality Enhancement Research Initiative, Evidence-based Synthesis Program (project 05-225).

    Disclosures: Authors have disclosed no conflicts of interest. Forms can be viewed at www.acponline.org/authors/icmje/ConflictOfInterestForms.do?msNum=M16-1881.

    Editors' Disclosures: Christine Laine, MD, MPH, Editor in Chief, reports that she has no financial relationships or interests to disclose. Darren B. Taichman, MD, PhD, Executive Deputy Editor, reports that he has no financial relationships or interests to disclose. Cynthia D. Mulrow, MD, MSc, Senior Deputy Editor, reports that she has no relationships or interests to disclose. Deborah Cotton, MD, MPH, Deputy Editor, reports that she has no financial relationships or interest to disclose. Jaya K. Rao, MD, MHS, Deputy Editor, reports that she has stock holdings/options in Eli Lilly and Pfizer. Sankey V. Williams, MD, Deputy Editor, reports that he has no financial relationships or interests to disclose. Catharine B. Stack, PhD, MS, Deputy Editor for Statistics, reports that she has stock holdings in Pfizer and Johnson & Johnson.

    Reproducible Research Statement:Study protocol and data set: See the Supplement. Statistical code: Not applicable.

    Requests for Single Reprints: Devan Kansagara, MD, MCR, VA Portland Health Care System, Mail Code R&D 71, 3710 SW US Veterans Hospital Road, Portland, OR 97239-2999; e-mail, kansagar@ohsu.edu.

    Current Author Addresses: Mr. Mendelson: Oregon Health & Science University, Mail Code MDYCHSE, 3181 SW Sam Jackson Park Road, Portland, OR 97239.

    Drs. Kondo, O'Neil, and Kansagara; Ms. Low; Ms. Motúapuaka; Ms. Freeman; and Ms. Relevo: VA Portland Health Care System, Mail Code R&D 71, 3710 SW US Veterans Hospital Road, Portland, OR 97239-2999.

    Dr. Damberg: RAND (Health), 1776 Main Street, Santa Monica, CA 90407-2138.

    Author Contributions: Conception and design: K. Kondo, M. Motúapuaka, M. O'Neil, R. Relevo, D. Kansagara.

    Analysis and interpretation of the data: A. Mendelson, K. Kondo, C. Damberg, A. Low, M. Motúapuaka, M. Freeman, M. O'Neil, D. Kansagara.

    Drafting of the article: A. Mendelson, K. Kondo, C. Damberg, M. Motúapuaka, M. O'Neil, D. Kansagara.

    Critical revision of the article for important intellectual content: A. Mendelson, K. Kondo, A. Low, M. Motúapuaka, D. Kansagara.

    Final approval of the article: A. Mendelson, K. Kondo, C. Damberg, A. Low, M. Motúapuaka, M. Freeman, M. O'Neil, R. Relevo, D. Kansagara.

    Obtaining of funding: D. Kansagara.

    Administrative, technical, or logistic support: A. Low, M. Motúapuaka, M. Freeman.

    Collection and assembly of data: A. Mendelson, K. Kondo, C. Damberg, A. Low, M. Motúapuaka, M. Freeman, M. O'Neil, R. Relevo, D. Kansagara.

×
  • ‹ PREV ARTICLE
  • This Issue
  • NEXT ARTICLE ›
Jump To
  • Full Article
  • FULL ARTICLE
  • FULL ARTICLE
    • Abstract
    • Methods
    • Results
    • Discussion
      1. References
  • Figures
  • Tables
  • Supplements
  • Audio/Video
  • Summary for Patients
  • Clinical Slide Sets
  • CME / MOC
  • Comments
  • Twitter Link
  • Facebook Link
  • Email Link
More
  • LinkedIn Link

Abstract

Background:

The benefits of pay-for-performance (P4P) programs are uncertain.

Purpose:

To update and expand a prior review examining the effects of P4P programs targeted at the physician, group, managerial, or institutional level on process-of-care and patient outcomes in ambulatory and inpatient settings.

Data Sources:

PubMed from June 2007 to October 2016; MEDLINE, PsycINFO, CINAHL, Business Economics and Theory, Business Source Elite, Scopus, Faculty of 1000, and Gartner Research from June 2007 to February 2016.

Study Selection:

Trials and observational studies in ambulatory and inpatient settings reporting process-of-care, health, or utilization outcomes.

Data Extraction:

Two investigators extracted data, assessed study quality, and graded the strength of the evidence.

Data Synthesis:

Among 69 studies, 58 were in ambulatory settings, 52 reported process-of-care outcomes, and 38 reported patient outcomes. Low-strength evidence suggested that P4P programs in ambulatory settings may improve process-of-care outcomes over the short term (2 to 3 years), whereas data on longer-term effects were limited. Many of the positive studies were conducted in the United Kingdom, where incentives were larger than in the United States. The largest improvements were seen in areas where baseline performance was poor. There was no consistent effect of P4P on intermediate health outcomes (low-strength evidence) and insufficient evidence to characterize any effect on patient health outcomes. In the hospital setting, there was low-strength evidence that P4P had little or no effect on patient health outcomes and a positive effect on reducing hospital readmissions.

Limitation:

Few methodologically rigorous studies; heterogeneous population and program characteristics and incentive targets.

Conclusion:

Pay-for-performance programs may be associated with improved processes of care in ambulatory settings, but consistently positive associations with improved health outcomes have not been demonstrated in any setting.

Primary Funding Source:

U.S. Department of Veterans Affairs.

Pay-for-performance (P4P) programs provide financial rewards or penalties to individual health care providers, groups of providers, or institutions according to their performance on measures of quality. In theory, if properly targeted and designed, P4P programs would help drive the behavior of providers and health care systems to improve the quality of care delivered, reduce unnecessary use of expensive health care services, and improve patient health outcomes (1). The idea is particularly relevant in the United States, where serious and broad gaps in health care quality have been tied in part to the long-standing fee-for-service system, which may provide incentives for service volume rather than quality (2).
Despite their intuitive appeal, the promise of P4P programs in improving outcomes has not been empirically realized in past studies (3–6). The most recent systematic review examining the effectiveness of P4P programs in the United States found mixed evidence that P4P was associated with modest improvements in process-of-care outcomes but had little effect on patient outcomes (7). However, the literature has grown considerably since this review (which searched through 2012), and other countries, such as the United Kingdom, have gained considerable experience with large P4P initiatives that may provide information relevant to the United States. The purpose of the current review is to update and expand the prior systematic review in order to summarize current understanding of the effects of P4P programs targeted at physicians, groups, and institutions on process-of-care and patient outcomes in ambulatory and outpatient settings in and outside the United States.

Methods

This review was conducted according to a protocol that was developed using established reporting standards and posted to a public Web site (8) before the study was initiated (Appendix 1 of the Supplement). We used an analytic framework based on work by Damberg and colleagues (7) (Appendix 2 of the Supplement).

Data Sources and Searches

We searched the following databases for studies that evaluated P4P programs: PubMed (1 June 2007 to 6 October 2016), MEDLINE, PsycINFO, CINAHL, Business Economics and Theory, Business Source Elite, Scopus, Faculty of 1000, and Gartner Research (1 June 2007 to 29 February 2016). We also performed targeted Google and PubMed searches aimed at well-known P4P demonstrations. We obtained additional articles from reference lists of pertinent studies, reviews, editorials, and expert recommendations. The search strategies are detailed in Appendix 3 of the Supplement.

Study Selection

Investigators reviewed titles and abstracts identified from literature searches. Two investigators independently assessed each potentially relevant article for inclusion using preestablished criteria (Appendices 4 and 5 of the Supplement). We included English-language studies of adult patients that evaluated ambulatory care– or hospital-based P4P programs targeting health care providers at the individual, group, managerial, or institutional level and that reported any process-of-care, utilization, health, or intermediate health (clinical measures, such as a laboratory value or blood pressure) outcome. We included studies from other countries that have health systems similar to portions of the U.S. health care system. Studies examining only patient-targeted financial incentives, as well as payment models other than direct P4P, such as managed care, capitation, bundled payments, and accountable care organizations, were excluded. We also excluded studies that were not conducted in hospital or ambulatory settings, such as studies in long-term care facilities or nursing homes.
We included clinical or cluster randomized, controlled trials (RCTs) of any size. We used a best-evidence approach, which is a method of specifying minimum inclusion criteria for nonrandomized studies (9). Inclusion of observational studies was limited to those with a comparison group, interrupted time series (ITS) studies, or large (n > 10 000) cross-sectional or uncontrolled before–after studies. We excluded smaller uncontrolled studies because we had identified a large number of potentially relevant studies during a preliminary search and because the smaller uncontrolled studies were less likely to provide broadly applicable information given their limited scope and inherent methodological deficiencies.

Data Extraction and Quality Assessment

One investigator abstracted data elements from each included study, which were reviewed for accuracy by at least 1 additional investigator. We abstracted information on study design, sample size, country, program description, incentive structure (size and timing), target of the incentive, comparator, and outcomes (grouped as health, intermediate health, process-of-care, and utilization measures). Appendices 6 and 7 of the Supplement report these data. We classified studies according to 4 broad groupings: RCTs, ITS studies, controlled before–after studies, and uncontrolled before–after studies. Two investigators independently assessed study quality using the Newcastle-Ottawa Scale (10) for observational studies and the Cochrane Risk-of-Bias tool (11) for RCTs (Appendix 8 of the Supplement). Disagreements were resolved by consensus.

Data Synthesis and Analysis

We qualitatively synthesized the results of ambulatory and hospital studies separately and report process-of-care and patient outcomes for each setting. We synthesized results for specific P4P programs whenever possible. The review team evaluated the strength of the evidence according to guidance from the Agency for Healthcare Research and Quality (12). We did not perform meta-analysis because of the marked clinical heterogeneity across studies and the large number of observational studies.

Role of the Funding Source

The U.S. Department of Veterans Affairs Quality Enhancement Research Initiative supported this review but had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.

Results

Search Results

We reviewed 3418 titles and abstracts, identified 586 potentially eligible full-text articles, and ultimately included 69 studies (Figure). Fifty-eight studies were in ambulatory settings (Table 1 and Appendix 6 of the Supplement), 11 were in hospital settings (Table 2 and Appendix 7 of the Supplement), 52 reported process-of-care outcomes, and 38 assessed patient outcomes. The studies examined a wide range of P4P programs with varying incentive structures, goals, and contexts. The programs also differed in their purposes and targets, but the largest number of studies focused on managing chronic conditions in the primary care setting. Studies were conducted in a wide range of countries, including the United Kingdom (27 studies), the United States (17 studies), Taiwan (13 studies), France (3 studies), the Netherlands (3 studies), Canada (3 studies), Australia (1 study), South Korea (1 study), and Italy (1 study). There were 2 RCTs and 67 observational studies (10 ITS studies, 37 controlled before–after studies, and 20 large uncontrolled before–after studies).
Figure.

Literature flow diagram.

P4P = pay-for-performance.

* The current systematic review updates and expands on the review by Damberg and colleagues (7).

Table 1. Findings From Studies of Ambulatory-Based Pay-for-Performance Programs

Table 1. Findings From Studies of Ambulatory-Based Pay-for-Performance Programs

Table 2. Findings From Studies of Hospital-Based Pay-for-Performance Programs

Table 2. Findings From Studies of Hospital-Based Pay-for-Performance Programs
A large number of studies evaluated different aspects of 2 large-scale national programs: the United Kingdom's Quality and Outcomes Framework (QOF) (24 studies) and Taiwan's diabetes mellitus (DM-P4P) program (9 studies). The QOF is a nationwide program that began in 2004. It incentivizes primary care practices to achieve quality indicators that support clinical care and public health goals. Incentive payments can comprise up to approximately 30% of total income. Practices are aided by integrated health information technology that delivers automated prompts and decision support (36, 83). Taiwan's DM-P4P program, implemented in 2001, allows physicians to voluntarily enroll in the program, and they in turn are given freedom to choose which patients to enroll (51). From 2001 to 2006, incentives targeted process-of-care outcomes, which were augmented with intermediate health outcome measures after 2006.

Ambulatory Care–Based Programs

Process-of-Care Outcomes

We found 9 studies from the United States evaluating the effects of P4P on process-of-care outcomes (14, 16–20, 22–24). Most of these studies examined outcomes over 4 years and had an average follow-up of 2.5 years; very few studies reported longer-term data. One RCT found that individual incentives increased appropriate response to high blood pressure but not use of guideline-recommended antihypertensive medication (14). Of the 6 studies that reported positive results (16, 18, 19, 22–24), 1 did not have a control group (24), and selection bias was a serious concern in 3 others because of the way the control group was chosen (18, 22, 23). Two methodologically sound controlled before–after studies found no improvements in processes of care (17, 20).
In general, there was evidence across 17 studies in the United Kingdom (26–31, 33, 36–38, 41–47) that the QOF was associated with improvements in process-of-care measures, although the evidence was mixed among the more methodologically rigorous studies. There were 6 ITS studies. One showed substantial improvements in the prescription of long-acting reversible contraceptives (26), and another showed modest improvement in the initiation of diabetes medications (27). Another study found increased rates of depression screening and diagnoses, but antidepressant prescribing remained unchanged (31). In the other 3 studies, improvements had begun well before QOF implementation, and postintervention trends did not show substantial improvement and, in fact, showed slower or decreased improvement over time (28–30).
Although many studies of Taiwan's DM-P4P program showed improvement in process-of-care measures, selection bias was a major concern (51–54, 58). Physicians voluntarily enrolled and were given discretion over which patients to enroll. Because the program lacked risk adjustment and, initially, a mechanism to disenroll patients, physicians had a strong incentive to enroll healthier patients (51). Indeed, enrolled patients were much healthier than nonenrolled patients. Moreover, at participating institutions, the pool of nonenrolled patients became sicker over time, indicating that healthier patients were being removed to participate in DM-P4P. Though many studies attempted to adjust for differences in the 2 groups by using propensity score matching, residual confounding was still an important potential issue given the many unmeasured factors that were likely to be related to enrollment decision making.
We found 13 non-U.S. studies that were not part of a larger P4P evaluation. Two of these studies were methodologically sound observational studies from Canada that reported contradictory results on screening and preventive measures (66, 67). An ITS study found modest increases in colorectal cancer screening but no effects on cervical and breast cancer screening (66). However, a controlled before–after study found modest increases for colorectal cancer screening, mammography, flu shots, and Papanicolaou smears (67). It was difficult to draw strong conclusions from the other 11 studies because of disparities in the programs' targets and designs and the study settings, as well as the low quality of the study designs (49, 50, 61–65, 68–71).

Patient Outcomes

Health Outcomes.
Ten studies evaluated health outcomes in ambulatory settings (39, 44, 51, 52, 55–57, 59–61). Eight of the studies (most of which found positive results) were conducted in Taiwan and should be interpreted with caution due to selection bias, as described earlier (51, 52, 55–57, 59–61). Two large uncontrolled before–after studies of QOF reported no improvements in health outcomes (39, 44). One assessed the correlation among regional QOF performance, all-cause mortality, and condition-specific mortality (39). It found that better performance on both the aggregate of QOF quality indicators and a subset of intermediate outcome indicators did not correlate with reduced mortality. Another study found that chronic obstructive pulmonary disease (COPD) prevalence actually increased from 1.27% to 1.45% after QOF implementation (44). Given the time needed to develop COPD and that most QOF indicators focused on managing COPD rather than preventing it, the implications of these findings are unclear.
Studies with high risk of bias generally found positive effects associated with DM-P4P (51, 52, 55–57, 59) and the similarly structured tuberculosis P4P program (60, 61). However, given the limitations already highlighted, such results are difficult to interpret.
Utilization Outcomes.
We found 6 studies from the United States (16, 17, 20–22, 24), 5 studies from Taiwan (52–54, 58, 60), and 1 QOF study (34) reporting utilization outcomes.
The 6 studies from the United States reported mixed findings on the effects of P4P on utilization, although studies with the strongest designs showed no effect. One rigorously controlled study examined a P4P intervention that provided bonuses to practices that achieved advanced medical home status and found no effect on all-cause hospitalizations, all-cause emergency department (ED) visits, or ambulatory care–sensitive ED visits (17). Ambulatory care–sensitive hospitalizations actually increased in the second year of the intervention. Another controlled before–after study examined P4P in 3 state Medicaid programs and found no changes in any of the states for ED visits and inconsistent findings on inpatient utilization (20). A study examining a P4P program in medical homes targeting improved diabetes screenings and care found reductions in ED use and primary care visits but not in 6 other utilization measures (21). One study of a Medicare Advantage plan that rewarded physicians for providing evidence-based care to patients with heart failure found no effect on acute admissions or ED visits (16). Two studies lacking appropriate control groups showed improvement in ED use (22, 24).
Studies in Taiwan generally found reductions in hospital use associated with P4P (52–54, 58, 60). Again, due to the high likelihood of selection bias, these studies should be interpreted with caution.
A QOF study found a sustained reduction in ambulatory care–sensitive ED admissions (34).
Intermediate Health Outcomes.
Twelve studies reported 1 or more intermediate health outcomes (13, 14, 25, 32, 35, 37, 38, 40, 41, 43, 47, 48). There were 2 RCTs with low risk of bias conducted in the United States. One RCT (n = 1503) evaluated the effect of a P4P program on low-density lipoprotein cholesterol levels (13). Physicians were given monthly patient progress reports and were eligible for comparatively large P4P bonuses ($256 quarterly per patient) that were separated from other funding sources to highlight their relevance. Physicians received average total incentive payments of $3246. The difference in low-density lipoprotein cholesterol level between patients seen by physicians in the P4P and control groups was not significant (2.8 mg/dL [95% CI, −1.7 to 7.4 mg/dL]; P = 0.66).
The other RCT (14) was included in the prior review by Damberg and colleagues, but a substudy was recently published (15). The original trial compared the effect of financial incentives earned for controlled blood pressure or response to uncontrolled blood pressure across 4 groups: incentives directed to individual physicians, practices, or both, or no incentives (14). The study included 77 physicians; payments and performance feedback were delivered to physicians at the end of each 4-month performance period. The average total payment for physicians completing the entire program was $2744. A higher proportion of patients achieved one or both measures in the individual physician incentive group than the control group (difference, 8.36% [CI, 2.4% to 13.0%]; P = 0.005), although the differences were not significant in the other 2 intervention groups. The recently published substudy found that the proportion of patients achieving control was not significantly higher in the incentive group (15).
Ten observational studies examining QOF reported mixed findings on intermediate outcomes (25, 32, 35, 37, 38, 40, 41, 43, 47, 48), but methodologically stronger studies suggested that QOF had little effect. Uncontrolled studies suggested large improvements in blood pressure control, cholesterol levels, and hemoglobin A1c (HbA1c) control. However, higher-quality studies that accounted for time trends failed to replicate these findings (25, 32). One short-term ITS study found that blood pressure control and cholesterol levels improved but HbA1c control worsened relative to the underlying trend (32). A longer-term ITS study found that although mean cholesterol and HbA1c levels and blood pressure control had been improving before QOF implementation, only systolic blood pressure continued to improve afterward. Diastolic blood pressure, mean cholesterol levels, and HbA1c levels actually worsened relative to the pre-QOF trend (25).

Hospital-Based Programs

Process-of-Care Outcomes

Eight studies examined process-of-care measures in the hospital setting (74–77, 79–82). Controlled before–after studies from the United States and Canada generally failed to find improvements in care processes (74, 75), although 1 study from Canada did report modest reductions in ED wait times (80). One controlled study from Taiwan found that P4P-enrolled patients with breast cancer received better-quality care than nonenrolled patients (79). Uncontrolled studies reported larger improvements (76, 77, 81, 82).

Patient Outcomes

Health Outcomes.
Pay-for-performance programs generally did not decrease mortality or improve patient experience in 5 studies in hospital settings (73, 74, 78, 79, 82). High-quality studies examining the U.K. Hospital Quality Incentive demonstration and the U.S. Hospital Value-Based Purchasing (HVBP) programs did not find a link between mortality and targeted conditions (73, 78). One short-term controlled before–after study found no immediate change in patient experience associated with the HVBP program (74). One uncontrolled study found that mortality related to hemorrhagic strokes did not decrease after implementation of P4P (82). A study from Taiwan indicated that P4P patients had improved breast cancer survival (79).
Utilization Outcomes.
One ITS study reported utilization outcomes (72) and found that hospital readmissions among Medicare fee-for-service patients decreased sharply for approximately 2 years after implementation of the Hospital Readmissions Reduction Program; improvements continued thereafter but at a substantially lower rate. Although readmission reductions were seen for various conditions, they decreased more among the measures that were specifically targeted by the program than those that were not.

Discussion

This systematic review of 69 studies updated and expanded on a previous review that had focused on U.S. programs and reported similar findings (7). The strength of the evidence and key results are summarized in Table 3. Overall, in the ambulatory setting, we found low-strength evidence that P4P programs may improve process-of-care outcomes over the short term (2 to 3 years). Evidence on the longer-term effects of P4P programs was limited. Many of the studies reporting positive findings were conducted in the United Kingdom, where incentives were much larger than any P4P programs in the United States. The largest improvements were seen in areas where baseline performance was poor. We found low-strength evidence that P4P had little to no effect on intermediate health outcomes (changes in laboratory measures), though there were inconsistencies among study results. The evidence examining patient health outcomes was insufficient because few methodologically rigorous studies reported these outcomes. In the hospital setting, low-strength evidence showed that P4P had a neutral effect on patient health outcomes and a positive effect on reducing hospital readmissions.

Table 3. Strength of the Evidence

Table 3. Strength of the Evidence
Although many studies found positive effects associated with P4P programs, the results were inconsistent across studies, the magnitude of effect was often small, and it was difficult to confidently ascribe observed changes in outcomes to the intervention itself because of the observational nature of most studies and their specific methodological flaws. To better characterize the breadth of programs that have been evaluated, we included large uncontrolled studies reporting outcomes before and after program implementation. However, in all of these studies, the 2 measurements potentially reflect the peak and average of normally expected measurement variation (a phenomenon known as regression to the mean). The controlled before–after studies do not have this same issue, but the choice of control group was problematic in many studies because either the patients who qualified for a P4P program differed systematically from those who did not, or the participating providers or practices differed substantially from those that did not participate. The ITS studies were useful because they accounted for trends in outcomes before the intervention. Indeed, several of these studies showed that improvements in outcomes had begun before P4P implementation. It is unclear whether these reflected secular trends in health care or practice changes in anticipation of intervention implementation.
Our findings complement and add to prior reviews, which have also generally found that P4P programs have not been consistently effective in improving patient outcomes (3–7). There are several reasons why this might be the case. First, especially in the era of modern health reform, P4P programs have been implemented and assessed in settings where other effective quality improvement interventions—such as public reporting, audit and feedback, and electronic decision-support tools—may have been deployed (84). The incremental benefit of P4P may therefore have been more difficult to demonstrate.
Second, it is possible that P4P programs have not tested the “best” incentive structures and payment mechanisms. Experts have suggested the importance of designing P4P programs using the principles of behavioral economics, in which such factors as payment size, timing, and frequency are believed to have important influences on individual behavior (85). In health care, we have not found strong empirical data to help determine the most successful incentive structure (86). It is interesting to consider the United Kingdom's QOF program, which accounted for nearly 40% of the included studies in our review, alongside U.S. efforts. Studies of QOF found that incentivized process-of-care measures can lead to improvements, especially in the early years of program implementation, but the rate of improvement slowed over time and there was no clear evidence that QOF improved patient outcomes. Whereas the P4P programs in the United States tended to be implemented within health systems or payers and involve relatively small incentives, QOF is the largest P4P program ever attempted in health care. It was implemented nationally with a single payer that includes virtually all general practitioners and provides practices with up to 30% of their annual income.
Finally, P4P programs are very complex health system interventions that have been implemented in various ways. In a related article, we examined the implementation factors that might mediate the potentially beneficial and harmful effects of P4P programs (86). We systematically reviewed studies of implementation factors and also conducted interviews with experts in the field of P4P. Although direct evidence was inadequate to draw strong conclusions, we found that provider buy-in and alignment of measures with organizational goals were likely to be important in sustaining effective programs. We found that measures that were transparently developed from the evidence base and that were focused on improving clinical processes and patient outcomes rather than measures of efficiency were more likely to be effective. We also found that the overall number of incentives in place at any one time needs to be carefully considered. Given the evidence that the most substantial gains were consistently seen in areas of poor baseline performance, we suggested that organizations use incentives in the most-needed areas, review measures regularly, and discontinue them after achieving sustained improvements.
Our review has several important limitations. The evidence is limited by methodological flaws, variation in program and population characteristics, and limited reporting on secular trends in health care. We chose to include studies from other countries because the breadth of experience with P4P might be informative for some U.S. health systems, but we acknowledge that there are also limitations in applying findings from other countries broadly in the United States. Our review expands on a prior review, so it is possible we did not include some individual studies that are informative, though these probably would not have altered our summary findings.
The policy implications of our findings are open to interpretation. In the absence of strong evidence of benefit, it may be particularly important to consider the potential harms and costs associated with P4P. We recently published a systematic review of the unintended consequences of P4P: There was very limited evidence assessing the extent of gaming, no consistent evidence of a negative effect on health disparities, and a small amount of evidence suggesting the potential for both positive and negative effects on unincentivized measures (87). The costs and burden of documentation and reporting requirements associated with P4P programs are also important to consider but have not been studied extensively. Qualitative studies have found that providers perceive P4P programs as imposing a considerable burden and threatening clinical autonomy (88–90). A recent survey study found that U.S. health care providers self-report spending about 15 hours per week reporting and interpreting data for measures, which translates into billions of dollars in opportunity cost (91). Indeed, the United Kingdom decided to scale back its QOF program after 10 years of experience, in part because of provider concerns and the inconsistency of data demonstrating long-term benefit (92).
On the other hand, P4P programs have likely been effective in some areas, most notably in improving processes of care. The lack of evidence on patient outcomes may reflect deficiencies in the methods that have been used to study these effects and the likelihood that it takes a long time for process-of-care improvements to translate into large-scale patient outcome improvements (93).
In summary, we found low-strength, contradictory evidence that P4P programs could improve processes of care, but we found no clear evidence to suggest that they improve patient outcomes. Value-based purchasing is a cornerstone of the coming Medicare reform known as the Medicare Access and CHIP Reauthorization Act, so P4P will remain a fixture in U.S. health care for the foreseeable future (94). Whether the inconsistency of positive findings suggests that P4P, broadly speaking, is unlikely to have large effects or is related to the marked differences in program design, patient population, and incentive target is unclear.

References

  1. Epstein
    AM
    ,  
    Lee
    TH
    ,  
    Hamel
    MB
    .  
    Paying physicians for high-quality care.
    N Engl J Med
    2004
    350
    406
    10
     PubMed
    CrossRef
     PubMed
  2. Institute of Medicine. Report Brief. Crossing the Quality Chasm: A New Health System for the 21st Century. 2001. Accessed at http://iom.nationalacademies.org/reports/2001/crossing-the-quality-chasm-a-new-health-system-for-the-21st-century.aspx on 15 November 2016.
  3. Houle
    SK
    ,  
    McAlister
    FA
    ,  
    Jackevicius
    CA
    ,  
    Chuck
    AW
    ,  
    Tsuyuki
    RT
    .  
    Does performance-based remuneration for individual health care practitioners affect patient care?: A systematic review.
    Ann Intern Med
    2012
    157
    889
    99
    CrossRef
     PubMed
  4. Scott
    A
    ,  
    Sivey
    P
    ,  
    Ait Ouakrim
    D
    ,  
    Willenberg
    L
    ,  
    Naccarella
    L
    ,  
    Furler
    J
    ,  
    et al
    The effect of financial incentives on the quality of health care provided by primary care physicians.
    Cochrane Database Syst Rev.
    2011
    CD008451
     PubMed
  5. Eijkenaar
    F
    ,  
    Emmert
    M
    ,  
    Scheppach
    M
    ,  
    Schöffski
    O
    .  
    Effects of pay for performance in health care: a systematic review of systematic reviews.
    Health Policy
    2013
    110
    115
    30
     PubMed
    CrossRef
     PubMed
  6. Gillam
    SJ
    ,  
    Siriwardena
    AN
    ,  
    Steel
    N
    .  
    Pay-for-performance in the United Kingdom: impact of the Quality and Outcomes Framework: a systematic review.
    Ann Fam Med
    2012
    10
    461
    8
     PubMed
    CrossRef
     PubMed
  7. Damberg CL, Sorbero ME, Lovejoy SL, Martsolf GR, Raaen L, Mandel D. Measuring Success in Health Care Value-Based Purchasing Programs: Findings from an Environmental Scan, Literature Review, and Expert Panel Discussions. Santa Monica, CA: RAND Corporation; 2014. Accessed at www.rand.org/pubs/research_reports/RR306.html on 15 November 2016.
  8. Evidence-based Synthesis Program Coordinating Center. ESP Reports in Progress. Washington, DC: U.S. Department of Veterans Affairs; 2014. Accessed at www.hsrd.research.va.gov/publications/esp/in_progress.cfm on 20 August 2016.
  9. Treadwell
    JR
    ,  
    Singh
    S
    ,  
    Talati
    R
    ,  
    McPheeters
    ML
    ,  
    Reston
    JT
    .  
    A Framework for “Best Evidence” Approaches in Systematic Reviews.
    Rockville, MD
    Agency for Healthcare Research and Quality
    2011
  10. Wells GA, Shea B, O'Connell D, Peterson J, Welch V, Tugwell P. The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses. Ottawa, Ontario, Canada: Ottawa Hospital Research Institute; 2014. Accessed at www.ohri.ca/programs/clinical_epidemiology/oxford.asp on 15 November 2016.
  11. Higgins J, Green S. Cochrane Handbook for Systematic Reviews of Interventions. Version 5.1.0. Cochrane Collaboration. 2011. Accessed at http://handbook.cochrane.org on 15 November 2016.
  12. Berkman N, Lohr K, Ansari M, McDonagh M, Balk E, Whitlock E, et al. Grading the Strength of a Body of Evidence When Assessing Health Care Interventions for the Effective Health Care Program of the Agency for Healthcare Research and Quality: An Update. Methods Guide for Comparative Effectiveness Reviews. AHRQ publication no. 13(14)-EHC130-EF. Rockville, MD: Agency for Healthcare Research and Quality; 2013. Accessed at www.effectivehealthcare.ahrq.gov/ehc/products/322/998/MethodsGuideforCERs_Viswanathan_IndividualStudies.pdf on 15 November 2016.
  13. Asch
    DA
    ,  
    Troxel
    AB
    ,  
    Stewart
    WF
    ,  
    Sequist
    TD
    ,  
    Jones
    JB
    ,  
    Hirsch
    AG
    ,  
    et al
    Effect of financial incentives to physicians, patients, or both on lipid levels: a randomized clinical trial.
    JAMA
    2015
    314
    1926
    35
     PubMed
    CrossRef
     PubMed
  14. Petersen
    LA
    ,  
    Simpson
    K
    ,  
    Pietz
    K
    ,  
    Urech
    TH
    ,  
    Hysong
    SJ
    ,  
    Profit
    J
    ,  
    et al
    Effects of individual physician-level and practice-level financial incentives on hypertension care: a randomized trial.
    JAMA
    2013
    310
    1042
    50
     PubMed
    CrossRef
     PubMed
  15. Petersen
    LA
    ,  
    Ramos
    KS
    ,  
    Pietz
    K
    ,  
    Woodard
    LD
    .  
    Impact of a pay-for-performance program on care for black patients with hypertension: important answers in the era of the Affordable Care Act.
    Health Serv Res.
    2016
     PubMed
  16. Esse
    T
    ,  
    Serna
    O
    ,  
    Chitnis
    A
    ,  
    Johnson
    M
    ,  
    Fernandez
    N
    .  
    Quality compensation programs: are they worth all the hype? A comparison of outcomes within a Medicare Advantage heart failure population.
    J Manag Care Pharm
    2013
    19
    317
    24
     PubMed
     PubMed
  17. Friedberg
    MW
    ,  
    Schneider
    EC
    ,  
    Rosenthal
    MB
    ,  
    Volpp
    KG
    ,  
    Werner
    RM
    .  
    Association between participation in a multipayer medical home intervention and changes in quality, utilization, and costs of care.
    JAMA
    2014
    311
    815
    25
     PubMed
    CrossRef
     PubMed
  18. Kruse
    GR
    ,  
    Chang
    Y
    ,  
    Kelley
    JH
    ,  
    Linder
    JA
    ,  
    Einbinder
    JS
    ,  
    Rigotti
    NA
    .  
    Healthcare system effects of pay-for-performance for smoking status documentation.
    Am J Manag Care
    2013
    19
    554
    61
     PubMed
     PubMed
  19. Lemak
    CH
    ,  
    Nahra
    TA
    ,  
    Cohen
    GR
    ,  
    Erb
    ND
    ,  
    Paustian
    ML
    ,  
    Share
    D
    ,  
    et al
    Michigan's fee-for-value physician incentive program reduces spending and improves quality in primary care.
    Health Aff (Millwood)
    2015
    34
    645
    52
     PubMed
    CrossRef
     PubMed
  20. Rosenthal
    MB
    ,  
    Landrum
    MB
    ,  
    Robbins
    JA
    ,  
    Schneider
    EC
    .  
    Pay for performance in Medicaid: evidence from three natural experiments.
    Health Serv Res
    2016
    51
    1444
    66
     PubMed
    CrossRef
     PubMed
  21. Rosenthal
    MB
    ,  
    Alidina
    S
    ,  
    Friedberg
    MW
    ,  
    Singer
    SJ
    ,  
    Eastman
    D
    ,  
    Li
    Z
    ,  
    et al
    A difference-in-difference analysis of changes in quality, utilization and cost following the Colorado Multi-payer Patient-Centered Medical Home Pilot.
    J Gen Intern Med
    2016
    31
    289
    96
     PubMed
    CrossRef
     PubMed
  22. Share
    DA
    ,  
    Mason
    MH
    .  
    Michigan's Physician Group Incentive Program offers a regional model for incremental ‘fee for value' payment reform.
    Health Aff (Millwood)
    2012
    31
    1993
    2001
     PubMed
    CrossRef
     PubMed
  23. Young
    GJ
    ,  
    Beckman
    H
    ,  
    Baker
    E
    .  
    Financial incentives, professional values and performance: a study of pay-for-performance in a professional organization.
    J Organ Behav
    2012
    33
    964
    83
    CrossRef
  24. Torchiana
    DF
    ,  
    Colton
    DG
    ,  
    Rao
    SK
    ,  
    Lenz
    SK
    ,  
    Meyer
    GS
    ,  
    Ferris
    TG
    .  
    Massachusetts General Physicians Organization's quality incentive program produces encouraging results.
    Health Aff (Millwood)
    2013
    32
    1748
    56
     PubMed
    CrossRef
     PubMed
  25. Alshamsan
    R
    ,  
    Lee
    JT
    ,  
    Majeed
    A
    ,  
    Netuveli
    G
    ,  
    Millett
    C
    .  
    Effect of a UK pay-for-performance program on ethnic disparities in diabetes outcomes: interrupted time series analysis.
    Ann Fam Med
    2012
    10
    228
    34
     PubMed
    CrossRef
     PubMed
  26. Arrowsmith
    ME
    ,  
    Majeed
    A
    ,  
    Lee
    JT
    ,  
    Saxena
    S
    .  
    Impact of pay for performance on prescribing of long-acting reversible contraception in primary care: an interrupted time series study.
    PLoS One
    2014
    9
    e92205
     PubMed
    CrossRef
     PubMed
  27. Gallagher
    N
    ,  
    Cardwell
    C
    ,  
    Hughes
    C
    ,  
    O'Reilly
    D
    .  
    Increase in the pharmacological management of type 2 diabetes with pay-for-performance in primary care in the UK.
    Diabet Med
    2015
    32
    62
    8
     PubMed
    CrossRef
     PubMed
  28. Kendrick
    T
    ,  
    Stuart
    B
    ,  
    Newell
    C
    ,  
    Geraghty
    AW
    ,  
    Moore
    M
    .  
    Did NICE guidelines and the Quality Outcomes Framework change GP antidepressant prescribing in England? Observational study with time trend analyses 2003–2013.
    J Affect Disord
    2015
    186
    171
    7
     PubMed
    CrossRef
     PubMed
  29. Kontopantelis
    E
    ,  
    Reeves
    D
    ,  
    Valderas
    JM
    ,  
    Campbell
    S
    ,  
    Doran
    T
    .  
    Recorded quality of primary care for patients with diabetes in England before and after the introduction of a financial incentive scheme: a longitudinal observational study.
    BMJ Qual Saf
    2013
    22
    53
    64
     PubMed
    CrossRef
     PubMed
  30. MacBride-Stewart
    SP
    ,  
    Elton
    R
    ,  
    Walley
    T
    .  
    Do quality incentives change prescribing patterns in primary care? An observational study in Scotland.
    Fam Pract
    2008
    25
    27
    32
     PubMed
    CrossRef
     PubMed
  31. McLintock
    K
    ,  
    Russell
    AM
    ,  
    Alderson
    SL
    ,  
    West
    R
    ,  
    House
    A
    ,  
    Westerman
    K
    ,  
    et al
    The effects of financial incentives for case finding for depression in patients with diabetes and coronary heart disease: interrupted time series analysis.
    BMJ Open
    2014
    4
    e005178
     PubMed
    CrossRef
     PubMed
  32. Vamos
    EP
    ,  
    Pape
    UJ
    ,  
    Bottle
    A
    ,  
    Hamilton
    FL
    ,  
    Curcin
    V
    ,  
    Ng
    A
    ,  
    et al
    Association of practice size and pay-for-performance incentives with the quality of diabetes management in primary care.
    CMAJ
    2011
    183
    E809
    16
     PubMed
    CrossRef
     PubMed
  33. Doran
    T
    ,  
    Kontopantelis
    E
    ,  
    Valderas
    JM
    ,  
    Campbell
    S
    ,  
    Roland
    M
    ,  
    Salisbury
    C
    ,  
    et al
    Effect of financial incentives on incentivised and non-incentivised clinical activities: longitudinal analysis of data from the UK Quality and Outcomes Framework.
    BMJ
    2011
    342
    d3590
     PubMed
    CrossRef
     PubMed
  34. Harrison
    MJ
    ,  
    Dusheiko
    M
    ,  
    Sutton
    M
    ,  
    Gravelle
    H
    ,  
    Doran
    T
    ,  
    Roland
    M
    .  
    Effect of a national primary care pay for performance scheme on emergency hospital admissions for ambulatory care sensitive conditions: controlled longitudinal study.
    BMJ
    2014
    349
    g6423
     PubMed
    CrossRef
     PubMed
  35. Karunaratne
    K
    ,  
    Stevens
    P
    ,  
    Irving
    J
    ,  
    Hobbs
    H
    ,  
    Kilbride
    H
    ,  
    Kingston
    R
    ,  
    et al
    The impact of pay for performance on the control of blood pressure in people with chronic kidney disease stage 3–5.
    Nephrol Dial Transplant
    2013
    28
    2107
    16
     PubMed
    CrossRef
     PubMed
  36. Sutton
    M
    ,  
    Elder
    R
    ,  
    Guthrie
    B
    ,  
    Watt
    G
    .  
    Record rewards: the effects of targeted quality incentives on the recording of risk factors by primary care providers.
    Health Econ
    2010
    19
    1
    13
     PubMed
    CrossRef
     PubMed
  37. Calvert
    M
    ,  
    Shankar
    A
    ,  
    McManus
    RJ
    ,  
    Lester
    H
    ,  
    Freemantle
    N
    .  
    Effect of the Quality and Outcomes Framework on diabetes care in the United Kingdom: retrospective cohort study.
    BMJ
    2009
    338
    b1870
     PubMed
    CrossRef
     PubMed
  38. Hamilton
    FL
    ,  
    Laverty
    AA
    ,  
    Huckvale
    K
    ,  
    Car
    J
    ,  
    Majeed
    A
    ,  
    Millett
    C
    .  
    Financial incentives and inequalities in smoking cessation interventions in primary care: before-and-after study.
    Nicotine Tob Res
    2016
    18
    341
    50
     PubMed
    CrossRef
     PubMed
  39. Kontopantelis
    E
    ,  
    Springate
    DA
    ,  
    Ashworth
    M
    ,  
    Webb
    RT
    ,  
    Buchan
    IE
    ,  
    Doran
    T
    .  
    Investigating the relationship between quality of primary care and premature mortality in England: a spatial whole-population study.
    BMJ
    2015
    350
    h904
     PubMed
    CrossRef
     PubMed
  40. Millett
    C
    ,  
    Bottle
    A
    ,  
    Ng
    A
    ,  
    Curcin
    V
    ,  
    Molokhia
    M
    ,  
    Saxena
    S
    ,  
    et al
    Pay for performance and the quality of diabetes management in individuals with and without co-morbid medical conditions.
    J R Soc Med
    2009
    102
    369
    77
     PubMed
    CrossRef
     PubMed
  41. Murray
    J
    ,  
    Saxena
    S
    ,  
    Millett
    C
    ,  
    Curcin
    V
    ,  
    de Lusignan
    S
    ,  
    Majeed
    A
    .  
    Reductions in risk factors for secondary prevention of coronary heart disease by ethnic group in south-west London: 10-year longitudinal study (1998–2007).
    Fam Pract
    2010
    27
    430
    8
     PubMed
    CrossRef
     PubMed
  42. Norbury
    M
    ,  
    Fawkes
    N
    ,  
    Guthrie
    B
    .  
    Impact of the GP contract on inequalities associated with influenza immunisation: a retrospective population-database analysis.
    Br J Gen Pract
    2011
    61
    e379
    85
     PubMed
    CrossRef
     PubMed
  43. Simpson
    CR
    ,  
    Hannaford
    PC
    ,  
    Ritchie
    LD
    ,  
    Sheikh
    A
    ,  
    Williams
    D
    .  
    Impact of the pay-for-performance contract and the management of hypertension in Scottish primary care: a 6-year population-based repeated cross-sectional study.
    Br J Gen Pract
    2011
    61
    e443
    51
     PubMed
    CrossRef
     PubMed
  44. Smith
    CJ
    ,  
    Gribbin
    J
    ,  
    Challen
    KB
    ,  
    Hubbard
    RB
    .  
    The impact of the 2004 NICE guideline and 2003 General Medical Services contract on COPD in primary care in the UK.
    QJM
    2008
    101
    145
    53
     PubMed
    CrossRef
     PubMed
  45. Szatkowski
    L
    ,  
    McNeill
    A
    ,  
    Lewis
    S
    ,  
    Coleman
    T
    .  
    A comparison of patient recall of smoking cessation advice with advice recorded in electronic medical records.
    BMC Public Health
    2011
    11
    291
     PubMed
    CrossRef
     PubMed
  46. Taggar
    JS
    ,  
    Coleman
    T
    ,  
    Lewis
    S
    ,  
    Szatkowski
    L
    .  
    The impact of the Quality and Outcomes Framework (QOF) on the recording of smoking targets in primary care medical records: cross-sectional analyses from The Health Improvement Network (THIN) database.
    BMC Public Health
    2012
    12
    329
     PubMed
    CrossRef
     PubMed
  47. Tahrani
    AA
    ,  
    McCarthy
    M
    ,  
    Godson
    J
    ,  
    Taylor
    S
    ,  
    Slater
    H
    ,  
    Capps
    N
    ,  
    et al
    Diabetes care and the new GMS contract: the evidence for a whole county.
    Br J Gen Pract
    2007
    57
    483
    5
     PubMed
     PubMed
  48. Vaghela
    P
    ,  
    Ashworth
    M
    ,  
    Schofield
    P
    ,  
    Gulliford
    MC
    .  
    Population intermediate outcomes of diabetes under pay-for-performance incentives in England from 2004 to 2008.
    Diabetes Care
    2009
    32
    427
    9
     PubMed
    CrossRef
     PubMed
  49. Mason
    T
    ,  
    Sutton
    M
    ,  
    Whittaker
    W
    ,  
    McSweeney
    T
    ,  
    Millar
    T
    ,  
    Donmall
    M
    ,  
    et al
    The impact of paying treatment providers for outcomes: difference-in-differences analysis of the ‘payment by results for drugs recovery' pilot.
    Addiction
    2015
    110
    1120
    8
     PubMed
    CrossRef
     PubMed
  50. Kalwij
    S
    ,  
    French
    S
    ,  
    Mugezi
    R
    ,  
    Baraitser
    P
    .  
    Using educational outreach and a financial incentive to increase general practices' contribution to chlamydia screening in South-East London 2003–2011.
    BMC Public Health
    2012
    12
    802
     PubMed
    CrossRef
     PubMed
  51. Chang
    RE
    ,  
    Lin
    SP
    ,  
    Aron
    DC
    .  
    A pay-for-performance program in Taiwan improved care for some diabetes patients, but doctors may have excluded sicker ones.
    Health Aff (Millwood)
    2012
    31
    93
    102
     PubMed
    CrossRef
     PubMed
  52. Chen
    YC
    ,  
    Lee
    CT
    ,  
    Lin
    BJ
    ,  
    Chang
    YY
    ,  
    Shi
    HY
    .  
    Impact of pay-for-performance on mortality in diabetes patients in Taiwan: a population-based study.
    Medicine (Baltimore)
    2016
    95
    e4197
     PubMed
    CrossRef
     PubMed
  53. Chen
    CC
    ,  
    Cheng
    SH
    .  
    Does pay-for-performance benefit patients with multiple chronic conditions? Evidence from a universal coverage health care system.
    Health Policy Plan
    2016
    31
    83
    90
     PubMed
    CrossRef
     PubMed
  54. Cheng
    SH
    ,  
    Lee
    TT
    ,  
    Chen
    CC
    .  
    A longitudinal examination of a pay-for-performance program for diabetes care: evidence from a natural experiment.
    Med Care
    2012
    50
    109
    16
     PubMed
    CrossRef
     PubMed
  55. Hsieh
    HM
    ,  
    Tsai
    SL
    ,  
    Shin
    SJ
    ,  
    Mau
    LW
    ,  
    Chiu
    HC
    .  
    Cost-effectiveness of diabetes pay-for-performance incentive designs.
    Med Care
    2015
    53
    106
    15
     PubMed
    CrossRef
     PubMed
  56. Hsieh
    HM
    ,  
    Lin
    TH
    ,  
    Lee
    IC
    ,  
    Huang
    CJ
    ,  
    Shin
    SJ
    ,  
    Chiu
    HC
    .  
    The association between participation in a pay-for-performance program and macrovascular complications in patients with type 2 diabetes in Taiwan: a nationwide population-based cohort study.
    Prev Med
    2016
    85
    53
    9
     PubMed
    CrossRef
     PubMed
  57. Liao
    PJ
    ,  
    Lin
    TY
    ,  
    Wang
    TC
    ,  
    Ting
    MK
    ,  
    Wu
    IW
    ,  
    Huang
    HT
    ,  
    et al
    Long-term and interactive effects of pay-for-performance interventions among diabetic nephropathy patients at the early chronic kidney disease stage.
    Medicine (Baltimore)
    2016
    95
    e3282
     PubMed
    CrossRef
     PubMed
  58. Lee
    TT
    ,  
    Cheng
    SH
    ,  
    Chen
    CC
    ,  
    Lai
    MS
    .  
    A pay-for-performance program for diabetes care in Taiwan: a preliminary assessment.
    Am J Manag Care
    2010
    16
    65
    9
     PubMed
     PubMed
  59. Tan
    EC
    ,  
    Pwu
    RF
    ,  
    Chen
    DR
    ,  
    Yang
    MC
    .  
    Is a diabetes pay-for-performance program cost-effective under the National Health Insurance in Taiwan?
    Qual Life Res
    2014
    23
    687
    96
     PubMed
    CrossRef
     PubMed
  60. Lee
    CY
    ,  
    Chi
    MJ
    ,  
    Yang
    SL
    ,  
    Lo
    HY
    ,  
    Cheng
    SH
    .  
    Using financial incentives to improve the care of tuberculosis patients.
    Am J Manag Care
    2015
    21
    e35
    42
     PubMed
     PubMed
  61. Li
    YH
    ,  
    Tsai
    WC
    ,  
    Khan
    M
    ,  
    Yang
    WT
    ,  
    Lee
    TF
    ,  
    Wu
    YC
    ,  
    et al
    The effects of pay-for-performance on tuberculosis treatment in Taiwan.
    Health Policy Plan
    2010
    25
    334
    41
     PubMed
    CrossRef
     PubMed
  62. Chen
    HJ
    ,  
    Huang
    N
    ,  
    Chen
    LS
    ,  
    Chou
    YJ
    ,  
    Li
    CP
    ,  
    Wu
    CY
    ,  
    et al
    Does pay-for-performance program increase providers adherence to guidelines for managing hepatitis B and hepatitis C virus infection in Taiwan?
    PLoS One
    2016
    11
    e0161002
     PubMed
    CrossRef
     PubMed
  63. Martens
    JD
    ,  
    Werkhoven
    MJ
    ,  
    Severens
    JL
    ,  
    Winkens
    RA
    .  
    Effects of a behaviour independent financial incentive on prescribing behaviour of general practitioners.
    J Eval Clin Pract
    2007
    13
    369
    73
     PubMed
    CrossRef
     PubMed
  64. Pechlivanoglou
    P
    ,  
    Wieringa
    JE
    ,  
    de Jager
    T
    ,  
    Postma
    MJ
    .  
    The effect of financial and educational incentives on rational prescribing. A state-space approach.
    Health Econ
    2015
    24
    439
    53
     PubMed
    CrossRef
     PubMed
  65. Kirschner
    K
    ,  
    Braspenning
    J
    ,  
    Akkermans
    RP
    ,  
    Jacobs
    JE
    ,  
    Grol
    R
    .  
    Assessment of a pay-for-performance program in primary care designed by target users.
    Fam Pract
    2013
    30
    161
    71
     PubMed
    CrossRef
     PubMed
  66. Kiran
    T
    ,  
    Wilton
    AS
    ,  
    Moineddin
    R
    ,  
    Paszat
    L
    ,  
    Glazier
    RH
    .  
    Effect of payment incentives on cancer screening in Ontario primary care.
    Ann Fam Med
    2014
    12
    317
    23
     PubMed
    CrossRef
     PubMed
  67. Li
    J
    ,  
    Hurley
    J
    ,  
    DeCicca
    P
    ,  
    Buckley
    G
    .  
    Physician response to pay-for-performance: evidence from a natural experiment.
    Health Econ
    2014
    23
    962
    78
     PubMed
    CrossRef
     PubMed
  68. Michel-Lepage
    A
    ,  
    Ventelou
    B
    .  
    The true impact of the French pay-for-performance program on physicians' benzodiazepines prescription behavior.
    Eur J Health Econ
    2016
    17
    723
    32
     PubMed
    CrossRef
     PubMed
  69. Sicsic
    J
    ,  
    Franc
    C
    .  
    Impact assessment of a pay-for-performance program on breast cancer screening in France using micro data.
    Eur J Health Econ.
    2016
     PubMed
  70. Rat
    C
    ,  
    Penhouet
    G
    ,  
    Gaultier
    A
    ,  
    Chaslerie
    A
    ,  
    Pivette
    J
    ,  
    Nguyen
    JM
    ,  
    et al
    Did the new French pay-for-performance system modify benzodiazepine prescribing practices?
    BMC Health Serv Res
    2014
    14
    301
     PubMed
    CrossRef
     PubMed
  71. Greene
    J
    .  
    An examination of pay-for-performance in general practice in Australia.
    Health Serv Res
    2013
    48
    1415
    32
     PubMed
    CrossRef
     PubMed
  72. Zuckerman
    RB
    ,  
    Sheingold
    SH
    ,  
    Orav
    EJ
    ,  
    Ruhter
    J
    ,  
    Epstein
    AM
    .  
    Readmissions, observation, and the Hospital Readmissions Reduction Program.
    N Engl J Med
    2016
    374
    1543
    51
     PubMed
    CrossRef
     PubMed
  73. Figueroa
    JF
    ,  
    Tsugawa
    Y
    ,  
    Zheng
    J
    ,  
    Orav
    EJ
    ,  
    Jha
    AK
    .  
    Association between the Value-Based Purchasing pay for performance program and patient mortality in US hospitals: observational study.
    BMJ
    2016
    353
    i2214
     PubMed
    CrossRef
     PubMed
  74. Ryan
    AM
    ,  
    Burgess
    JF
    Jr
    ,  
    Pesko
    MF
    ,  
    Borden
    WB
    ,  
    Dimick
    JB
    .  
    The early effects of Medicare's mandatory hospital pay-for-performance program.
    Health Serv Res
    2015
    50
    81
    97
     PubMed
    CrossRef
     PubMed
  75. Ryan
    A
    ,  
    Sutton
    M
    ,  
    Doran
    T
    .  
    Does winning a pay-for-performance bonus improve subsequent quality performance? Evidence from the Hospital Quality Incentive Demonstration.
    Health Serv Res
    2014
    49
    568
    87
     PubMed
    CrossRef
     PubMed
  76. Andriole
    KP
    ,  
    Prevedello
    LM
    ,  
    Dufault
    A
    ,  
    Pezeshk
    P
    ,  
    Bransfield
    R
    ,  
    Hanson
    R
    ,  
    et al
    Augmenting the impact of technology adoption with financial incentive to improve radiology report signature times.
    J Am Coll Radiol
    2010
    7
    198
    204
     PubMed
    CrossRef
     PubMed
  77. Benzer
    JK
    ,  
    Young
    GJ
    ,  
    Burgess
    JF
    Jr
    ,  
    Baker
    E
    ,  
    Mohr
    DC
    ,  
    Charns
    MP
    ,  
    et al
    Sustainability of quality improvement following removal of pay-for-performance incentives.
    J Gen Intern Med
    2014
    29
    127
    32
     PubMed
    CrossRef
     PubMed
  78. Kristensen
    SR
    ,  
    Meacock
    R
    ,  
    Turner
    AJ
    ,  
    Boaden
    R
    ,  
    McDonald
    R
    ,  
    Roland
    M
    ,  
    et al
    Long-term effect of hospital pay for performance on mortality in England.
    N Engl J Med
    2014
    371
    540
    8
     PubMed
    CrossRef
     PubMed
  79. Kuo
    RN
    ,  
    Chung
    KP
    ,  
    Lai
    MS
    .  
    Effect of the Pay-for-Performance Program for Breast Cancer Care in Taiwan.
    J Oncol Pract
    2011
    7
    e8s
    e15s
     PubMed
    CrossRef
     PubMed
  80. Vermeulen
    MJ
    ,  
    Stukel
    TA
    ,  
    Boozary
    AS
    ,  
    Guttmann
    A
    ,  
    Schull
    MJ
    .  
    The effect of pay for performance in the emergency department on patient waiting times and quality of care in Ontario, Canada: a difference-in-differences analysis.
    Ann Emerg Med
    2016
    67
    496
    505
     PubMed
    CrossRef
     PubMed
  81. Colais
    P
    ,  
    Pinnarelli
    L
    ,  
    Fusco
    D
    ,  
    Davoli
    M
    ,  
    Braga
    M
    ,  
    Perucci
    CA
    .  
    The impact of a pay-for-performance system on timing to hip fracture surgery: experience from the Lazio Region (Italy).
    BMC Health Serv Res
    2013
    13
    393
     PubMed
    CrossRef
     PubMed
  82. Yang
    JH
    ,  
    Kim
    SM
    ,  
    Han
    SJ
    ,  
    Knaak
    M
    ,  
    Yang
    GH
    ,  
    Lee
    KD
    ,  
    et al
    The impact of Value Incentive Program (VIP) on the quality of hospital care for acute stroke in Korea.
    Int J Qual Health Care
    2016
    28
    580
    585
     PubMed
    CrossRef
     PubMed
  83. Gillam
    S
    ,  
    Steel
    N
    .  
    The Quality and Outcomes Framework—where next?
    BMJ
    2013
    346
    f659
     PubMed
    CrossRef
     PubMed
  84. Tricco
    AC
    ,  
    Ivers
    NM
    ,  
    Grimshaw
    JM
    ,  
    Moher
    D
    ,  
    Turner
    L
    ,  
    Galipeau
    J
    ,  
    et al
    Effectiveness of quality improvement strategies on the management of diabetes: a systematic review and meta-analysis.
    Lancet
    2012
    379
    2252
    61
     PubMed
    CrossRef
     PubMed
  85. Emanuel
    EJ
    ,  
    Ubel
    PA
    ,  
    Kessler
    JB
    ,  
    Meyer
    G
    ,  
    Muller
    RW
    ,  
    Navathe
    AS
    ,  
    et al
    Using behavioral economics to design physician incentives that deliver high-value care.
    Ann Intern Med
    2016
    164
    114
    9
    CrossRef
     PubMed
  86. Kondo
    KK
    ,  
    Damberg
    CL
    ,  
    Mendelson
    A
    ,  
    Motúapuaka
    M
    ,  
    Freeman
    M
    ,  
    O'Neil
    M
    ,  
    et al
    Implementation processes and pay for performance in healthcare: a systematic review.
    J Gen Intern Med
    2016
    31 Suppl 1
    61
    9
     PubMed
    CrossRef
     PubMed
  87. Kondo
    K
    ,  
    Damberg
    C
    ,  
    Mendelson
    A
    ,  
    Motúapuaka
    M
    ,  
    Freeman
    M
    ,  
    O'Neil
    M
    ,  
    et al
    Understanding the Intervention and Implementation Factors Associated with Benefits and Harms of Pay for Performance Programs in Healthcare.
    Washington, DC
    U.S. Department of Veterans Affairs
    2015
  88. Kansagara
    D
    ,  
    Tuepker
    A
    ,  
    Joos
    S
    ,  
    Nicolaidis
    C
    ,  
    Skaperdas
    E
    ,  
    Hickam
    D
    .  
    Getting performance metrics right: a qualitative study of staff experiences implementing and measuring practice transformation.
    J Gen Intern Med
    2014
    29 Suppl 2
    S607
    13
     PubMed
    CrossRef
     PubMed
  89. Powell
    AA
    ,  
    White
    KM
    ,  
    Partin
    MR
    ,  
    Halek
    K
    ,  
    Christianson
    JB
    ,  
    Neil
    B
    ,  
    et al
    Unintended consequences of implementing a national performance measurement system into local practice.
    J Gen Intern Med
    2012
    27
    405
    12
     PubMed
    CrossRef
     PubMed
  90. Kizer
    KW
    ,  
    Kirsh
    SR
    .  
    The double edged sword of performance measurement [Editorial].
    J Gen Intern Med
    2012
    27
    395
    7
     PubMed
    CrossRef
     PubMed
  91. Casalino
    LP
    ,  
    Gans
    D
    ,  
    Weber
    R
    ,  
    Cea
    M
    ,  
    Tuchovsky
    A
    ,  
    Bishop
    TF
    ,  
    et al
    US physician practices spend more than $15.4 billion annually to report quality measures.
    Health Aff (Millwood)
    2016
    35
    401
    6
     PubMed
    CrossRef
     PubMed
  92. Roland
    M
    ,  
    Campbell
    S
    .  
    Successes and failures of pay for performance in the United Kingdom.
    N Engl J Med
    2014
    370
    1944
    9
     PubMed
    CrossRef
     PubMed
  93. Parast
    L
    ,  
    Doyle
    B
    ,  
    Damberg
    CL
    ,  
    Shetty
    K
    ,  
    Ganz
    DA
    ,  
    Wenger
    NS
    ,  
    et al
    Challenges in assessing the process-outcome link in practice.
    J Gen Intern Med
    2015
    30
    359
    64
     PubMed
    CrossRef
     PubMed
  94. Findlay
    S
    ,  
    Berenson
    R
    ,  
    Lott
    R
    ,  
    Gnadinger
    T
    .  
    Health Policy Brief: Medicare's New Physician Payment System. A 2015 law has the potential to transform how Medicare pays physicians.
    Health Affairs.
    21 April 2016
Figure.

Literature flow diagram.

P4P = pay-for-performance.

* The current systematic review updates and expands on the review by Damberg and colleagues (7).

Table 1. Findings From Studies of Ambulatory-Based Pay-for-Performance Programs

Table 1. Findings From Studies of Ambulatory-Based Pay-for-Performance Programs

Table 2. Findings From Studies of Hospital-Based Pay-for-Performance Programs

Table 2. Findings From Studies of Hospital-Based Pay-for-Performance Programs

Table 3. Strength of the Evidence

Table 3. Strength of the Evidence
PDF Supplemental Content
Supplement. Supplemental Materials

Clinical Slide Sets

Terms of Use

The In the Clinic® slide sets are owned and copyrighted by the American College of Physicians (ACP). All text, graphics, trademarks, and other intellectual property incorporated into the slide sets remain the sole and exclusive property of the ACP. The slide sets may be used only by the person who downloads or purchases them and only for the purpose of presenting them during not-for-profit educational activities. Users may incorporate the entire slide set or selected individual slides into their own teaching presentations but may not alter the content of the slides in any way or remove the ACP copyright notice. Users may make print copies for use as hand-outs for the audience the user is personally addressing but may not otherwise reproduce or distribute the slides by any means or media, including but not limited to sending them as e-mail attachments, posting them on Internet or Intranet sites, publishing them in meeting proceedings, or making them available for sale or distribution in any unauthorized form, without the express written permission of the ACP. Unauthorized use of the In the Clinic slide sets will constitute copyright infringement.

This feature is available only to Registered Users

Subscribe/Learn More
Submit a Comment

0 Comments

PDF
Not Available
Citations
Citation

Mendelson A, Kondo K, Damberg C, et al. The Effects of Pay-for-Performance Programs on Health, Health Care Use, and Processes of Care: A Systematic Review. Ann Intern Med. 2017;166:341–353. [Epub ahead of print 10 January 2017]. doi: https://doi.org/10.7326/M16-1881

Download citation file:

  • Ris (Zotero)
  • EndNote
  • BibTex
  • Medlars
  • ProCite
  • RefWorks
  • Reference Manager

© 2019

×
Permissions

Published: Ann Intern Med. 2017;166(5):341-353.

DOI: 10.7326/M16-1881

Published at www.annals.org on 10 January 2017

73 Citations

See Also

With the Merit-Based Incentive Payment System, Pay for Performance Is Now National Policy
View MoreView Less

Related Articles

Raising the Bar in Attribution
Annals of Internal Medicine; 167 (6): 434-435
The Value-Based Payment Modifier: Program Outcomes and Implications for Disparities
Annals of Internal Medicine; 168 (4): 255-265
With the Merit-Based Incentive Payment System, Pay for Performance Is Now National Policy
Annals of Internal Medicine; 166 (5): 368-369
Collateral Damage: Pay-for-Performance Initiatives and Safety-Net Hospitals
Annals of Internal Medicine; 163 (6): 473-474
View MoreView Less

Journal Club

In patients with chest pain, HEART score–guided and usual care did not differ for MACE or health care use
Annals of Internal Medicine; 167 (4): JC22
Preoperative geriatric assessment reduced hospital stay and complications in patients ≥ 65 y having vascular surgery
Annals of Internal Medicine; 167 (2): JC2
Review: Interventions improve hospital antibiotic prescribing and reduce hospital stay but do not affect mortality
Annals of Internal Medicine; 166 (10): JC59
In older hospitalized patients, adding transitional care to in-hospital geriatric assessment did not improve ADL
Annals of Internal Medicine; 164 (12): JC63
View MoreView Less

Related Point of Care

Community-Acquired Pneumonia
Annals of Internal Medicine; 151 (7): ITC4-1
Community-Acquired Pneumonia
Annals of Internal Medicine; 163 (7): ITC1
View MoreView Less

Related Topics

Healthcare Delivery and Policy
High Value Care
Hospital Medicine

Healthcare Delivery and Policy, High Value Care, Hospital Medicine.

PubMed Articles

Individual participant data meta-analysis of intervention studies with time-to-event outcomes: A review of the methodology and an applied example.
Res Synth Methods 2019.
rhBMP2 alone does not induce macrophage polarization towards an increased inflammatory response.
Mol Immunol 2019;
View More

Results provided by: PubMed

CME/MOC Activity Requires Users to be Registered and Logged In.
Sign in below to access your subscription for full content
INDIVIDUAL SIGN IN
Sign In|Set Up Account
You will be directed to acponline.org to register and create your Annals account
Annals of Internal Medicine
CREATE YOUR FREE ACCOUNT
Create Your Free Account|Why?
To receive access to the full text of freely available articles, alerts, and more. You will be directed to acponline.org to complete your registration.
×
The Comments Feature Requires Users to be Registered and Logged In.
Sign in below to access your subscription for full content
INDIVIDUAL SIGN IN
Sign In|Set Up Account
You will be directed to acponline.org to register and create your Annals account
Annals of Internal Medicine
CREATE YOUR FREE ACCOUNT
Create Your Free Account|Why?
To receive access to the full text of freely available articles, alerts, and more. You will be directed to acponline.org to complete your registration.
×
link to top

Content

  • Home
  • Latest
  • Issues
  • Channels
  • CME/MOC
  • In the Clinic
  • Journal Club
  • Web Exclusives

Information For

  • Author Info
  • Reviewers
  • Press
  • Readers
  • Institutions / Libraries / Agencies
  • Advertisers

Services

  • Subscribe
  • Renew
  • Alerts
  • Current Issue RSS
  • Latest RSS
  • In the Clinic RSS
  • Reprints & Permissions
  • Contact Us
  • Help
  • About Annals
  • About Mobile
  • Patient Information
  • Teaching Tools
  • Annals in the News
  • Share Your Feedback

Awards and Cover

  • Personae (Cover Photo)
  • Junior Investigator Awards
  • Poetry Prize

Other Resources

  • ACP Online
  • Career Connection
  • ACP Advocate Blog
  • ACP Journal Wise

Follow Annals On

  • Twitter Link
  • Facebook Link
acp link acp
silverchair link silverchair

Copyright © 2019 American College of Physicians. All Rights Reserved.

Print ISSN: 0003-4819 | Online ISSN: 1539-3704

Privacy Policy

|

Conditions of Use

This site uses cookies. By continuing to use our website, you are agreeing to our privacy policy. | Accept
×

You need a subscription to this content to use this feature.

×
PDF Downloads Require Access to the Full Article.
Sign in below to access your subscription for full content
INDIVIDUAL SIGN IN
Sign In|Set Up Account
You will be directed to acponline.org to register and create your Annals account
INSTITUTIONAL SIGN IN
Open Athens|Shibboleth|Log In
Annals of Internal Medicine
PURCHASE OPTIONS
Buy This Article|Subscribe
You will be redirected to acponline.org to sign-in to Annals to complete your purchase.
CREATE YOUR FREE ACCOUNT
Create Your Free Account|Why?
To receive access to the full text of freely available articles, alerts, and more. You will be directed to acponline.org to complete your registration.
×
Access to this Free Content Requires Users to be Registered and Logged In. Please Choose One of the Following Options
Sign in below to access your subscription for full content
INDIVIDUAL SIGN IN
Sign In|Set Up Account
You will be directed to acponline.org to register and create your Annals account
Annals of Internal Medicine
CREATE YOUR FREE ACCOUNT
Create Your Free Account|Why?
To receive access to the full text of freely available articles, alerts, and more. You will be directed to acponline.org to complete your registration.
×