0
Clinical Guidelines |

Comparative Benefits and Harms of Second-Generation Antidepressants: Background Paper for the American College of Physicians FREE

Gerald Gartlehner, MD, MPH; Bradley N. Gaynes, MD, MPH; Richard A. Hansen, PhD, RPh; Patricia Thieda, MA; Angela DeVeaugh-Geiss, MS; Erin E. Krebs, MD, MPH; Charity G. Moore, PhD, MSPH; Laura Morgan, MA; and Kathleen N. Lohr, PhD
[+] Article and Author Information

From Danube University, Krems, Austria; University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Indiana University School of Medicine, Roudebush Veterans Affairs Medical Center, and Regenstrief Institute, Indianapolis, Indiana; University of Pittsburgh, Pittsburgh, Pennsylvania; and RTI International, Research Triangle Park, North Carolina.


Disclaimer: The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Healthcare Research and Quality or the U.S. Department of Health and Human Services of a particular drug, device, test, treatment, or other clinical service.

Acknowledgment: The authors thank Timothy S. Carey, MD, MPH, and Stacey Williams, MA, from the University of North Carolina at Chapel Hill, and also Linda Lux, MPA, and Loraine Monroe of RTI International.

Grant Support: By a contract from the Agency for Healthcare Research and Quality to the RTI International–University of North Carolina Evidence-based Practice Center (contract no. 290-02-0016).

Potential Financial Conflicts of Interest:Employment: A. DeVeaugh-Geiss (GlaxoSmithKline). Consultancies: B.N. Gaynes (Pfizer, Wyeth-Ayerst, Shire Pharmaceutical). Honoraria: B.N. Gaynes (GlaxoSmithKline). Stock ownership or options (other than mutual funds): A. DeVeaugh-Geiss (GlaxoSmithKline). Expert testimony: B.N. Gaynes (Phillips Lytle). Grants received: B.N. Gaynes (Agency for Healthcare Research and Quality, National Institute of Mental Health, Bristol-Myers Squibb, Novartis, Pfizer, Robert Wood Johnson Foundation, M-3 Corporation), R.A. Hansen (GlaxoSmithKline). Grants pending: B.N. Gaynes (National Institute of Mental Health, Agency for Healthcare Research and Quality).

Requests for Single Reprints: Gerald Gartlehner, MD, MPH, Danube University, Karl Dorrek-Straße, 3500 Krems, Austria; e-mail, gerald.gartlehner@donau-uni.ac.at.

Current Author Addresses: Dr. Gartlehner: Danube University, Karl Dorrek-Straße, 3500 Krems, Austria.

Dr. Gaynes: Department of Psychiatry, Campus Box 7160, University of North Carolina, Chapel Hill, NC 27599.

Dr. Hansen: University of North Carolina, School of Pharmacy, Campus Box 7360, Chapel Hill, NC 27599.

Ms. Thieda and Ms. Morgan: University of North Carolina, Sheps Center for Health Services Research, 725 Martin Luther King Jr. Boulevard, Chapel Hill, NC 27599.

Ms. DeVeaugh-Geiss: University of North Carolina, Department of Epidemiology, Campus Box 7435, Chapel Hill, NC 27599.

Dr. Krebs: Roudebush Veterans Affairs Medical Center, 1481 West 10th Street, Indianapolis, IN 46202.

Dr. Moore: Center for Research on Health Care Data, University of Pittsburgh, 200 Meyran Avenue, Suite 300, Pittsburgh, PA 15213.

Dr. Lohr: RTI International, PO Box 12194, 3040 Cornwallis Road, Research Triangle Park, NC 27709-2194.


Ann Intern Med. 2008;149(10):734-750. doi:10.7326/0003-4819-149-10-200811180-00008
Text Size: A A A

Background: Second-generation antidepressants dominate the management of major depressive disorder, dysthymia, and subsyndromal depression. Evidence on the comparative benefits and harms is still accruing.

Purpose: To compare the benefits and harms of second-generation antidepressants (bupropion, citalopram, duloxetine, escitalopram, fluoxetine, fluvoxamine, mirtazapine, nefazodone, paroxetine, sertraline, trazodone, and venlafaxine) for the treatment of depressive disorders in adults.

Data Sources: MEDLINE, EMBASE, PsychLit, Cochrane Central Register of Controlled Trials, and International Pharmaceutical Abstracts from 1980 to April 2007, limited to English-language articles. Reference lists of pertinent review articles were manually searched and the Center for Drug Evaluation and Research database was explored to identify unpublished research.

Study Selection: Abstracts and full-text articles were independently reviewed by 2 persons. Six previous good- or fair-quality systematic reviews or meta-analyses were included, as were 155 good- or fair-quality double-blind, placebo-controlled, or head-to-head randomized, controlled trials of at least 6 weeks' duration. For harms, 35 observational studies with at least 100 participants and follow-up of at least 12 weeks were also included.

Data Extraction: Using a standard protocol, investigators abstracted data on study design and quality-related details, funding, settings, patients, and outcomes.

Data Synthesis: If data were sufficient, meta-analyses of head-to-head trials were conducted to determine the relative benefit of response to treatment and the weighted mean differences on specific depression rating scales. If sufficient evidence was not available, adjusted indirect comparisons were conducted by using meta-regressions and network meta-analyses. Second-generation antidepressants did not substantially differ in efficacy or effectiveness for the treatment of major depressive disorder on the basis of 203 studies; however, the incidence of specific adverse events and the onset of action differed. The evidence is insufficient to draw conclusions about the comparative efficacy, effectiveness, or harms of these agents for the treatment of dysthymia and subsyndromal depression.

Limitation: Adjusted indirect comparisons have methodological limitations and cannot conclusively rule out differences in efficacy.

Conclusion: Current evidence does not warrant the choice of one second-generation antidepressant over another on the basis of differences in efficacy and effectiveness. Other differences with respect to onset of action and adverse events may be relevant for the choice of a medication.

Major depressive disorder (MDD) is the most prevalent axis I disorder, affecting more than 16% of U.S. adults during their lifetime (1). In 2000, the economic burden of depressive disorders was an estimated $83.1 billion (2), more than 30% of which was attributable to direct medical expenses.

Pharmacotherapy dominates the medical management of MDD. Since the mid-1980s, second-generation antidepressants have gradually replaced tricyclic antidepressants and monoamine oxidase inhibitors as first-line medications, primarily because of their lower toxicity in overdose and similar general efficacy (3). These newer treatments include selective serotonin reuptake inhibitors, serotonin and norepinephrine reuptake inhibitors, selective serotonin and norepinephrine reuptake inhibitors, and other second-generation drugs (Table 1).

Table Jump PlaceholderTable 1.  Second-Generation Antidepressants Approved for Use in the United States

To date, only 2 systematic reviews have assessed the comparative efficacy and harms of second-generation antidepressants (3 - 4). These studies reported no substantial differences in efficacy or harms among agents. However, because of a lack of direct head-to-head comparisons, assessments in both studies were primarily qualitative. Consequently, uncertainties persist about the differences among the drugs for which sufficient head-to-head evidence is lacking.

We systematically assessed evidence on the comparative benefits and harms of second-generation antidepressants for the acute, continuation, and maintenance phases of treatment of MDD; subsyndromal depression; and dysthymia and the comparative efficacy and effectiveness for such accompanying symptoms as anxiety, insomnia, or neurovegetative symptoms. We also sought to determine whether efficacy, effectiveness, and harms differed among subgroups of patients on the basis of age, sex, race or ethnicity, or comorbid conditions.

To our knowledge, this is the first meta-analysis of second-generation antidepressants to assess quantitatively all possible comparisons among drugs in this class. We update findings of an earlier report on these pharmaceuticals (5) for the Agency for Healthcare Research and Quality.

An open process (described at http://www.effectivehealthcare.ahrq.gov) involving the public, the Agency for Healthcare Research and Quality's Scientific Resource Center for Effective Health Care program, and various stakeholder groups produced key questions. We followed a standardized protocol for all review steps (5).

Data Sources

We searched MEDLINE, EMBASE, PsychLit, Cochrane Central Register of Controlled Trials, and International Pharmaceutical Abstracts from 1980 to April 2007. We used Medical Subject Heading terms when available and keywords when appropriate. We combined terms for depressive disorders with a list of 12 specific second-generation antidepressants—bupropion, citalopram, duloxetine, escitalopram, fluoxetine, fluvoxamine, mirtazapine, nefazodone, paroxetine, sertraline, trazodone, and venlafaxine—and their specific trade names. We limited electronic searches to “adult 19 + years,” “human,” and “English language.”

We manually searched reference lists of pertinent review articles and letters to the editor and used the Center for Drug Evaluation and Research database (up to April 2007) to identify unpublished research submitted to the U.S. Food and Drug Administration. The Scientific Resource Center invited pharmaceutical manufacturers to submit dossiers on completed research for each drug. We received dossiers from 3 pharmaceutical companies (Eli Lilly and Company, Indianapolis, Indiana; GlaxoSmithKline, Philadelphia, Pennsylvania; and Wyeth, Madison, New Jersey).

Study Selection

Two persons independently reviewed abstracts and relevant full-text articles. To assess efficacy or effectiveness regarding response, speed of onset, remission, maintenance of remission, and quality of life, we included head-to-head controlled trials of at least 6 weeks' duration that compared 1 drug with another. Because head-to-head evidence was lacking for many comparisons, we included placebo-controlled trials for indirect comparison models. To assess harms (specific adverse events, rates of adverse events, and discontinuations attributable to adverse events), we also examined data from observational studies with at least 100 participants and follow-up of at least 12 weeks. To assess differences of benefits and harms in subgroups and patients with accompanying symptoms, we reviewed both head-to-head and placebo-controlled trials. We included meta-analyses if we found them to be relevant for a key question and of good or fair methodological quality (6).

If both reviewers agreed that a study did not meet eligibility criteria, we excluded it. We also excluded studies that met eligibility criteria but were reported only as an abstract. Investigators resolved disagreements about inclusion or exclusion by consensus or by involving a third reviewer.

Data Extraction and Quality Assessment

We used a structured, Web-based data abstraction form (SRS 4.0, TrialStat, Ottawa, Ontario, Canada) onto which trained reviewers abstracted data from each study and assigned an initial quality rating. A senior reviewer read each abstracted article, evaluated completeness of data abstraction, and confirmed the quality rating. Investigators resolved disagreements by discussion and consensus or by consulting an independent party.

We assessed the internal validity (quality) of trials on the basis of predefined criteria and applied ratings of good, fair, or poor (5,7 - 8). Primary elements of quality assessment included randomization and allocation concealment, similarity of compared groups at baseline, blinding, use of intention-to-treat analysis, and overall and differential loss to follow-up. To assess observational studies, we used criteria involving selection of case patients or cohorts and control participants, adjustment for confounders, methods of outcomes assessment, length of follow-up, and statistical analysis (9). We rated studies with a fatal flaw in 1 or more categories as poor quality (Appendix Table 1) and did not include them in our analyses for this review unless no other head-to-head evidence was available. To identify effectiveness studies, we used a tool that distinguishes efficacy trials from effectiveness studies on the basis of certain elements of study design (10). Such studies have greater generalizability of results than efficacy trials because they enroll less selected study populations, use treatment modalities that mimic clinical practice, and assess health outcomes along with adverse events.

Lacking clear definitions about the equivalence of dosages among second-generation antidepressants in the published literature, we developed a roster of low, medium, and high dosages for each drug based on the interquartile dosing range (5). We used this roster, which does not indicate dosing equivalence, to detect gross inequalities in dosing that could affect comparative efficacy and effectiveness.

Table Jump PlaceholderAppendix Table 1.  Characteristics of Studies with Poor Internal Validity
Data Synthesis

If data were sufficient, we conducted meta-analyses of head-to-head comparisons. Efficacy outcomes included the relative benefit of achieving response (more than 50% improvement from baseline), which reflects the ratio of benefits in one treatment group to benefits in another, and the weighted mean difference of changes on the Hamilton Depression Rating Scale or the Montgomery-Asberg Depression Rating Scale.

For each meta-analysis, we conducted a test of heterogeneity (I2 index) and applied both random- and fixed-effects models. We report the random-effects results because the results from both models were very similar in all meta-analyses. We assessed publication bias by using funnel plots and the Begg adjusted rank correlation test (11) based on the Kendall τ coefficient.

Because no head-to-head evidence was available for the majority of drug comparisons, we conducted adjusted indirect comparisons (5). We employed meta-regressions of placebo-controlled trials by using individual drugs as covariates. When the number of trials was insufficient for meta-regressions, we used modified network meta-analysis (12). Evidence suggests that indirect comparisons agree with head-to-head trials if component studies are similar and treatment effects are expected to be consistent in patients included in different trials (13), although these assumptions are usually not verifiable.

All statistical analyses used StatsDirect Statistical Software program, version 2.3.8 (StatsDirect, Sale, United Kingdom); Stata, version 9.1 (StataCorp, College Station, Texas); and SAS, version 9.1 (SAS Institute, Cary, North Carolina).

Rating the Strength of Evidence

We rated the strength of the available evidence for specific key questions and outcomes in a 3-part hierarchy (high, moderate, and low) (5) by using a modified GRADE (Grading of Recommendations, Assessment, Development, and Evaluation) approach (14 - 15) that incorporates 4 key elements: study design, study quality, consistency of results, and directness (availability of data on outcomes or populations of interest).

Role of Funding Source

The Agency for Healthcare Research and Quality participated in formulating the key questions and reviewed and commented on planned methods and data analysis. The Agency had no role in study selection, quality ratings, or interpretation and synthesis of the evidence, although staff reviewed interim and final evidence reports and distributed them for external peer review by outside experts.

We identified 2318 citations from searches and reviews of reference lists (Figure 1). Of the 203 included studies (Appendix Tables 2 to 11), 140 (69.0%) were financially supported by pharmaceutical companies and 19 (9.3%) by governmental agencies or independent funds. For 44 (21.7%) studies, we could not determine the funding source.

Grahic Jump Location
Figure 1.
Study flow diagram.

The number of included articles differs from the number of included studies because some studies have multiple publications.

Grahic Jump Location
Table Jump PlaceholderAppendix Table 2.  Comparative Efficacy and Effectiveness Studies on Therapy for Major Depressive Disorder
Table Jump PlaceholderAppendix Table 3.  Comparative Efficacy and Effectiveness Studies on Therapy for Dysthymia
Table Jump PlaceholderAppendix Table 4.  Comparative Efficacy and Effectiveness Studies on Therapy for Subsyndromal Depressive Disorders
Table Jump PlaceholderAppendix Table 5.  Comparative Efficacy and Effectiveness Studies on Maintaining Remission and Preventing Relapse
Table Jump PlaceholderAppendix Table 6.  Comparative Efficacy and Effectiveness Studies on Therapy for Recurrent and Treatment-Resistant Depression
Table Jump PlaceholderAppendix Table 7.  Placebo-Controlled Studies of Relapse and Recurrence
Table Jump PlaceholderAppendix Table 8.  Comparative Efficacy and Effectiveness Studies of Treatment in Adults with Major Depressive Disorder and Accompanying Symptoms
Table Jump PlaceholderAppendix Table 9.  Studies of Comparative Risk for Harms in Adults with Major Depressive Disorder
Table Jump PlaceholderAppendix Table 10.  Comparative Efficacy and Effectiveness Studies in Subgroups
Table Jump PlaceholderAppendix Table 11.  Randomized, Placebo-Controlled Trials Included for Indirect Comparisons
Major Depressive Disorder

Overall, we found no substantial differences in comparative efficacy and effectiveness of second-generation antidepressants for treatment of MDD (Tables 2, 3, and 4 and Figures 2, 3, and 4). This finding pertains to the acute, continuation, and maintenance phases of treatment; to patients with accompanying symptom clusters; and to subgroups defined by age, race or ethnicity, sex, or comorbid conditions (we found only sparse evidence for subgroups). Nevertheless, second-generation antidepressants are not identical drugs. They differ somewhat with respect to onset of action and frequency of some adverse events. Generally, effectiveness studies with less stringent eligibility criteria provided results similar to those of efficacy trials, indicating good generalizability of our findings to primary care populations.

Table Jump PlaceholderTable 2.  Summary of Findings on General Effectiveness
Table Jump PlaceholderTable 3.  Summary of Findings on Adverse Events: Comparative Risk for Harms
Table Jump PlaceholderTable 4.  Summary of Findings on Effectiveness in Subgroups