Internet-Delivered Interpersonal Psychotherapy Versus Internet-Delivered Cognitive Behavioral Therapy for Adults With Depressive Symptoms: Randomized Controlled Noninferiority Trial
Background: Face-to-face cognitive behavioral therapy (CBT) and interpersonal psychotherapy (IPT) are both effective treatments for depressive disorders, but access is limited. Online CBT interventions have demonstrated efficacy in decreasing depressive symptoms and can facilitate the dissemination of therapies among the public. However, the efficacy of Internet-delivered IPT is as yet unknown.
Objective: This study examines whether IPT is effective, noninferior to, and as feasible as CBT when delivered online to spontaneous visitors of an online therapy website.
Methods: An automated, 3-arm, fully self-guided, online noninferiority trial compared 2 new treatments (IPT: n=620; CBT: n=610) to an active control treatment (MoodGYM: n=613) over a 4-week period in the general population. Outcomes were assessed using online self-report questionnaires, the Center for Epidemiological Studies Depression scale (CES-D) and the Client Satisfaction Questionnaire (CSQ-8) completed immediately following treatment (posttest) and at 6-month follow-up.
Results: Completers analyses showed a significant reduction in depressive symptoms at posttest and follow-up for both CBT and IPT, and were noninferior to MoodGYM. Within-group effect sizes were medium to large for all groups. There were no differences in clinical significant change between the programs. Reliable change was shown at posttest and follow-up for all programs, with consistently higher rates for CBT. Participants allocated to IPT showed significantly lower treatment satisfaction compared to CBT and MoodGYM. There was a dropout rate of 1294/1843 (70%) at posttest, highest for MoodGYM. Intention-to-treat analyses confirmed these findings.
Conclusions: Despite a high dropout rate and lower satisfaction scores, this study suggests that Internet-delivered self-guided IPT is effective in reducing depressive symptoms, and may be noninferior to MoodGYM. The completion rates of IPT and CBT were higher than MoodGYM, indicating some progress in refining Internet-based self-help. Internet-delivered treatment options available for people suffering from depression now include IPT.
Trial Registration: International Standard Randomized Controlled Trial Number (ISRCTN): 69603913; http://www.controlled-trials.com/ISRCTN69603913 (Archived by WebCite at http://www.webcitation.org/6FjMhmE1o)
J Med Internet Res 2013;15(5):e82)
- interpersonal relations;
- cognitive behavior therapy;
- depressive disorder;
- randomized controlled trial
Depression is a highly prevalent mental disorder  and it is expected to rank as the leading cause of burden of disease in high-income countries by 2030 [ ]. Depression is associated with serious disability [ ], loss in quality of life [ ], and substantial economic costs both at an individual and a societal level [ , ]. Both pharmacological and psychological treatments for depressive disorders are effective in reducing symptoms [ ]. Clinical practice guidelines recommend cognitive behavior therapy (CBT) and interpersonal psychotherapy (IPT) as options for psychological treatment [ - ]. CBT is based on the cognitive theory that negative automatic thoughts, maladaptive information processing, and avoidance behavior play a key role in the development and maintenance of depression [ ]. IPT originates from interpersonal theory [ ]. It links stressful life events and insufficient social support to the development and maintenance of depressive symptoms [ ]. Both psychotherapies are brief, highly structured, and can be manualized. CBT and IPT have shown to be effective in reducing depression symptoms compared to treatment as usual [ , , ]. Meta-analyses show that, when compared head-to-head, CBT and IPT do not differ significantly from one another in their effectiveness [ , , ].
Both CBT and IPT require significant therapist time. Long waiting lists caused by low workforce numbers are common . Perceived social stigma, which hinders help seeking [ ], and high costs [ ] may discourage individuals with a psychiatric disorder from seeking professional help. Internet-based self-help interventions offer potential solutions to these barriers. Immediately accessible and less costly, online interventions may offer a valuable alternative to face-to-face therapy. Previous studies and meta-analyses have demonstrated unguided Internet-based self-help interventions to be effective for common mental disorders, with a pooled effect size of 0.28, but dropout rates are high [ ]. CBT programs have been successfully delivered on the Internet [ , ]. However, to our knowledge, no study has examined the effectiveness of Internet-based IPT.
The present study examined the effectiveness of Internet-delivered IPT and a new Internet-delivered CBT module (from e-couch ) compared to an online CBT intervention (MoodGYM). MoodGYM was originally developed for youth, but has known efficacy in reducing depressive symptoms in adults [ - ]. The trial was designed within a noninferiority framework. Noninferiority trials are used when there is clear evidence of efficacy for an existing standard treatment, such that it is ethically unacceptable to employ a placebo or inactive control group [ ] and when a new treatment is hypothesized to have comparable, but not necessarily superior, effectiveness to the established intervention [ ]. We hypothesized that the new Internet-delivered modules of IPT and CBT would be noninferior to a CBT module (MoodGYM) in reducing symptoms of depression and anxiety. We also predicted that the Internet-delivered IPT module would be rated by its users as being as feasible, acceptable, and satisfactory as MoodGYM.
Participants and Procedure
This automated, 3-arm, fully self-guided, online noninferiority trial compared 2 new treatments (IPT and CBT) to an active control treatment (MoodGYM) for depressed individuals. The Internet-delivered CBT and IPT interventions (from e-couch) were developed at the Centre for Mental Health Research (CMHR) at the Australian National University (ANU). The e-couch program targets a range of conditions currently (depression, generalized anxiety disorder, social anxiety disorder) with other conditions to be added in the future. It also provides modules for bereavement and loss, as well as divorce and separation. It comprises a mental health literacy component and psychotherapeutic components for each condition (eg, CBT, IPT, applied relaxation, physical activity, and behavioral activation for depression). This study compared the IPT and CBT components with the 4-module version of MoodGYM. To increase external validity, there was no specific promotion for the trial. Spontaneous visitors from around the world who registered on the e-couch Internet website  between October 2009 and October 2010 and who showed interest in participating in the research trial (by clicking a “I want more information about the trial” button), were given information about the study. Those who provided both informed consent to participating in the trial (by clicking on the “I agree” button on the webpage) and an email address were then asked to complete an online baseline screening survey. Individuals who were 18 years of age or older and not currently receiving treatment for depression by a mental health specialist were eligible for inclusion in the study. Individuals with suicide intention, as measured with a suicidal ideation screening item on the Web Screening Questionnaire (WSQ) [ ], or those who scored above 27 (95th percentile or higher) on the Center for Epidemiological Studies Depression scale (CES-D) at baseline, were immediately provided with an information page containing advice about obtaining appropriate professional help, including emergency help. They could, however, continue to participate in the study. Excluded were individuals who were health professionals treating people with depression or anxiety, researchers reviewing depression or anxiety sites, or students studying anxiety or depression as part of a college or university course. Individuals who did not meet the inclusion criteria were directed to the public version of the e-couch program, which provides interventions for depression, generalized anxiety disorder, and social phobia. Individuals were not required to provide their real names, but were asked to use a pseudonym instead. Ethical approval for the study was provided by the Human Research Ethics Committee of the ANU (protocol number 2008/269).
Participants were randomly assigned to MoodGYM, CBT, or IPT, stratified by sex, age, and presenting depression symptom severity. The randomization schedule for participant allocation to condition groups was prepared by using an automated system built into the trial software, and randomization occurred automatically. The allocation sequence was concealed from the researchers. Participants randomized to the intervention groups were aware of the allocated arm. Following randomization, an automated email containing log-in details for the assigned program was sent to each participant, at which point the intervention could be accessed immediately.
All programs were offered over 4 weeks. Users were required to complete the modules in order. Participants were able to revisit previous pages of the modules and scores of previous assessments, but were not able to repeat the assessments. Each week an automated email was sent to advise participants of the availability of their new module. Participants were always offered the option to pause and restart at their chosen time. See- for screenshots of the 3 programs.
Internet-Delivered Cognitive Behavioral Therapy (CBT e-couch)
The Internet-delivered CBT intervention comprised 1 component of the depression stream of e-couch  and is based on the principles of CBT [ ]. In addition to an explanation of the rationale of CBT, the program consisted of 3 major modules: identifying negative thoughts, tackling negative thoughts, and undertaking behavioral activation (based on activity scheduling developed by Lewinsohn [ ]). The program contained 18 exercises and assessments in total, which were saved in a personal workbook.
Internet-Delivered Interpersonal Psychotherapy (IPT e-couch)
The Internet-delivered form of IPT comprised 1 component of the depression stream of e-couch . It consisted of 4 modules (grief, role disputes, role transition, and interpersonal deficits) and a personal workbook (containing 13 exercises and assessments). The IPT program was based on the IPT clinician manual of Weissman et al [ ], with each of the 4 IPT areas constructed to reflect the areas and topics relevant to each area. Interactive exercises reflected the topics and questions described in the Interpersonal Inventory. Participants did not choose IPT areas, but could decide the order in which they were completed; exercises within each of the IPT areas were not compulsory.
Internet-Delivered CBT (MoodGYM)
The online CBT package comprised a 4-module version of MoodGYM  delivered over 4 weeks. The details of the program are described elsewhere [ , ]. In this trial, a set of 4 of the CBT modules, a personal workbook (containing 22 exercises and assessments), and a feedback evaluation form were used. The modules cover the identification of and behavioral methods to overcome dysfunctional thinking, assertiveness, and self-esteem training. Each module takes approximately 20 to 40 minutes to complete [ ]. The relaxation module was removed from the program for this study to match the time length of the other 2 programs. Previous research has demonstrated that this component is not needed for efficacy [ ].
All questionnaires comprised online standard self-report measures taken at baseline (pretest), immediately after the intervention (posttest), and 6 months after the intervention (follow-up). Measures of participant characteristics were collected at baseline, symptoms measures were administered at all 3 time points, and user satisfaction was collected at posttest.
The survey included questions concerning sociodemographic characteristics (age, gender, country of origin, location, and education level), previous history of depression, previous use of treatments for depression, marital status, preference for randomization condition, perceived need for treatment, and current medication.
Primary Outcome Measures: Depressive Symptoms
Center for Epidemiological Studies Depression Scale
The 20-item self-report CES-D was used to assess depressive symptoms (item score: 0-3; total score range: 0-60) . The Internet CES-D is reliable and valid with a cut-off score of 22 (sensitivity: 0.94; specificity: 0.62) [ ]. The Cronbach alpha in this study was .90. Because the CES-D was administered online, a cut-off score of 22 is used in this study.
Secondary Outcomes: Satisfaction, User-Perceived Benefits, and Adherence
Client Satisfaction Questionnaire
The Client Satisfaction Questionnaire (CSQ-8) assesses global client satisfaction with treatments . The 8-item self-report questionnaire uses scale response options from 1 to 4, with total score ranges from 8 to 32. Previous research has reported that the CSQ-8 has high internal consistency [ ] and was comparable to the Cronbach alpha in this study (Cronbach alpha=.90).
Preference for randomization condition was assessed by asking the question “Do you have a preference to be in one of the programs?” at baseline. Participants replied with no preference or “yes, program 1, 2, 3.” These data were included in the analyses (no preference, preference: match/no match).
Adherence was measured in 2 ways for each individual: (1) completion of posttest surveys (all groups), and (2) the number of IPT, CBT, or MoodGYM modules completed.
Noninferiority trials require the a priori specification of a noninferiority margin. We used the confidence interval (CI) approach  to define the noninferiority margin for this study. The noninferiority margin of the primary outcome measure CES-D was set at a lower-bound 95% CI pre-post within-group effect size of 0.33, which was based on the lower-bound 95% CI margin of the pre-post within-group effect size of 0.56 (95% CI 0.33-0.79) of MoodGYM [ ]. To decrease the risk of type 1 error (false acceptance of an ineffective treatment), the standard deviation (SD) of the pretest was used in calculating the lower-bound 95% CI margin of the effect size of 0.56; this yields a conservative estimate of benefit. For noninferiority trials, the null is E-S≤-delta and the alternative hypotheses (1-sided) is E-S>-delta, where E is the experimental treatment and S is the standard treatment. shows the formula for calculating the t test statistic when testing noninferiority (using the formula of Mascha and Sessler [ ]).
The null hypothesis is rejected to claim noninferiority of the standard control treatment and the new treatment, if T is larger than the value of T from a t distribution with nE–nS–2df at 1–alpha. The P value is the probability of observing a larger value of Tl, if the null hypothesis (ie, inferiority) were true in the population sampled from. For a P value less than alpha, we reject the null hypothesis and conclude noninferiority . As this was a noninferiority trial, this criterion did not apply for the upper bound of the CI. Using 1-beta=.90 and alpha=.05 (2-sided), we needed at least 150 participants in each condition at posttest (a total sample of 450 participants) to reach sufficient statistical power.
Data integrity (distribution, outliers, skewness, and kurtosis) tests were conducted. Measures of skewness and kurtosis indicated deviations from normality for baseline CES-D scores because of some extreme responses. The Box-Cox model omitting the pretest scores was fitted and the transformed data were compared with the raw data using mixed model analyses. For ease of interpretation of the test results, only raw data are reported because the conclusions were the same. The baseline characteristics of the 3 groups were compared by using 1-way analysis of variance (ANOVA), and Kruskal-Wallis and Mann-Whitney U tests for continuous measures and chi-square (χ2) tests for categorical variables.
In noninferiority trials, intention-to-treat (ITT) analysis will often increase the risk of falsely claiming noninferiority (type I error) . Non-ITT analyses are preferred as a protection from ITTs increase of type I error risk [ ]. Therefore, the effects of the interventions were analyzed by using both ITT analyses and per-protocol approaches as recommended for noninferiority trials by Piaggio et al [ ]. There is greater confidence in results when the conclusions are consistent [ ].To conduct the per-protocol analysis, 2 groups were created: those who returned the posttest and follow-up surveys (completers), and those who completed half or more of the treatment modules and returned the surveys (adherent completers). Linear mixed models (LMM) were used for both types of analyses. Restricted maximum likelihood estimation was used with an unstructured covariance structure accommodation with participant effects. The LMM gives unbiased estimates of ITT effect under the assumption that data from participants who withdrew were missing at random (MAR). Test time was treated as a categorical variable because we were interested in the differences between groups on each occasion of measurement.
Between-group and within-group effect sizes were calculated according to Cohen’s d (standardized mean difference) . Clinically meaningful changes on the CES-D were assessed using the clinically significant change (CSC) formula (with a CES-D score <22) and the reliable change index (RCI) [ ]. The RCI reflects the degree of change that occurred beyond the fluctuations of an imprecise measuring instrument, with values greater than 1.96 representing statistically significant change [ ]. In the present study, pretest SD scores of the CES-D with a reliability of 0.90 were used in the RCI formula. In addition, analyses were also undertaken for the subsample of participants who had symptoms severe enough to be considered clinical cases at baseline (score ≥22 on the CES-D). Chi-square tests and 1-way ANOVA tests with Bonferroni correction at posttest were used to examine differences in treatment completers and noncompleters (those who completed less than half of the modules). Statistical analyses were conducted using SPSS version 19.0, except for the Box-Cox transformation procedures, which were conducted using Stata 9.
Of the 10,598 individuals who registered on e-couch during the trial period, 5796 expressed interest in the trial and proceeded to screening. Of these, 3166 did not meet the eligibility criteria (eg, under 18 years of age or currently receiving treatment for depression by a mental health specialist) and were excluded from the study. Of the remaining 2630 participants, 2045 provided informed consent; however, 116 of these did not subsequently verify their email address. Accordingly, a total of 1929 participants were randomized to 1 of the 3 conditions. However, 66 of these participants were excluded after randomization because it became apparent that they were ineligible at baseline for participation (eg, being a researcher or a student, n=21). In addition, 45 randomized participants did not complete the baseline assessment. This was missed at first screening because of a technical fault, but was picked up subsequently.shows the flowchart of participants (CONSORT-EHEALTH checklist [ ] presented in ).
Of the total sample (N=1843), 543 (29.46%) were aged between 25 to 29 years, and most were female, (1334/1843, 72.38%). Participants were primarily Australian or New Zealand residents (751/1843, 40.75%) and most were well educated, having completed postsecondary education (1606/1843, 87.14%). The mean CES-D baseline score was 36 (SD 11.52). There were no significant differences between the groups at baseline with respect to depressive symptoms (χ22= 3.1, P=.21), demographic characteristics (see), or treatment preference before randomization (P=.73).
Treatment Adherence and Attrition
In total, 30% (549/1843) of participants completed the posttest assessment and 28% (336/1843) completed the follow-up assessment. Of participants who were adherent to the program (completed 50% or more of the modules), 25.8% (476/1843) and 16% (294/1843) completed posttest and follow-up assessments, respectively. Of the IPT participants, 49.5% (307/620) completed at least half of the intervention (≥2 modules) and 27.3% (169/620) completed all modules. For the CBT participants, 37.7% (230/610) finished 2 or more modules and 14.4% (88/610) completed all modules. A total of 195 of 613 participants (31.8%) finished half or more of the MoodGYM program. Of these, 10.9% (67/613) finished the whole program. Reasons given for dropout included technical problems, personal issues (lack of time), disease-specific barriers (feeling too depressed to work on the program or not convinced that the program would help), general intervention problems (programs was taking too long, too much text to read, boring, or too repetitive), specific intervention issues (the examples were not relevant to the participant), or engagement issues (preferred to obtain help from somewhere other than a computer). However, most participants (1248/1294, 96%) did not provide any reason for dropout. Those who dropped out of treatment had significantly higher scores on the CES-D (χ21=4.3, P=.04), but differences were small (mean difference 1.26). Furthermore, dropout rates were significantly higher for participants assigned to MoodGYM (451/613, 74%) compared with IPT (414/620, 67%) or CBT (429/610, 70%; χ22= 6.8, P=.03). Those who dropped out of treatment were more likely to be female (914/1294, 71%; χ21= 6.6, P=.01), and under 50 years of age (1090/1294, 84%; χ21=21.6, P<.001). No significant differences were found for medication use (P=.73), treatment preference (P=.14), or marital status (P=.60).
Effectiveness and Noninferiority
Results are presented for 3 groups: all participants (all those enrolled in the trial, ITT), completers (those completing online surveys at posttest and at 6-month follow-up), and adherent completers (those completing ≥50% of the modules).
For completers, the within-group effect sizes on the primary outcome measure CES-D were large for all treatments at posttest (IPT d=0.76 vs CBT d=0.87) and follow-up (IPT d=1.02 vs CBT d=1.44). Between-group effect sizes were small (posttest: IPT vs MoodGYM d=0.14, 95% CI –0.06 to 0.35; CBT vs MoodGYM d=0.05, 95% CI –0.17 to 0.26; follow-up: IPT vs MoodGYM d=0.18, 95% CI –0.09 to 0.45; CBT vs MoodGYM d=0.12, 95% CI –0.15 to 0.39). Within-group effect sizes for adherent completers ranged from d=0.74 to d=0.90 at posttest and d=1.02 to d=1.33 at follow-up. The between-group effect size for IPT vs MoodGYM was higher (posttest: d=0.23, 95% CI 0.0-0.46; follow-up: d=0.31, 95% CI 0.02-0.60) than that for CBT vs MoodGYM (posttest: d=0.02, 95% CI –0.25 to 0.22; follow-up: d=0.04, 95% CI –0.26 to 0.34). The ITT analyses yielded medium within-group effect sizes (d=0.59 to d=0.67 at posttest and d=0.66 to d=0.80 at follow-up). Between-group effect sizes were small (posttest: IPT vs MoodGYM d=0.09, 95% CI–0.02 to 0.21); CBT vs MoodGYM d=0.01, 95% CI –0.10 to 0.12; follow-up: IPT vs MoodGYM d=0.09, 95% CI –0.02 to 0.21; CBT vs MoodGYM d=0.03, 95% CI –0.08 to 0.14). Seeand .
The previously determined noninferiority margin (d=0.33) was converted to delta=3.795 points differences on the CES-D (based on a SD of 11.5). Using the formula of Mascha and Sessler , a completers analysis indicated that IPT compared to MoodGYM was found to be noninferior at posttest (t366=4.046, P<.001, 95% CI –0.89 to 4.73). The mean difference between IPT and MoodGYM on the CES-D for completers at posttest was 1.92 points (95% CI –0.86 to 4.70, P=.17). MoodGYM participants scored nonsignificantly lower at posttest. CBT completers were also found to be noninferior to MoodGYM (t341=2.142, P=.02; 95% CI –3.57 to 2.33), with a mean difference at posttest of 0.62 points (lower for CBT) which was not statistically significant (95% CI –2.30 to 3.54, P=.68). For adherent completers, results were similar (IPT vs MoodGYM posttest: t316=4.506, P<.001; CBT vs MoodGYM posttest: t282=2.246, P<.001; 95% CI –3.39 to 2.91). There was a nonsignificant mean difference between IPT and MoodGYM on the CES-D for adherent completers at posttest of 3.05 which was higher for IPT (95% CI 0.06-6.04, P=.05), but not for CBT vs MoodGYM (mean difference: 0.24; 95% CI –2.88 to 3.36, P=.88, lower for CBT). An ITT analysis also indicated that IPT and CBT were found to be noninferior to MoodGYM (IPT: t1231=4.769, P<.001, 95% CI –0.41 to 4.43; CBT: t1221=3.207, P<.001, 95% CI –2.27 to 2.71). Mean depression scores were not significantly different across the 3 programs at posttest (IPT vs MoodGYM: 2.01, 95% CI –0.32 to 4.34, P=.09, higher for IPT; CBT vs MoodGYM: 0.22, 95% CI: –2.17 to 2.61, P=.86, higher for CBT).
presents the means and SDs for completers, adherent completers, and the ITT sample as produced by the LMM procedure. Because LMM does not yield SDs, we calculated them manually by using the formula SD=SEM×√N. For the completers of posttest and/or follow-up, there was a significant overall improvement over time for all groups on the CES-D (F2,434.0=290.309, P<.001). There was no significant group×time interaction effects on the CES-D at posttest (F4,436.3=1.15, P=.33). Results were similar for the ITT sample and the adherent completers (see ).
Residuals of the models were inspected and showed nonnormality. Therefore, to be thorough, power transforms were estimated fitted using a Box-Cox model that included the same terms as the mixed model omitting the pretest scores. The test of deviations of residuals from normality was significant for just the IPT group at posttest (t386.273=2.36, P=.02). We compared the contrast results by using the transformed data to the raw data, indicated that they have the same pattern of significance. Because the normality violation was not profound and because it is easier to interpret raw data (and retransformation of model estimates is not always appropriate), we have presented raw data in this paper.
Clinically Significant Change and Reliable Change Index for Completers
For completers, no significant differences in CSC were found between the 3 programs at posttest (χ22=1.78, Ρ=.41) and follow-up (χ22=3.70, P=.16). The number of participants showing CSC at posttest were n=61 for IPT, n=65 for CBT, and n=52 for MoodGYM. For adherent completers, results were similar. Using the formula of Jacobson and Truax  for RCI (the degree of change that occurred beyond the fluctuations of an imprecise measuring instrument) with a Cronbach alpha of .90, all programs reached the RCI critical value of 1.96 at posttest (IPT: 2.01; CBT: 2.41; MoodGYM: 2.21) and follow-up (IPT: 2.78; CBT: 3.82; MoodGYM: 2.86). The RCI results were similar for adherent completers at posttest (IPT: 1.97; CBT: 2.40; MoodGYM: 2.47) and follow-up (IPT: 2.76; CBT: 3.59; MoodGYM: 3.28; see ).
Analyses were also undertaken for the subsample of participants who had symptoms severe enough to be considered clinical cases at baseline. A CES-D value of ≥22 is considered to indicate clinical caseness . For the ITT sample scoring ≥22 on the CES-D, analyses showed a significant overall improvement over time for all groups on the CES-D (F2,368.8=306.190, P<.001). No differences between the treatments over time were found on the CES-D (F4,451.2=0.976, P=.42). Within-group effect sizes on the primary outcome measure CES-D were small for all treatments at posttest (IPT d=0.55; CBT and MoodGYM d=0.56) and medium at follow-up (CBT d=0.65; MoodGYM d=0.61) except for IPT, which was small (IPT d=0.58). The RCI was below the critical value of 1.96 at posttest for all programs. At follow-up, all programs reached the critical value of 1.96 (IPT: 2.26; CBT: 2.51; MoodGYM: 2.46). For completers and adherent completers scoring ≥22 on the baseline CES-D, results were similar (see ). RCI for completers was above 1.96 for all programs at posttest (IPT: 2.93; CBT: 3.26; MoodGYM: 3.47) and follow-up (IPT: 3.84; CBT: 4.80; MoodGYM: 4.97). RCI for adherent completers was above 1.96 for all programs at posttest (IPT: 2.95; CBT: 3.40; MoodGYM: 3.57) and follow-up (IPT: 4.00; CBT: 4.93; MoodGYM: 4.77, see ).
There was a significant difference between the 3 interventions in treatment satisfaction scores at posttest for completers as measured with the CSQ-8 (F2,535=18.75, P<.001). Post hoc analyses using Tukey’s honestly significant difference (HSD) test showed that participants randomized to IPT (n=201) had a significantly lower total satisfaction score (mean 20.55, SD 4.80) compared to MoodGYM (n=158; mean 22.81, SD 4.58) with a mean difference of 2.26 (SD 0.49, P<.001) and CBT (n=179; mean 23.26, SD 4.47) with a mean difference of 2.71 (SD 0.48, P<.001).
Principal Results: Noninferiority, Effectiveness, and Efficacy
The present study is the first to show that Internet-delivered IPT can be effective in the treatment of depressive symptoms at posttest and at 6-month follow-up. Both the IPT and the CBT online interventions employed in the trial showed significant medium to large within-group effect sizes on the CES-D for completers and adherent completers. For the ITT sample, effect sizes were smaller, but still moderate in size. Of the clinical cases, completers and adherent completers showed medium to large effect sizes on posttest and follow-up ratings. We found that IPT and CBT were noninferior compared to MoodGYM for those who returned posttest, and between-group effect sizes were small. Although recent MoodGYM studies report similar effect sizes to our study, our conclusions need to be taken with some caution given that the effect size found in this study differed from the effect size from the initial study, and therefore might hamper assay sensitivity. Furthermore, the new CBT program reached consistently higher, but not significant, effect sizes compared to the IPT and the standard MoodGYM program. Overall, the between-group effect sizes were larger for IPT versus MoodGYM compared to CBT versus MoodGYM.
Comparison With Prior Work
Our findings of the equivalent effectiveness of CBT and IPT are concordant with previous research on face-to-face interventions . The within-group effect size for completers of MoodGYM found in our study was similar to that of 2 recently published studies of unguided MoodGYM [ , ], but was considerably higher than the trial conducted by its originators in 2004 [ , ]. To be able to draw reliable conclusions of noninferiority, it is important to establish effect sizes of similar size to prior trials. To minimize bias, it is important to replicate the conditions under which the control treatment was previously examined (eg, the same population sample, outcome measures, assessment time points, and delivery of treatment). MoodGYM is automated and has fidelity as an intervention. We have no reason to assume that omission of the MoodGYM relaxation module accounts for the difference in the effect size found in this study, because removal did not affect treatment effectiveness in a previous dismantling study [ ]. The most likely cause of the observed differences lies in the different samples recruited. The present study recruited participants directly from those visiting a self-help website, whereas the original study consisted of a sample of participants selected randomly from the Australian electoral roll. The present sample had higher depression scores at baseline compared to the original trial [ , ]. The difference in dropout rate (42% in the original study vs 70% in our study) might also account for the higher effect size we found for our completer analysis, because the effect size in our ITT sample (MoodGYM d=0.66) is more similar to the effect size of the completers in the originators study (MoodGYM d=0.56) [ , ]. Lower dropout rates in the original study may have arisen from the addition of weekly phone calls by lay interviewers, which might be considered as minimal contact therapy [ ] and might affect the dropout rate [ ]. However, a recent study by Farrer et al [ ] found no significant difference in dropout between participants receiving weekly telephone calls in addition to MoodGYM and those receiving only self-guided MoodGYM. This aside, we can conclude that Internet-delivered IPT is likely to be an effective treatment for depressive symptoms, and thereby offers people with depression another online treatment option
The CSC rates in the current study were lower than those reported in other online studies [, ]. One explanation for this finding might be that those studies incorporated guidance, whereas this study was fully automated. Another explanation might be that the baseline CES-D scores were higher in this study than typically found in other studies [ ]. Hence, the drop in CES-D score required to achieve a score in the nonclinical range (ie, a CSC) is more difficult to reach. Clinical cases of the ITT samples showed no reliable change. However, completers of clinical cases showed reliable change for all programs at posttest and follow-up. CBT reached consistently higher RCI rates. CBT might be superior to IPT and/or MoodGYM. However, as we did not set our hypothesis a priori to test noninferiority between IPT and CBT, or superiority between CBT and the other programs, conclusions cannot be drawn because of insufficient power.
Feasibility and Satisfaction
There was no significant difference in treatment preference at baseline before randomization. This lack of preference for treatment condition is important, because it suggests that a disparity between the preferred and allocated conditions was unlikely to negatively impact disproportionately on the findings. However, treatment satisfaction ratings were significantly lower for the IPT program compared with MoodGYM and CBT. One explanation for these findings may be related to what people were looking for in an online intervention. Also, although there were no patient program preferences before randomization, it is unknown how participants felt about being randomized to IPT immediately postrandomization but before exposure to the treatment. Online CBT is widely known, whereas fewer individuals know about IPT. To the extent that the social psychological literature has demonstrated that familiarity breeds liking, it may be possible that differences in satisfaction in treatment were driven in part by differences in familiarity with each treatment. Another explanation may be that the IPT program was too brief. Adherence to the treatment was considerably lower than the original MoodGYM trial . As mentioned earlier, the influence of weekly telephone calls in the original study might have influenced the dropout rate, as might the source of participants from among spontaneous visitors to a self-help website. Some studies of unguided self-help have reported similar dropout rates [ , , ], but others have not [ , ]. Completion rates for the new IPT and CBT programs were significantly higher than for MoodGYM. This could suggest that the new programs are more acceptable, particularly to adults. Within a noninferiority framework, this finding is very important, because newer implementations of e-therapy were at least as effective as MoodGYM, whereas completion rates—a key problem in this field [ ]—were higher. However, MoodGYM is a well-known open-access program. Some participants assigned to MoodGYM could have undertaken the program previously, and if so may have been less willing to finish the intervention.
This study has several limitations. First, as mentioned previously, the effect size found in this study differs from that on which we based the noninferiority margin and power calculation. Second, the noninferiority margin of the primary outcome measure is usually based on the lower-bound CI of the between-group effect size of the traditional treatment . In our case, this would be an effect size of 0.33 and a lower-bound 95% CI of 0.11 [ ]. To reach sufficient statistical power to be able to detect a significant difference, we would need at least 14,000 participants per condition. Therefore, we used an alternative approach to calculate the lower-bound noninferiority margin. Based on the study of Mackinnon et al [ ] we used the within-group effect size of 0.56 instead. This resulted in a noninferiority margin of an effect size of 0.33, which is a 3.795 difference on the CES-D. Although this difference is still liberal, an effect size of 0.30 is considered as the minimum for clinically meaningful change [ ]. Third, LMM is based on the MAR assumption, while dropout rates were very high. It is widely recognized that the MAR assumption is untestable. MAR assumes that the pattern of missing data does not depend on the unobserved data. This is a substantially weaker assumption than missing completely at random (MCAR), in which the data are missing independent of values of the observed and unobserved data. Therefore, LMM is the most robust of the methods for analyzing the data.
Low adherence, however, could underestimate differences between groups, and therefore increase the likelihood of finding noninferiority. However, our completer analyses revealed no statistical differences in effectiveness between the 3 programs. Nevertheless, our conclusions need to be taken with caution given the high dropout rates. One possible explanation of the difference in attrition rates across the programs might be that MoodGYM takes longer to complete compared with the other programs, and lengthier programs might be associated with greater attrition . Although MoodGYM had the highest dropout rate, dropout was high among all conditions, a finding that is common for Internet interventions. High dropout rates are likely with minimal exclusion criteria, unguided interventions [ , ], and little or no financial commitment [ ]. However, a recent study by Hilvert-Bruce et al [ ] showed that noncompleters derive benefit before dropping out. Also, there were significant baseline differences (CES-D score, gender, age) between participants who completed the programs and those who did not, which might indicate selection bias. The ITT analyses demonstrated effects nevertheless. In addition, it was unknown whether participants used other treatments during the study. This could mask real differences between groups if this use of additional treatments was more prevalent for 1 group compared with the others. Because we did not measure additional use of other treatments, we cannot rule this possibility out entirely. Finally, in face-to-face IPT, one focus is chosen, whereas in the Internet-delivered IPT, all modules were undertaken by the participant.
Future research is needed to replicate IPT noninferiority compared to CBT programs, to test whether the new CBT program is superior to other programs, to examine whether guided Internet-delivered IPT is as effective as face-to-face IPT, to investigate methods to improve adherence, to investigate whether IPT would also be effective outside of a randomized controlled trial setting, and whether Internet-delivered IPT is also effective in the treatment of other disorders, such as social phobia or panic disorder. It is important that future research investigates individual characteristics, such as recent life events, that predict treatment response for IPT. There will also be value in investigating whether a planned extended version of e-couch IPT will yield higher satisfaction ratings.
Although a firm conclusion regarding the noninferiority of IPT and the sustainability of results compared to CBT cannot be drawn yet, we can conclude that Internet-delivered IPT is an effective treatment for depressive symptoms, and thereby offers those with depression another online treatment option. An Internet-accessed IPT program could potentially be more appealing to IPT-trained therapists than a CBT-based one, perhaps making such clinicians more likely to recommend it to their clients. In the United Kingdom, the first wave of the Increasing Access to Psychological Therapies (IAPT) initiative was CBT only, but recently it has been expanded to other approaches, including IPT. Given that MoodGYM is already a resource used within IAPT (mostly without support), Internet-delivered IPT could well be a feasible option in second-wave IAPT services. Furthermore, the new e-couch CBT program was shown to be noninferior to the active CBT-based control program and thus may provide an open-access alternative to MoodGYM. Another important finding is that the completion rates of the new treatments were higher, indicating some progress in refining Internet-based self-help.
This study was funded by the Faculty of Psychology and Education of the VU University, Amsterdam, and the Center for Mental Health Research at The Australian National University. HC is supported by NHMRC Fellowship 525411. KG is supported by an NHMRC Senior Research Fellowship No 525413. We wish to thank Ada Tam, Philip Batterham, Jos Twisk, Bob Forrester, Filip Smit, and Lisanne Warmerdam for their helpful comments and participation in the project.
Conflicts of Interest
The interventions investigated in the current study were developed at the Centre for Mental Health Research, The Australian National University. HC and KG are authors and developers of the MoodGYM and e-couch websites, but derive no personal or financial benefit from their operation.
Multimedia Appendix 1
CONSORT-EHEALTH checklist V1.6.2 [PDF File (Adobe PDF File), 996KB ].
Multimedia Appendix 2
Mean depression scores over time for interpersonal psychotherapy (IPT), cognitive behavior therapy (CBT), and MoodGYM (intention-to-treat, N=1843).PDF File (Adobe PDF File), 11KB
Multimedia Appendix 3
Results and effect sizes (Cohen's d) for Center for Epidemiological Studies Depression scale (CES-D) per protocol and intention-to-treat (ITT) analyses for clinical cases (≥22 on the CES-D).PDF File (Adobe PDF File), 48KB
- Kruijshaar ME, Barendregt J, Vos T, de Graaf R, Spijker J, Andrews G. Lifetime prevalence estimates of major depression: an indirect estimation method and a quantification of recall bias. Eur J Epidemiol 2005;20(1):103-111. [Medline]
- Mathers CD, Loncar D. Projections of global mortality and burden of disease from 2002 to 2030. PLoS Med 2006 Nov;3(11):e442 [FREE Full text] [CrossRef] [Medline]
- van Schaik DJ, van Marwijk HW, Beekman AT, de Haan M, van Dyck R. Interpersonal psychotherapy (IPT) for late-life depression in general practice: uptake and satisfaction by patients, therapists and physicians. BMC Fam Pract 2007;8:52 [FREE Full text] [CrossRef] [Medline]
- Cuijpers P, de Graaf R, van Dorsselaer S. Minor depression: risk profiles, functional disability, health care use and risk of developing major depression. J Affect Disord 2004 Apr;79(1-3):71-79. [CrossRef] [Medline]
- Kessler RC, McGonagle KA, Zhao S, Nelson CB, Hughes M, Eshleman S, et al. Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States. Results from the National Comorbidity Survey. Arch Gen Psychiatry 1994 Jan;51(1):8-19. [Medline]
- Smit F, Cuijpers P, Oostenbrink J, Batelaan N, de Graaf R, Beekman A. Costs of nine common mental disorders: implications for curative and preventive psychiatry. J Ment Health Policy Econ 2006 Dec;9(4):193-200. [Medline]
- Cuijpers P, Andersson G, Donker T, van Straten A. Psychological treatment of depression: results of a series of meta-analyses. Nord J Psychiatry 2011 Dec;65(6):354-364. [CrossRef] [Medline]
- Practice guideline for the treatment of patients with major depressive disorder (revision). American Psychiatric Association. Am J Psychiatry 2000 Apr;157(4 Suppl):1-45. [Medline]
- Ellis P, Royal Australian and New Zealand College of Psychiatrists Clinical Practice Guidelines Team for Depression. Australian and New Zealand clinical practice guidelines for the treatment of depression. Aust N Z J Psychiatry 2004 Jun;38(6):389-407. [CrossRef] [Medline]
- National Institute for Clinical Excellence (NICE). Depression: management of depression in primary and secondary care (amended). In: NICE Clinical Practice Guideline 23. London: NICE; 2007.
- Beck AT, Rush AJ, Shaw BF, Emery G. Cognitive Therapy of Depression. New York: Guilford Press; 1979.
- Klerman GL, Budman S, Berwick D, Weissman MM, Damico-White J, Demby A, et al. Efficacy of a brief psychosocial intervention for symptoms of stress and distress among patients in primary care. Med Care 1987 Nov;25(11):1078-1088. [Medline]
- Weissman MM, Markowitz JC, Klerman GL. Clinician's Quick Guide to Interpersonal Psychotherapy. New York: Oxford University Press; 2007.
- Cuijpers P, Geraedts AS, van Oppen P, Andersson G, Markowitz JC, van Straten A. Interpersonal psychotherapy for depression: a meta-analysis. Am J Psychiatry 2011 Jun;168(6):581-592. [CrossRef] [Medline]
- Jakobsen JC, Hansen JL, Simonsen S, Simonsen E, Gluud C. Effects of cognitive therapy versus interpersonal psychotherapy in patients with major depressive disorder: a systematic review of randomized clinical trials with meta-analyses and trial sequential analyses. Psychol Med 2012 Jul;42(7):1343-1357. [CrossRef] [Medline]
- Cameron PA, Thompson DR. Changing the health-care workforce. Int J Nurs Pract 2005 Feb;11(1):1-4. [CrossRef] [Medline]
- Barney LJ, Griffiths KM, Jorm AF, Christensen H. Stigma about depression and its impact on help-seeking intentions. Aust N Z J Psychiatry 2006 Jan;40(1):51-54. [CrossRef] [Medline]
- Palmqvist B, Carlbring P, Andersson G. Internet-delivered treatments with or without therapist input: does the therapist factor have implications for efficacy and cost? Expert Rev Pharmacoecon Outcomes Res 2007 Jun;7(3):291-297. [CrossRef] [Medline]
- Andrews G, Cuijpers P, Craske MG, McEvoy P, Titov N. Computer therapy for the anxiety and depressive disorders is effective, acceptable and practical health care: a meta-analysis. PLoS One 2010;5(10):e13196 [FREE Full text] [CrossRef] [Medline]
- Griffiths KM, Farrer L, Christensen H. The efficacy of internet interventions for depression and anxiety disorders: a review of randomised controlled trials. Med J Aust 2010 Jun 7;192(11 Suppl):S4-11. [Medline]
- e-couch. URL: https://www.ecouch.anu.edu.au/welcome [accessed 2013-01-22] [WebCite Cache]
- Christensen H, Griffiths KM, Jorm AF. Delivering interventions for depression by using the internet: randomised controlled trial. BMJ 2004 Jan 31;328(7434):265 [FREE Full text] [CrossRef] [Medline]
- Farrer L, Christensen H, Griffiths KM, Mackinnon A. Internet-based CBT for depression with and without telephone tracking in a national helpline: randomised controlled trial. PLoS One 2011;6(11):e28099 [FREE Full text] [CrossRef] [Medline]
- Mackinnon A, Griffiths KM, Christensen H. Comparative randomised trial of online cognitive-behavioural therapy and an information website for depression: 12-month outcomes. Br J Psychiatry 2008 Feb;192(2):130-134 [FREE Full text] [CrossRef] [Medline]
- Pocock SJ. The pros and cons of noninferiority trials. Fundam Clin Pharmacol 2003 Aug;17(4):483-490. [Medline]
- Mascha EJ, Sessler DI. Equivalence and noninferiority testing in regression models and repeated-measures designs. Anesth Analg 2011 Mar;112(3):678-687 [FREE Full text] [CrossRef] [Medline]
- Donker T, van Straten A, Marks I, Cuijpers P. A brief Web-based screening questionnaire for common mental disorders: development and validation. J Med Internet Res 2009;11(3):e19 [FREE Full text] [CrossRef] [Medline]
- Lewinsohn PM. The behavioral study and treatment of depression. In: Hersen M, Eisler RM, Miller PM, editors. Progress in Behavioral Modification, 1st ed. New York: Academic; 1975:19-65.
- MoodGYM. URL: https://moodgym.anu.edu.au/welcome [accessed 2013-01-22] [WebCite Cache]
- Christensen H, Griffiths KM, Korten A. Web-based cognitive behavior therapy: analysis of site usage and changes in depression and anxiety scores. J Med Internet Res 2002 Mar;4(1):e3 [FREE Full text] [CrossRef] [Medline]
- Christensen H, Griffiths KM, Mackinnon AJ, Brittliffe K. Online randomized controlled trial of brief and full cognitive behaviour therapy for depression. Psychol Med 2006 Dec;36(12):1737-1746. [CrossRef] [Medline]
- Radloff LS. The CES-D Scale: a self-report depression scale for research in the general population. Appl Psychol Meas 1977;1(3):385-401.
- Donker T, van Straten A, Marks I, Cuijpers P. Brief self-rated screening for depression on the Internet. J Affect Disord 2010 May;122(3):253-259. [CrossRef] [Medline]
- Nguyen TD, Attkisson CC, Stegner BL. Assessment of patient satisfaction: development and refinement of a service evaluation questionnaire. Eval Program Plann 1983;6(3-4):299-313. [Medline]
- Attkisson CC, Zwick R. The client satisfaction questionnaire. Psychometric properties and correlations with service utilization and psychotherapy outcome. Eval Program Plann 1982;5(3):233-237. [Medline]
- Hwang IK, Morikawa T. Design issues in noninferiority equivalence trials. Drug Inform J 1999;33:1205-1218 [FREE Full text] [WebCite Cache]
- Jones B, Jarvis P, Lewis JA, Ebbutt AF. Trials to assess equivalence: the importance of rigorous methods. BMJ 1996 Jul 6;313(7048):36-39 [FREE Full text] [Medline]
- Piaggio G, Elbourne DR, Altman DG, Pocock SJ, Evans SJ, CONSORT Group. Reporting of noninferiority and equivalence randomized trials: an extension of the CONSORT statement. JAMA 2006 Mar 8;295(10):1152-1160. [CrossRef] [Medline]
- Cohen J. Statistical Power Analysis for the Behavioral Sciences. Hillsdale, NJ: L Erlbaum Associates; 1988.
- Jacobson NS, Truax P. Clinical significance: a statistical approach to defining meaningful change in psychotherapy research. J Consult Clin Psychol 1991 Feb;59(1):12-19. [Medline]
- McGlinchey JB, Atkins DC, Jacobson NS. Clinical significance methods: Which one to use and how useful are they? Behavior Therapy 2002 Sep 2002;33(4):529-550. [CrossRef]
- Eysenbach G, CONSORT-EHEALTH Group. CONSORT-EHEALTH: improving and standardizing evaluation reports of Web-based and mobile health interventions. J Med Internet Res 2011;13(4):e126 [FREE Full text] [CrossRef] [Medline]
- Lintvedt OK, Griffiths KM, Sørensen K, Østvik AR, Wang CE, Eisemann M, et al. Evaluating the effectiveness and efficacy of unguided internet-based self-help intervention for the prevention of depression: a randomized controlled trial. Clin Psychol Psychother 2013 Sep;20(1):10-27. [CrossRef] [Medline]
- Newman MG, Erickson T, Przeworski A, Dzus E. Self-help and minimal-contact therapies for anxiety disorders: Is human contact necessary for therapeutic efficacy? J Clin Psychol 2003 Mar;59(3):251-274. [CrossRef] [Medline]
- Andersson G. Using the Internet to provide cognitive behaviour therapy. Behav Res Ther 2009 Mar;47(3):175-180. [CrossRef] [Medline]
- Hedman E, Andersson G, Ljótsson B, Andersson E, Rück C, Mörtberg E, et al. Internet-based cognitive behavior therapy vs. cognitive behavioral group therapy for social anxiety disorder: a randomized controlled non-inferiority trial. PLoS One 2011;6(3):e18001 [FREE Full text] [CrossRef] [Medline]
- Warmerdam L, van Straten A, Twisk J, Riper H, Cuijpers P. Internet-based treatment for adults with depressive symptoms: randomized controlled trial. J Med Internet Res 2008;10(4):e44 [FREE Full text] [CrossRef] [Medline]
- de Graaf LE, Huibers MJ, Riper H, Gerhards SA, Arntz A. Use and acceptability of unsupported online computerized cognitive behavioral therapy for depression and associations with clinical outcome. J Affect Disord 2009 Aug;116(3):227-231. [CrossRef] [Medline]
- Melville KM, Casey LM, Kavanagh DJ. Dropout from Internet-based treatment for psychological disorders. Br J Clin Psychol 2010 Nov;49(Pt 4):455-471. [CrossRef] [Medline]
- Spek V, Cuijpers P, Nyklícek I, Riper H, Keyzer J, Pop V. Internet-based cognitive behaviour therapy for symptoms of depression and anxiety: a meta-analysis. Psychol Med 2007 Mar;37(3):319-328. [CrossRef] [Medline]
- Eysenbach G. The law of attrition. J Med Internet Res 2005;7(1):e11 [FREE Full text] [CrossRef] [Medline]
- Wilson DB, Lipsey MW. The role of method in treatment effectiveness research: evidence from meta-analysis. Psychol Methods 2001 Dec;6(4):413-429. [Medline]
- Hilvert-Bruce Z, Rossouw PJ, Wong N, Sunderland M, Andrews G. Adherence as a determinant of effectiveness of internet cognitive behavioural therapy for anxiety and depressive disorders. Behav Res Ther 2012 Aug;50(7-8):463-468. [CrossRef] [Medline]
|ANU: The Australian National University|
|CBT: cognitive behavioral therapy|
|CES-D: Center for Epidemiological Studies Depression scale|
|CMHR: Centre for Mental Health Research|
|CSC: clinically significant change|
|CSQ-8: Client Satisfaction Questionnaire|
|IAPT: Increasing Access to Psychological Therapies|
|IPT: interpersonal psychotherapy|
|LMM: linear mixed model|
|MAR: missing at random|
|RCI: reliable change index|
|WSQ: Web Screening Questionnaire|
Edited by G Eysenbach; submitted 11.08.12; peer-reviewed by L Palmili, H O'Mahen; comments to author 29.12.12; revised version received 31.01.13; accepted 15.02.13; published 13.05.13
©Tara Donker, Kylie Bennett, Anthony Bennett, Andrew Mackinnon, Annemieke van Straten, Pim Cuijpers, Helen Christensen, Kathleen M Griffiths. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 13.05.2013.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.