Published on 15.11.13 in Vol 15, No 11 (2013): November
Preprints (earlier versions) of this paper are available at http://preprints.jmir.org/preprint/2791, first published Jun 25, 2013.
Smartphones for Smarter Delivery of Mental Health Programs: A Systematic Review
Background: The rapid growth in the use of mobile phone applications (apps) provides the opportunity to increase access to evidence-based mental health care.
Objective: Our goal was to systematically review the research evidence supporting the efficacy of mental health apps for mobile devices (such as smartphones and tablets) for all ages.
Methods: A comprehensive literature search (2008-2013) in MEDLINE, Embase, the Cochrane Central Register of Controlled Trials, PsycINFO, PsycTESTS, Compendex, and Inspec was conducted. We included trials that examined the effects of mental health apps (for depression, anxiety, substance use, sleep disturbances, suicidal behavior, self-harm, psychotic disorders, eating disorders, stress, and gambling) delivered on mobile devices with a pre- to posttest design or compared with a control group. The control group could consist of wait list, treatment-as-usual, or another recognized treatment.
Results: In total, 5464 abstracts were identified. Of those, 8 papers describing 5 apps targeting depression, anxiety, and substance abuse met the inclusion criteria. Four apps provided support from a mental health professional. Results showed significant reductions in depression, stress, and substance use. Within-group and between-group intention-to-treat effect sizes ranged from 0.29-2.28 and 0.01-0.48 at posttest and follow-up, respectively.
Conclusions: Mental health apps have the potential to be effective and may significantly improve treatment accessibility. However, the majority of apps that are currently available lack scientific evidence about their efficacy. The public needs to be educated on how to identify the few evidence-based mental health apps available in the public domain to date. Further rigorous research is required to develop and test evidence-based programs. Given the small number of studies and participants included in this review, the high risk of bias, and unknown efficacy of long-term follow-up, current findings should be interpreted with caution, pending replication. Two of the 5 evidence-based mental health apps are currently commercially available in app stores.
J Med Internet Res 2013;15(11):e247
Global mobile phone penetration reached 91% at the end of 2012, with 4.3 billion unique mobile subscribers  identified. Mobile health (mHealth)—specifically, mental health supported by mobile devices—thus has the potential to be delivered to large numbers of people worldwide. The first mobile software applications or “apps” became available to download on a mobile device in 2008. Since then, penetration has increased rapidly and is anticipated to continue rising. As of September 2012, an estimated 1,520,000 apps had been developed for mobile devices [ ], and around 13,600 health apps intended for use by consumers were available for download in Apple’s App Store [ ]. About 6% of these apps targeted mental health outcomes, while 18% focused on related health issues, such as sleep, stress, relaxation, and smoking behaviors. A survey among the Australian general public indicated that 76% would be interested in using mobile phones for mental health monitoring and self-management [ ]. This suggests that mHealth is acceptable and may be a useful vehicle for enhancing access to evidence-based monitoring and self-help for individuals with mild-to-moderate common mental health conditions [ ]. Clinical practice guidelines recommend cognitive behavior therapy (CBT) and self-help resources (such as mHealth) as options for psychological treatment for individuals experiencing mild-to-moderate symptoms of anxiety or depression [ ]. mHealth apps can be used as stand-alone self-help programs or as a conjunctive treatment modality in guided programs, for example, part of a website or through direct contact with a mental health professional. The app can include treatment components such as cognitive therapy (CT), behavioral activation (BA), psychoeducation, or monitoring of symptoms.
Advantages of mHealth include the improvement of treatment accessibility and participant retention, real-time symptom and activity monitoring and tracking of treatment progress through ecological momentary assessment (EMA), provision of personalized feedback and motivational support, portability and flexibility of use, and the potential to improve adherence to treatment [- ]. However, there are also disadvantages with using mobile devices for mental health. Technical problems and factors related to telecommunication can arise (eg, battery failures, reliability and sustainability of connections [ ]), and issues of data security, patient privacy, and the identification and timely management of crises and risk of harm must be carefully considered when integrating smartphone technology into behavioral health care [ ].
Previous research suggests that mental health interventions delivered through mobile apps can be effective in treating a range of mental health disorders, such as depression, stress, anxiety, and smoking cessation [, ]. However, the thriving development of mental health apps warrants a systematic review of the available evidence base in this growing area. Previous reviews examining evidence-based mental health apps did not incorporate quantitative analyses [ ] or included mHealth interventions that were not directly downloadable as an app (such as programs using SMS [short message service] text messaging or Internet-enabled interventions on mobile phones [ , ]). Therefore, the aim of this paper is to systematically review the available evidence-based apps directly downloadable on mobile devices (such as smartphones and tablets) for mental health symptoms or disorders (depression, anxiety, substance use, sleep disorders, suicidal behavior, psychotic disorders, eating disorders, stress, gambling) in children, adolescents, adults, and older individuals.
Search Strategy and Selection of Studies
A comprehensive literature search in bibliographic databases (MEDLINE, Embase, the Cochrane Central Register of Controlled Trials, PsycINFO, PsycTESTS, and Compendex and Inspec) for relevant articles published from January 1, 2008 (launch date of the first app), to May 30, 2013, was conducted. Terms indicative of mobile apps and mental health disorders were used to search these databases, with the search being limited to “humans”, English, and peer-reviewed journals (see- for the full search string). The identified titles and abstracts were screened for eligibility by 2 independent researchers. Full text copies of all potentially relevant papers, or papers where there was insufficient information in the abstract to determine eligibility, were obtained. Full text articles were further screened and discarded from further analyses if they met exclusion criteria. In addition, references of earlier reviews and reference lists of the included primary articles were examined. Furthermore, key technology journals (Cybertechnology, Behavior and Social Networking; Journal of Medical Internet Research; and Studies in Health Technology and Informatics) were hand-searched. We also reviewed Beacon, a website for evidence-based online programs for mental health, developed and delivered by the Centre for Mental Health Research at the Australian National University. Finally, a search was conducted of prominent individual authors’ and researchers’ names in the field of mHealth or Internet interventions (see ) in MEDLINE. Data extraction of relevant articles was completed by 2 independent researchers, with disagreements resolved through discussion or with a third researcher.
We applied strict inclusion criteria in order to investigate any evidence-based mental health apps that could be downloaded from app stores (eg, Google Play for Google Android  or the Apple iTunes store [ ]). Studies examining the effects of mental health apps on mental health symptoms or disorders (depression, anxiety, substance use, sleep disorders, suicidal behavior, self-harm, psychotic disorders, eating disorders, stress, and gambling) that were directly downloadable on a mobile device (eg, smartphone or tablet) compared with a control group were included. The control group could consist of a wait list, treatment-as-usual, or another treatment. Studies without a control group (pre-post design) were also included. There was no restriction on participant age. Studies were excluded if they did not include an intervention or if mental health symptoms/disorders were not an outcome, and if the intervention was an Internet-based intervention, virtual reality exposure treatment, interactive voice response technology intervention, or a text messaging-only intervention without a mobile application component. Studies were also excluded if the intervention was downloaded on a computer and transferred (eg, through Bluetooth or infrared) to a mobile device, if the intervention targeted a medical disorder (eg, irritable bowel syndrome, diabetes), if the paper provided a description of the mobile application but no outcome data, and if the intervention was developed before 2008. Conference abstracts, protocol papers, case studies, non-peer reviewed papers, and non-English papers were also excluded.
Study quality was assessed according to 6 basic criteria of the Cochrane Risk of Bias Assessment Tool : sequence generation, allocation concealment, blinding of outcome assessors, incomplete outcome data, selective outcome reporting, and other sources of bias. For the third criterion (blinding of outcome), we omitted blinding of participants since blinding participants for treatment allocation is rarely achievable in intervention trials for mental health disorders.
Primary outcome measures included reduction of depression symptoms, anxiety symptoms, substance use, sleep disturbance, suicidal behavior (suicide ideation, suicide plans, and attempts), self-harm, psychotic symptoms, symptoms of eating disorders, and gambling, as assessed with validated mental health scales.
When data were available and extractable, intention-to-treat (ITT) within-group and between-group effect sizes (Cohen’s d) for the intervention group were calculated by taking the difference between the mean pre- and posttest scores (within-group effect size) or the difference of the posttest scores (between-group effect size) and dividing by the pooled standard deviation. Effect sizes of 0.8 can be assumed to be large, while effect sizes of 0.5 are moderate, and effect sizes of 0.2 are small . Where authors provided only t test statistics, we computed effect sizes using the formula: d=t / sort(df) [ ]. Hedges’ g effect sizes were converted to Cohen’s d. Authors were contacted to provide additional data if needed. Two studies [ , ] did not provide sufficient data to calculate ITT within-group effect sizes.
Selection and Inclusion of Studies
A total of 5464 abstracts in MEDLINE (n=1859), Embase (n=1030), the Cochrane Central Register of Controlled Trials (n=277), PsycINFO (n=1095), PsycTESTS (n=1), and Compendex and Inspec (n=1203) were examined (N=4997 abstracts in total, after removal of duplicates). The majority of records that were excluded addressed nonpsychological technical issues, provided descriptions of mobile apps without outcome data, or were protocol papers or conference abstracts. Of these, 133 full text papers potentially eligible for inclusion were retrieved for further consideration, of which 126 were excluded. Seven trials met inclusion criteria. A further screening for potentially relevant references in recent systematic reviews or meta-analyses and the included studies, individual author names in MEDLINE, and hand-searching of technology journals (Cybertechnology, Behavior and Social Networking; Journal of Medical Internet Research; and Studies in Health Technology and Informatics [January 1, 2008, to May 30, 2013]) and the Beacon website resulted in 95 potentially relevant abstracts and retrieval of 64 additional full text papers for further assessment. Of these, only 1 study met inclusion criteria and was included in the final analysis. In total, 8 trials were identified. These described 5 apps (Mobilyze! , mobiletype [ , ], DBT Coach [ ], Mobile Stress Management [ , , ], and Get Happy Program [ ]) (see for a flowchart of the screening process). There was a high degree of consensus among raters who screened the titles and abstracts (an interrater reliability of 95.2%).
Characteristics of Included Studies
A total of 227 participants were recruited across all studies. One study  did not provide sufficient information about sample size per treatment arm. Of the 8 included studies, 4 trials describing 3 apps assessed depression (Mobilyze!, mobiletype, Get Happy Program), and 3 studies describing 1 app (Mobile Stress Management) assessed stress as a primary outcome measure. Substance use was used as an outcome measure in 1 study (DBT Coach).
provides an overview of the included studies (see for the complete version of the table). One study used BA and another used CBT as the therapeutic mode of the intervention. Two studies described a trial delivering emotional self-awareness (ESA), 1 study was based on dialectical behavioral therapy (DBT) and opposite action (ie, emotional regulation skills), and 3 studies described an app delivering stress inoculation training (SIT) as the content of the intervention. Four studies describing 3 trials used an attention-placebo as a control group, 1 study used an active comparison, and 1 study did not specify the nature of the control group. Two studies used a pre-post design without a control group, and all studies except one were feasibility and/or pilot studies. Two studies recruited adults from the community, 1 study recruited from an outpatient clinic, and 2 studies recruited from the workplace. Two studies describing 1 trial recruited adolescents from general practice, and 1 study targeted female university students. Four studies delivered the intervention through a stand-alone mobile app, while 3 studies describing 2 trials used a mobile app alongside a website and EMA to deliver the intervention. One study used a mobile application in conjunction with traditional face-to-face therapy. All included studies delivered the program on a mobile phone, with 1 study also including iPads. Delivery length varied between 6 days and 8 weeks. Five studies assessed posttest outcomes only, whereas 3 studies describing 2 trials undertook follow-up assessments as well (6 weeks and 3 months). Five studies describing 4 apps were guided by mental health professionals through phone or email contact, whereas in 3 studies describing 1 app, participants independently navigated their way through the trial.
|Author (year); name of app||Trial||Primary outcome measure||Study sample||Intervention group||Control group||Delivery type||Delivery length|
|Withinf and betweeng effect size|
|Burns et al (2011);|
|Pre-post pilot||MDD||Adults from the com-munity||n=8;|
|NA||Mobile app + website + EMA on mobile phone||8 weeks;|
|PHQ-9: d=1.95a,e,fd=2.28a,e,f GAD-7: d=1.37a,e,f|
|Kauer et al (2012);|
Reid et al (2011);
|RCT||MDD||Adolescents from general practice||n=68;|
ESA + Individualized data summary reports + meeting with GP
Attention control + part of the individualized data summary reports + meeting with GP
|Stand-alone mobile app + EMA on mobile phone||8 modules over 2-4 weeks; MHP||DASS Stress:|
|Rizvi et al (2011);|
|BPD and substance use||Adults from out-patient clinic||N=21;|
DBT + OA
|NA||Mobile app on mobile phone + F2F DBT||10-14 days;|
Emotional intensity to use substance:d=0.52c,d,f
Urge to use substance: d=0.29c,d,f
|Villani et al (2012);|
Mobile Stress Management
|RCT||Stress||Female oncology nurses||n=8;|
|Stand-alone mobile app on mobile phone||8 videos over 4 weeks; no support||NA|
|Villani et al (2011);|
Mobile Stress Management
|RCT||Stress||Female oncology nurses||n=15;|
|n=15; Attention control||Stand-alone mobile app on mobile phone||8 videos over 4 weeks; no support||STAI (anxiety trait): d=0.41a,d,f|
COPE (Active): d=-0.45a,d,f
COPE (Denial): d=0.53a,d,f
|Grassi et al (2011);|
Mobile Stress Management
|RCT||Stress||Female university students||n=not reported; SIT||n=not reported; Control||Stand-alone mobile app on mobile phone||6 videos over 6 days; no support||NA|
|Watts et al (2013);|
Get Happy Program
|Pilot RCT||MDD||Adults from the community||n=15;|
CBT via mobile app
CBT via computer
|Stand-alone mobile app on mobile phone + iPad||6 modules over 8 weeks;|
|PHQ-9: d=1.56a,e,fd=-0.14a,gd=1.69b,e,fd=-0.28b,gBDI-II : d=1.90a,e,fd=-0.11a,gd=2.11b,e,fd=-0.48b,g K10:d=1.93a,e,fd=0.01a,gd=1.23b,e,fd=0.03b,g|
cwithin immediate coaching session
fwithin-group effect size;
gbetween-group effect size
The quality of the studies varied but was generally low (see). Three studies describing 2 apps reported adequate sequence generation [ , , ], whereas 3 studies [ , , ] did not outline their sequence generation method. Three studies [ , , ] reported allocation to conditions by an independent (third) party, whereas 3 other studies [ , , ] did not provide sufficient information on allocation. Two studies that included diagnostic interviews [ , ] reported using blinded outcome assessors, and 4 studies [ , , , ] did not report blinding of assessors or used self-report outcome measures. Two studies [ , ] were not eligible for ratings for sequence generation, allocation concealment, or blinding of outcome assessors due to the pre-post study design. In 6 studies [ , - ], ITT analyses (completeness of follow-up data) were conducted; 1 of these failed to describe dropout rates [ ], and only 1 study [ ] described reasons for dropout during the intervention. Two studies [ , ] did not state the nature of the statistical analyses or dropout rate at all. Insufficient information and a high risk of bias of selective outcome reporting was present in 3 studies [ , , ] and 2 studies [ , ] respectively. Three studies [ , , ] had a high risk of other sources of bias (eg, absence of a control group, possible treatment infidelity) while for 5 studies [ - , ] the risk of bias from other sources was unclear (due to significant difference at baseline for stress outcome, unequal number of participants in intervention and control group, and insufficient information). None of the included studies met all 6 quality criteria of the Cochrane tool (see ).
|Trials||Sequence generation||Allocation concealment||Blinding||Incomplete outcome data||Selective outcome reporting||Other sources of bias||Total|
|Burns et al, 2011||NA||NA||NA||0||0||2||2|
|Grassi et al, 2011||1||1||1||1||1||1||6|
|Kauer et al, 2012||0||0||0||1||2||1||4|
|Reid et al, 2011||0||0||0||1||2||1||4|
|Rizvi et al, 2011||NA||NA||NA||1||1||2||4|
|Villani et al, 2011||1||1||1||1||0||1||5|
|Villani et al, 2012||1||1||1||1||1||1||6|
|Watts et al, 2013||0||0||1||0||0||2||3|
a0: low risk of bias; 1: insufficient information; 2: high risk of bias; NA: not applicable.
Effects of the Mental Health Apps
Four studies describing 3 mobile apps [, , , ] targeted depression. Burns et al [ ] found a significant reduction in depression caseness (Mini-International Neuropsychiatric Interview [MINI]: Z=2.15, beta [week]=-.65, P=.03), as well as depression and anxiety symptoms at posttest (Patient Health Questionnaire [PHQ-9]: d=1.95, P<.001; Quick Inventory of Depression Symptoms-Clinician Rated: d=2.28, P<.001; Generalized Anxiety Disorder-7 item scale: d=1.37, P<.001) in a pilot test of the guided Mobilyze! app alongside a website and EMA for adults from the general population. The Mobilyze! app will be publicly available for download soon.
In a randomized controlled trial (RCT) of a guided mobiletype app with EMA conducted by Kauer et al  and Reid et al [ ], no significant differences were found at posttest and follow-up on outcomes of depression, anxiety, and stress among adolescents from general practice compared to an attention control group (Depression and Anxiety Stress Scale [DASS] anxiety: d=0.07, P=.76; DASS depression: d=0.09, P=.69). However, it should be noted that the control group received largely the same intervention as the experimental group, with the exception of two components; ESA training via EMA and minimal feedback reports. Mediator analyses yielded an indirect effect of group on depression via ESA (beta=–0.610, 95% CI –5.596 to –0.003). Significant small to moderate within-group differences over time were found for the intervention group (DASS stress: d=0.37 at posttest; d=0.59 at follow-up; DASS anxiety: d=0.31 at posttest; d=0.45 at follow-up; DASS depression: d=0.34 at posttest; d=0.64 at follow-up) and control group (DASS stress: d=0.14 at posttest; d=0.41 at follow-up; DASS anxiety: d=0.07 at posttest; d=0.08 at follow-up; DASS depression: d=0.42 at posttest; d=0.61 at follow-up). Between-group effect sizes were small and nonsignificant (DASS stress: d=0.14 at posttest; d=0.22 at follow-up; DASS Anxiety: d=0.25 at posttest; d=0.07 at follow-up; DASS depression: d=0.11 at posttest; d=0.09 at follow-up). The mobiletype app is not publicly available for download to date.
Watts et al  found a significant reduction over time (P<.001) and large effect sizes in a pilot RCT of a partially guided CBT-based program for depression delivered either via a computer or mobile app (Get Happy Program) (PHQ-9: d=1.56; Beck Depression Inventory [BDI-II]: d=1.90; Kessler10 [K10]: d=1.93). No differences between the two groups were found for depression over time with P>.05 (PHQ-9: P=.34, d=–0.14; BDI-II: P=.52, d=–0.11; K10: P=.90, d=–0.01). The Get Happy app is not publicly available for download to date.
Three RCTs describing 1 unguided mobile app (Mobile Stress Management) using SIT [, , ] found a significant decrease in state and trait anxiety (State and Trait Anxiety Inventory [STAI]) and a significant increase in active coping skills among oncology nurses [ , ] and female university students [ ] compared to a control group. Grassi et al [ ] used a simplified version of the Mobile Stress Management app, which was also effective for reducing stress. However, both Villani et al [ ] and Grassi et al [ ] did not provide statistical results for intervention versus control group comparisons. Villani et al [ ] reported significant decreases in state anxiety over time (F1,28=71.365, P≤.001) and a significant group x time interaction effect for state anxiety (F1,28=27.476, P≤.001). Within-group effect sizes (converted from t test statistics) were small for active coping strategies (COPE Inventory [COPE] Active: d=0.42), and large for state anxiety (STAI: d=0.84) and denial coping strategies (COPE Denial: d=1.08). The Mobile Stress Management app is publicly available for download (Italian version only).
A pilot feasibility study aiming to reduce substance use (alcohol, drugs, and tobacco) among adults suffering from borderline personality disorder using a mobile app (DBT Coach ) in conjunction with face-to-face DBT therapy, indicated a significant reduction (P<.05) within each DBT Coach session in emotional intensity and urge to use substances (d=0.52 and d=0.29 respectively). Furthermore, a significant reduction (P<.05) in symptoms of depression (BDI: P=.014, d=0.55), global symptom severity (Brief Symptom Inventory: P=.021, d=0.43), and confidence in participants’ ability to use opposite action (ie, emotion regulation) skills (Behavior Confidence Questionnaire: P=.008, d=0.59) was noted from pre- to post assessment. outlines the ITT within-group effect sizes. The DBT Coach app is publicly available for download.
Ecological Momentary Assessment
Mixed findings were obtained from the 2 studies using EMA as part of the intervention. In the Burns et al  study, promising accuracy rates (60-91%) were achieved in predicting categorical contextual states (eg, location) based upon participant EMA entries. For participant states rated on continuous self-report scales (eg, mood), predictive capability was poor. Notwithstanding these technological outcomes, Reid et al [ ] and Kauer et al [ ] demonstrated that increased self-monitoring with EMA by participants did lead to increased ESA and thereby reduced depressive symptoms.
Intervention Feasibility and Adherence
Three studies providing usability and feasibility outcomes (eg, acceptability of the technology, perceived usefulness, perceived utility) reported moderate to high rates of mobile phone usage, feasibility, and participant satisfaction with the intervention [, , ]. The dropout rate was reported in 4 studies and varied between 12.5% and 34.3% [ , , , ]. Reported reasons for dropout, where described, were mostly due to technical problems [ ].
Principal Results and Comparison With Prior Work
In general, the studies included in this systematic review showed promising results for evidence-based mental health apps in reducing depressive symptoms and caseness, stress, anxiety, and substance use, similar to previous reviews of mHealth [, ]. However, due to the high risk of bias in some studies, these findings need to be considered with caution pending replication. Due to the absence of a control group in 2 studies [ , ], it was difficult to determine whether the beneficial effects were attributable to the app itself, a function of natural remission or regression to the mean, or in case of the DBT Coach app [ ], due to the face-to-face DBT therapy offered to all participants in conjunction with the app. Additionally, a clear conclusion about the efficacy of the DBT Coach for substance use treatment cannot be drawn yet, since—besides the absence of a control group—change in substance use (eg, amount of alcohol units per week) prior to or after treatment was not reported, nor was a distinction made between different types of substance use (alcohol, drugs, nicotine cessation). Furthermore, some studies failed to provide sufficient information regarding dropout rates or did not report the statistical analyses used [ , ].
The mobiletype app was the only intervention that failed to yield any significant direct effect on depression, although a significant indirect effect was found in a reduction of depressive symptoms through the direct effect of increased ESA . Because the attention-placebo control group received almost the same intervention as the experimental group, except for the ESA component, the nonsignificant finding is likely to be the cause of this finding. This study suggests that repeated self-monitoring over time using EMA on a mobile device may increase ESA and thereby reduce depressive symptoms. Evidence supports a similar mechanism underlying improvements in depression with CBT, where one of the most important components of CBT for depression involves rating one’s mood and activities in a diary to raise awareness of how activities influence mood states [ ]. The development of mobile devices has facilitated the collection of EMA data, thereby providing a portable and convenient delivery mode with which an individual can incorporate EMA and regular mood monitoring in their daily lives and improve ESA as part of treatment for depression. Although EMA shows promising results in predicting categorical contextual states, it needs to be further optimized to be able to accurately predict mood states [ ]. Once refined in such a way to maximize accuracy and temporal resolution and minimize bias, EMA holds considerable potential to reveal dynamic interplay between mood, cognition, and behavior, increase participant self-awareness of such processes, and thereby enhance mental health treatment [ ]. Together with the use of biomedical and/or activity sensors, timely personalized feedback can be generated to prompt users. mHealth interventions therefore have the potential to improve current depression treatment considerably [ ]. In a similar way to guided Internet interventions [ ], guided apps might derive larger effect sizes and adherence rates than stand-alone self-help apps, but more research is necessary to elucidate this.
Usability, Helpfulness, and Satisfaction
Usability, helpfulness, and satisfaction ratings, where assessed, were moderate to high [, , ], indicating that mHealth apps are perceived to be a useful vehicle for enhancing access to evidence-based monitoring and self-help. However, common technical problems (eg, battery failure, connectivity, freezing of app) need to be overcome. Adherence rates (if reported) were high, in line with previous research in mHealth [ ], but higher when compared to adherence rates seen with Internet-based interventions [ ]. It might be that the method of delivery (mobile phone) and its portability and flexible usage, and/or its delivery of personalized feedback may account for these higher retention rates for mobile apps. However, some of the included studies provided subjects with monetary rewards for participation, which is likely to artificially raise adherence rates as well.
Sustainability of Results
Most studies included only posttest assessment or a short-term follow-up (6 weeks). Although 1 study showed sustainable results at 3-month follow-up , sustainability of results over a medium- to long-term timeframe requires further investigation and replication. As such, on the basis of current evidence, sustainability of results cannot yet be determined.
Since mental health apps downloadable for use by the general population are increasing rapidly, despite evidence for their efficacy being largely unknown, the focus of this systematic review was on apps only. We applied very stringent inclusion criteria to ensure that we identified the evidence-based mental health apps that could be downloadable in the future by the general public from app stores, for example, Google Play for Google Android  or the Apple App Store [ ]. Therefore, several highly sophisticated programs using mobile technology were excluded, such as the myCompass program [ ] for depression, anxiety, and stress. The CBT-based myCompass program is delivered via a website with an Internet-enabled mobile phone component and encourages real-time self-monitoring of moods, mood triggers, and lifestyle behaviors using SMS text messaging and email prompts. Other examples of similar programs include an SMS-based txt2quit intervention [ ] and a video-based STUB IT intervention [ ], both of which have been shown to be effective for smoking cessation, and an SMS-based intervention [ ] to increase medication adherence in individuals with schizophrenia. We were also unable to include the innovative INTREPID research [ ], which used virtual reality exposure therapy on mobile phones to reduce anxiety.
There are more than 3000 mental health apps for Android, Apple, and Microsoft freely available to download to date, compared to the 8 evidence-based apps we identified through our systematic review. Only 2 of the apps included in this review are currently available for public download, comprising less than 1% of the commercially available apps. A recently published review on existing (commercial) mHealth apps for the most prevalent health conditions in the Global Burden of Disease list provided by the World Health Organization  echoes this finding. The authors concluded that the development of mHealth apps was first and foremost driven by commercial and economic motivations rather than scientific motivations behind research. Although the numerous protocols [ , ] and case studies [ , ] we excluded indicate a nascent field of research, the rapid growth and development of thousands of non-evidence-based mental health technologies has generated the need for independent regulation. This is underlined by the alarming findings from previous research [ - ] indicating that only 13-26% of Web-based or app-based interventions for smoking cessation adhere to treatment guidelines. A recent study on commercial apps using EMA for alcohol use echoes these findings [ ]. The US Food and Drug Administration has taken an important step towards the development of quality control guidelines for health apps [ ], but there are still major issues and dangers concerning the lack of quality control of commercially available mental health apps. Further research and work must be undertaken to develop, test, and disseminate evidence-based mHealth interventions among the public to ensure optimal public health outcomes.
This review has several limitations. First, despite an extensive search, the number of included studies was small, which restricted our interpretations as to whether mHealth apps have an effect on reducing mental health symptoms. Second, the number of participants in the included studies was small. As a result, the studies were probably underpowered to detect the more subtle effects of the interventions. Furthermore, small sample sizes hamper the precision and accuracy of the statistical results and therefore limit our interpretations . Third, the quality of the included studies was low. Historically, low quality trials yield positive results [ ]. Due to the small number of studies, we were unable to examine whether significant differences existed between higher- and lower-quality studies. Fourth, there were no studies that examined the long-term efficacy of mental health apps. Therefore, long-term effects remain as yet unknown. Finally, only studies from peer-reviewed, English language journals were included in this review. However, the effect of language bias has been shown to have a minimal impact on the conclusions of systematic reviews [ ].
There is a very clear need for more research in this area. Trials with an RCT design of high quality to minimize risk of bias are needed to determine the efficacy of mental health apps. Unfortunately, the competitive nature and time-consuming process of grant applications and RCT designs necessary for such high-quality research contrasts sharply with the speed of development in this highly innovative technology. Component testing with small sample sizes may offer one solution to help bridge the gap between academia and real-world applications . Research is particularly weak in the domains of sleep disturbance, anxiety disorders, and smoking cessation and needs further investigation. The cost-effectiveness and cost-utility of mHealth, compared to standard care or Internet-based treatment, requires further examination.
In summary, although a firm conclusion cannot yet be drawn, the current systematic review suggests that mobile apps for mental health have the potential to be effective in reducing depression, anxiety, stress, and possibly substance use for individuals experiencing these symptoms. Given the widespread usage of mobile and smartphones and increasing uptake of tablet devices, mHealth has the potential to increase treatment accessibility globally. The difference in the volume of commercial apps compared to the small number of tested evidence-based apps is striking. It warrants the need for public education and further development and research into evidence-based mental health apps and consideration of industry regulation.
This study is funded by Black Dog Institute, University of New South Wales. HC is supported by National Health and Medical Research Council Fellowship 525411.
Conflicts of Interest
Multimedia Appendix 1
Search string MEDLINE and Embase.GIF File, 40KB
Multimedia Appendix 2
Search string Cochrane Compendex and Inspec.GIF File, 52KB
Multimedia Appendix 3
Search string PsycINFO and PsycTESTS.GIF File, 75KB
Multimedia Appendix 4
Individual author names.PDF File (Adobe PDF File), 26KB
Multimedia Appendix 5
Psychosocial studies of applications on mobile devices (intention-to-treat).PDF File (Adobe PDF File), 182KB
- Source Digit. Global Mobile Penetration Reached 91 Percent in Q3 2012; 6.4 Billion Mobile Subscribers Worldwide. 2012. URL: http://sourcedigit.com/1264-global-mobile-penetration-q3-2012/ [accessed 2013-06-17] [WebCite Cache]
- CNet. Google ties Apple with 700,000 Android Apps. URL: http://news.cnet.com/8301-1035_3-57542502-94/google-ties-apple-with-700000-android-apps/ [accessed 2013-06-17] [WebCite Cache]
- Mobi Health News. Report: 13K iPhone consumer health apps in 2012. URL: http://mobihealthnews.com/13368/report-13k-iphone-consumer-health-apps-in-2012/ [accessed 2013-06-17] [WebCite Cache]
- Proudfoot J, Parker G, Hadzi Pavlovic D, Manicavasagar V, Adler E, Whitton A. Community attitudes to the appropriation of mobile phones for monitoring and managing depression, anxiety, and stress. J Med Internet Res 2010;12(5):e64 [FREE Full text] [CrossRef] [Medline]
- National Institute for Clinical Excellence (NICE). Depression: management of depression in primary and secondary care (amended). In: NICE Clinical Practice Guideline. London: NICE; 2007:23.
- Harrison V, Proudfoot J, Wee PP, Parker G, Pavlovic DH, Manicavasagar V. Mobile mental health: review of the emerging field and proof of concept study. J Ment Health 2011 Dec;20(6):509-524. [CrossRef] [Medline]
- Whittaker R, McRobbie H, Bullen C, Borland R, Rodgers A, Gu Y. Mobile phone-based interventions for smoking cessation. Cochrane Database Syst Rev 2012;11:CD006611. [CrossRef] [Medline]
- Carter MC, Burley VJ, Nykjaer C, Cade JE. Adherence to a smartphone application for weight loss compared to website and paper diary: pilot randomized controlled trial. J Med Internet Res 2013;15(4):e32 [FREE Full text] [CrossRef] [Medline]
- Proudfoot J, Nicholas J. Monitoring evaluation in low intensity CBT interventions. In: Bennett-Levy J, Richards DA, Farrand P, Christensen H, Griffiths KM, Kavanagh DJ, et al, editors. Oxford Guide to Low Intensity CBT Interventions. Oxford: Oxford University Press; 2010:97-104.
- Warmerdam L, Riper H, Klein M, van den Ven P, Rocha A, Ricardo Henriques M, et al. Innovative ICT solutions to improve treatment outcomes for depression: the ICT4Depression project. Stud Health Technol Inform 2012;181:339-343. [Medline]
- Burns MN, Begale M, Duffecy J, Gergle D, Karr CJ, Giangrande E, et al. Harnessing context sensing to develop a mobile intervention for depression. J Med Internet Res 2011;13(3):e55 [FREE Full text] [CrossRef] [Medline]
- Luxton DD, McCann RA, Bush NE, Mishkind MC, Reger GM. mHealth for mental health: Integrating smartphone technology in behavioral healthcare. Prof Psychol: Res Pract 2011;42(6):505-512.
- Ehrenreich B, Righter B, Rocke DA, Dixon L, Himelhoch S. Are mobile phones and handheld computers being used to enhance delivery of psychiatric treatment? A systematic review. J Nerv Ment Dis 2011 Nov;199(11):886-891. [CrossRef] [Medline]
- Google. Google play URL: https://play.google.com/store [accessed 2013-06-17] [WebCite Cache]
- Apple. iTunes URL: http://www.apple.com/itunes/ [accessed 2013-06-17] [WebCite Cache]
- Higgins JP, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, Cochrane Bias Methods Group, Cochrane Statistical Methods Group. The Cochrane Collaboration's tool for assessing risk of bias in randomised trials. BMJ 2011;343:d5928 [FREE Full text] [Medline]
- Cohen J. Statistical power analysis for the behavioral sciences. Hillsdale, N.J: L. Erlbaum Associates; 1988.
- Rosnow RL. Effect sizes for experimenting psychologists. Can J Exp Psychol 2003 Sep;57(3):221-237. [Medline]
- Grassi A, Gaggioli A, Riva G. New technologies to manage exam anxiety. Stud Health Technol Inform 2011;167:57-62. [Medline]
- Villani D, Grassi A, Cognetta C, Cipresso P, Toniolo D, Riva G. The effects of a mobile stress management protocol on nurses working with cancer patients: a preliminary controlled study. Stud Health Technol Inform 2012;173:524-528. [Medline]
- Kauer SD, Reid SC, Crooke AH, Khor A, Hearps SJ, Jorm AF, et al. Self-monitoring using mobile phones in the early stages of adolescent depression: randomized controlled trial. J Med Internet Res 2012;14(3):e67 [FREE Full text] [CrossRef] [Medline]
- Reid SC, Kauer SD, Hearps SJC, Crooke AHD, Khor AS, Sanci LA, et al. A mobile phone application for the assessment and management of youth mental health problems in primary care: a randomised controlled trial. BMC Fam Pract 2011;12:131. [Medline]
- Rizvi SL, Dimeff LA, Skutch J, Carroll D, Linehan MM. A pilot study of the DBT coach: an interactive mobile phone application for individuals with borderline personality disorder and substance use disorder. Behav Ther 2011 Dec;42(4):589-600. [CrossRef] [Medline]
- Villani D, Grassi A, Cognetta C, Toniolo D, Cipresso P, Riva G. Self-help stress management training through mobile phones: An experience with oncology nurses. Psychol Serv 2013 Aug;10(3):315-322. [CrossRef] [Medline]
- Watts S, Mackenzie A, Thomas C, Griskaitis A, Mewton L, Williams A, et al. CBT for depression: a pilot RCT comparing mobile phone vs. computer. BMC Psychiatry 2013;13:49 [FREE Full text] [CrossRef] [Medline]
- Beck AT, Rush AJ, Shaw BF, Emery G. Cognitive therapy of depression. New York: Guilford Press; 1979.
- Ebner-Priemer UW, Trull TJ. Ecological momentary assessment of mood disorders and mood dysregulation. Psychol Assess 2009 Dec;21(4):463-475. [CrossRef] [Medline]
- Christensen H, Griffiths KM, Farrer L. Adherence in internet interventions for anxiety and depression. J Med Internet Res 2009 Apr;11(2):e13 [FREE Full text] [CrossRef] [Medline]
- Brendryen H, Kraft P. Happy ending: a randomized controlled trial of a digital multi-media smoking cessation intervention. Addiction 2008 Mar;103(3):478-84; discussion 485. [CrossRef] [Medline]
- Donker T, Bennett K, Bennett A, Mackinnon A, van Straten A, Cuijpers P, et al. Internet-delivered interpersonal psychotherapy versus internet-delivered cognitive behavioral therapy for adults with depressive symptoms: randomized controlled noninferiority trial. J Med Internet Res 2013;15(5):e82 [FREE Full text] [CrossRef] [Medline]
- Black Dog Institute. MyCompass. URL: http://www.blackdoginstitute.org.au/docs/mycompassbackgroundinfo.pdf [accessed 2013-06-24] [WebCite Cache]
- Milne K, Bowler S, Li J, Salmon P. Evaluation of the first year of the Txt 2 Quit service: 17 June 2008–16 June 2009. Wellington, New Zealand: The Quit Group; 2009. URL: http://www.quit.org.nz/file/research/FINAL%202008-09%20Txt2Quit%20evaluation%20report%2020090731.pdf [accessed 2013-06-18] [WebCite Cache]
- Whittaker R, Maddison R, McRobbie H, Bullen C, Denny S, Dorey E, et al. A multimedia mobile phone-based youth smoking cessation intervention: findings from content development and piloting studies. J Med Internet Res 2008;10(5):e49 [FREE Full text] [CrossRef] [Medline]
- Granholm E, Ben-Zeev D, Link PC, Bradshaw KR, Holden JL. Mobile Assessment and Treatment for Schizophrenia (MATS): a pilot trial of an interactive text-messaging intervention for medication adherence, socialization, and auditory hallucinations. Schizophr Bull 2012 May;38(3):414-425 [FREE Full text] [CrossRef] [Medline]
- Riva G, Gorini A, Gaggioli A. The Intrepid project - biosensor-enhanced virtual therapy for the treatment of generalized anxiety disorders. Stud Health Technol Inform 2009;142:271-276. [Medline]
- Martínez-Pérez B, de la Torre-Díez I, López-Coronado M. Mobile health applications for the most prevalent conditions by the World Health Organization: review and analysis. J Med Internet Res 2013;15(6):e120 [FREE Full text] [CrossRef] [Medline]
- Bockting CL, Kok GD, van der Kamp L, Smit F, van Valen E, Schoevers R, et al. Disrupting the rhythm of depression using Mobile Cognitive Therapy for recurrent depression: randomized controlled trial design and protocol. BMC Psychiatry 2011;11:12 [FREE Full text] [CrossRef] [Medline]
- Ly KH, Carlbring P, Andersson G. Behavioral activation-based guided self-help treatment administered through a smartphone application: study protocol for a randomized controlled trial. Trials 2012;13:62 [FREE Full text] [CrossRef] [Medline]
- Morris ME, Kathawala Q, Leen TK, Gorenstein EE, Guilak F, Labhard M, et al. Mobile therapy: case study evaluations of a cell phone application for emotional self-awareness. J Med Internet Res 2010;12(2):e10 [FREE Full text] [CrossRef] [Medline]
- Abroms LC, Padmanabhan N, Thaweethai L, Phillips T. iPhone apps for smoking cessation: a content analysis. Am J Prev Med 2011 Mar;40(3):279-285 [FREE Full text] [CrossRef] [Medline]
- Bock B, Graham A, Sciamanna C, Krishnamoorthy J, Whiteley J, Carmona-Barros R, et al. Smoking cessation treatment on the Internet: content, quality, and usability. Nicotine Tob Res 2004 Apr;6(2):207-219. [CrossRef] [Medline]
- Bock BC, Graham AL, Whiteley JA, Stoddard JL. A review of web-assisted tobacco interventions (WATIs). J Med Internet Res 2008;10(5):e39 [FREE Full text] [CrossRef] [Medline]
- Cohn AM, Hunter-Reel D, Hagman BT, Mitchell J. Promoting behavior change from alcohol use through mobile technology: the future of ecological momentary assessment. Alcohol Clin Exp Res 2011 Dec;35(12):2209-2215 [FREE Full text] [CrossRef] [Medline]
- U.S. Food and Drug Administration. Draft guidance for industry and food and drug administration staff- mobile medical applications. URL: http://www.fda.gov/medicaldevices/deviceregulationandguidance/guidancedocuments/ucm263280.htm [accessed 2013-09-05] [WebCite Cache]
- Leon AC, Davis LL, Kraemer HC. The role and interpretation of pilot studies in clinical research. J Psychiatr Res 2011 May;45(5):626-629 [FREE Full text] [CrossRef] [Medline]
- Cuijpers P, Andersson G, Donker T, van Straten A. Psychological treatment of depression: results of a series of meta-analyses. Nord J Psychiatry 2011 Dec;65(6):354-364. [CrossRef] [Medline]
- Wright RW, Brand RA, Dunn W, Spindler KP. How to write a systematic review. Clin Orthop Relat Res 2007 Feb;455:23-29. [CrossRef] [Medline]
- Mohr DC, Cheung K, Schueller SM, Hendricks Brown C, Duan N. Continuous evaluation of evolving behavioral intervention technologies. Am J Prev Med 2013 Oct;45(4):517-523. [CrossRef] [Medline]
|BA: behavioral activation|
|BDI: Beck Depression Inventory|
|BDI-II: Beck Depression Inventory-Second edition|
|CBT: cognitive behavior therapy|
|COPE: COPE Inventory|
|CT: cognitive therapy|
|DASS: Depression, Anxiety and Stress Scale|
|DBT: dialectical behavior therapy|
|EMA: ecological momentary assessment|
|ESA: emotional self-awareness|
|K10: Kessler Psychological Distress Scale-10 item scale|
|MDD: major depressive disorder|
|MINI: Mini-International Neuropsychiatric Interview|
|PHQ: Patient Health Questionnaire|
|PHQ-9: Patient Health Questionnaire-9 item scale|
|RCT: randomized controlled trial|
|SIT: stress inoculation training|
|STAI: State-Trait Anxiety Inventory|
Edited by P Carlbring; submitted 25.06.13; peer-reviewed by J Ruwaard, O Kristjansdottir; comments to author 21.08.13; revised version received 17.09.13; accepted 18.09.13; published 15.11.13
©Tara Donker, Katherine Petrie, Judy Proudfoot, Janine Clarke, Mary-Rose Birch, Helen Christensen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.11.2013.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.