Journal of Medical Internet Research (J Med Internet Res, JMIR), ISSN 1438-8871

JMIR Publications, Toronto, Canada

Article ID: v24i3e29114; PMID: 35319470; DOI: 10.2196/29114

Review

Process and Outcome Evaluations of Smartphone Apps for Bipolar Disorder: Scoping Review

Amaryllis Mavragani; Najmeh Khalili-Mahani; Fiona Lobban; Hamed Mehdizadeh

Iona Tatham (https://orcid.org/0000-0002-0758-9896)

Ellisiv Clarke, MBBS 1 (https://orcid.org/0000-0002-4260-473X)

Kelly Ann Grieve, BA 1,2 (https://orcid.org/0000-0001-7508-6986)

Pulkit Kaushal, MBBS, MD 1,2 (https://orcid.org/0000-0002-4609-268X)

Jan Smeddinck, PhD 3 (https://orcid.org/0000-0003-0562-8473)

Evelyn Barron Millar, PhD 1 (https://orcid.org/0000-0002-8992-3332)

Aditya Narain Sharma, MBBS, MD, PhD 1 (https://orcid.org/0000-0003-4632-4521)

Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Academic Psychiatry, Wolfson Research Centre, Campus for Ageing and Vitality, Newcastle upon Tyne, NE4 5PL, United Kingdom. Phone: 44 1912875262. Email: aditya.sharma@ncl.ac.uk

1 Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom

2 National Specialist Adolescent Mood disorders Service, Cumbria Northumberland Tyne and Wear NHS Foundation Trust, Walkergate Park, Newcastle upon Tyne, United Kingdom

3 Open Lab, Human Computer Interaction, Urban Sciences Building, Newcastle University, Newcastle upon Tyne, United Kingdom

Corresponding Author: Aditya Narain Sharma aditya.sharma@ncl.ac.uk

J Med Internet Res 2022;24(3):e29114

Published: 23.03.2022 (issue: March 2022)

Article history dates: 26.03.2021, 07.05.2021, 28.07.2021, 01.12.2021

©Iona Tatham, Ellisiv Clarke, Kelly Ann Grieve, Pulkit Kaushal, Jan Smeddinck, Evelyn Barron Millar, Aditya Narain Sharma. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 23.03.2022.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

Background

Mental health apps (MHAs) provide opportunities for accessible, immediate, and innovative approaches to better understand and support the treatment of mental health disorders, especially those with a high burden, such as bipolar disorder (BD). Many MHAs have been developed, but few have had their effectiveness evaluated.

Objective

This systematic scoping review explores current process and outcome measures of MHAs for BD with the aim of providing a comprehensive overview of current research. This will help identify best practice for evaluating MHAs for BD and inform future studies.

Methods

A systematic literature search of the health science databases PsycINFO, MEDLINE, Embase, EBSCO, Scopus, and Web of Science was undertaken up to January 2021 (with no start date) to narratively assess how studies had evaluated MHAs for BD.

Results

Of 4051 original search results, 12 articles were included. These 12 studies included 435 participants, and of these, 343 had BD type I or II. Moreover, 11 of the 12 studies provided the ages (mean 37 years) of the participants. One study did not report age data. The male to female ratio of the 343 participants was 137:206. The most widely employed validated outcome measure was the Young Mania Rating Scale, being used 8 times. The Hamilton Depression Rating Scale-17/Hamilton Depression Rating Scale was used thrice; the Altman Self-Rating Mania Scale, Quick Inventory of Depressive Symptomatology, and Functional Assessment Staging Test were used twice; and the Coping Inventory for Stressful Situations, EuroQoL 5-Dimension Health Questionnaire, Generalized Anxiety Disorder Scale-7, Inventory of Depressive Symptomatology, Mindfulness Attention Awareness Scale, Major Depression Index, Morisky-Green 8-item, Perceived Stress Scale, and World Health Organization Quality of Life-BREF were used once. Self-report measures were captured in 9 different studies, 6 of which used MONARCA. Mood and energy levels were the most commonly used self-report measures, being used 4 times each. Furthermore, 11 of the 12 studies discussed the various confounding factors and barriers to the use of MHAs for BD.

Conclusions

Reported low adherence rates, usability challenges, and privacy concerns act as barriers to the use of MHAs for BD. Moreover, as MHA evaluation is itself still developing, guidance for clinicians on how to aid patient choices in mobile health needs to develop alongside it. These obstacles could be ameliorated by incorporating co-production and co-design using participatory patient approaches during the development and evaluation stages of MHAs for BD. Further, including qualitative aspects in trials that examine patient experience of both mental ill health and the MHA itself could result in a more patient-friendly, fit-for-purpose MHA for BD.

Keywords: child and adolescent mental health; scoping review; bipolar disorder; mental health

Introduction

Background

There are many critical factors that can influence the course and outcome of mental health disorders. Two key factors are (1) early and accurate identification of the first onset and subsequent relapses of the disorder, leading to the institution of appropriate management, and (2) access to appropriate treatment locations. For bipolar disorder (BD), the average delay between the onset of symptoms and the first institution of treatment can be as long as 10 to 15 years [1-3]. Between 35% and 50% of patients with mental health disorders receive no treatment because appropriate treatment locations are rare [4]. BD is no exception to this rule. A United Kingdom–based 2015 study found that the median diagnostic delay in the South London and Maudsley National Health Service (NHS) Trust was 62 days, with the median treatment delay being a further 31 days [5]. Research regarding pathways to redress these delays is urgently required. Because digital technology can reliably scaffold processes and scale to both large numbers and remote locations, it holds considerable promise for addressing these challenges.

In 2020, an estimated 6 billion smartphones were in use across the globe [6]. In the United Kingdom, there has been a 79% increase in the number of 5- to 15-year-old children owning mobile phones since 2015 [7]. Although smartphone ownership tends to be more common in high-income countries, as economies develop, the price of smartphones will decrease, and this correlation will reduce [6]. One form of digital technology that can capitalize on this increased smartphone usage globally is mental health apps (MHAs).

Figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flowchart for scoping reviews.

Prior Work

Currently, MHAs can be seen to improve engagement and accessibility for individuals in rural areas where health care provision is increasingly difficult to access [8,9], and they are well accepted by service users [10], so much so that financial incentives have been implemented for behavioral health information technologies in US policy [11]. Such advances in digital mental health care have now been adopted by clinicians in the treatment of common mental disorders in the United Kingdom [9]. Cost-effectiveness is crucial to health care, especially in a government-funded system as comprehensive as the NHS. Evidence suggests that the use of tele-psychiatry interventions reduced pressure on mental health services in low- and middle-income countries in comparison to a control group [12], although the authors noted the importance of rigorous app evaluation. This is echoed by Tal et al [13], who described both the potential opportunities and risks posed by digital mental health, and how they must be balanced in order to achieve meaningful change.

The socioeconomic cost of BD in the United Kingdom is well recognized [14]. Previous literature suggests that the use of MHAs for BD can increase patient engagement and provide real-time symptom monitoring to allow for improved recognition of symptoms of relapse [15]. In turn, this reduces barriers to treatment, such as lack of resources and time. However, the efficacy of MHAs for BD is unclear [16], and a paucity of evidence on how to assess and evaluate MHAs for BD makes these statements difficult to substantiate. To date, there is a lack of regulatory guidelines regarding MHAs, including those for BD, as health technologies are a relatively new resource within the NHS. This could potentially lead to unsafe use and practice [17]. Little is known about how MHAs (including those for BD) are developed and scrutinized, and studies predict that consumers, policymakers, health services, and funders will demand a robust evaluation process before funding, prescribing, or using these services [18].

As the development of an MHA for BD requires iterative processes with stakeholders outside of the academic and clinical research environment, process evaluation is important (in addition to more traditional outcome measure methods) to ensure the app remains user-friendly and functional without compromising clinical outcomes. This scoping review aims to address this research gap through mapping existing literature on process and outcome evaluation methods of MHAs for BD to increase the understanding around currently available evaluation tools and the latest practice.

Aim

The aim of this scoping review is to systematically explore current process and outcome measures to identify the best practice for evaluating MHAs for BD. The focus is on apps for BD designed for individuals across the lifespan. Conducting a scoping review will allow health care systems to be more structurally informed on how to accurately evaluate the effectiveness of such technologies when implementing them into routine care [19]. The specific objectives of this scoping review, based on the detailed framework of Levac et al [20], were to map the available evidence and report on (1) process evaluation methods (ie, participant usability and functionality), (2) outcome measure methods (ie, data on how widely a measure is used and concordance of the target population with completing a measure), (3) outcome measures used to measure mental health improvement (eg, well-being measures), and (4) methods for best practice in the evaluation of MHAs for BD.

Methods

Overview

The review was informed by the Arksey and O’Malley 5-step framework [19], which was further developed by the Levac, Colquhoun, and O’Brien model [20]. This includes identification of a research question, study selection and criteria, data extraction, and content analysis. Employing this methodological framework supports examination of the broader field of the evaluation of MHAs for BD to identify the best practice.

Search Strategy

A scoping search was initially performed in the following databases: PsycINFO, MEDLINE, Embase, EBSCO, Scopus, and Web of Science. Relevant search terms were identified from key papers, and the search strategy was developed iteratively in MEDLINE and then translated across the other databases, up to January 2021. Due to time constraints, grey literature sources were not investigated, and the search was limited to articles published in English, as no resources were available to undertake translation work. Broad search terms were used to reduce the likelihood of article omission. The complete search strategy for MEDLINE is available in Multimedia Appendix 1. The reference lists of included studies were hand searched for additional reports of relevance.

Selection Criteria

For studies to be included in this review, they needed to meet the following inclusion and exclusion criteria. Studies were included if they (1) were related to BD, (2) targeted individuals across the lifespan, (3) included qualitative and/or quantitative evaluation methods and measures, (4) were published in the English language, (5) had no start date limit, (6) included any type of study design, (7) included participants with symptoms of BD or diagnosed with BD according to International Classification of Diseases-10, Diagnostic and Statistical Manual of Mental Disorders-IV, or Diagnostic and Statistical Manual of Mental Disorders-5, (8) included evaluation of the functionality of the MHA and/or evaluation of the participant outcomes, and (9) included any function (eg, screening, mood monitoring, or medication adherence). Studies were excluded if they (1) were based on a web-based intervention with no MHA counterpart, (2) included MHAs that were only psychotherapeutic intervention specific with no evaluation, (3) were based on MHA development only, and (4) included a participant population without symptoms of BD or not diagnosed with BD. Where systematic review papers were identified, these were not included. However, their reference lists were hand searched to identify primary articles relevant for inclusion.

Given that the aim of the study was to recognize the scope of research already conducted, both qualitative and quantitative research designs were included. As few studies focused on children and adolescents as their participant population, no age limits were applied.
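As a minimal sketch, the eligibility criteria above can be expressed as a single screening predicate. The record fields and the function name are hypothetical and were chosen for illustration only; they do not describe the screening workflow actually used in this review, which was carried out manually by the researchers.

```python
from dataclasses import dataclass


@dataclass
class StudyRecord:
    """Hypothetical screening record; field names are illustrative assumptions."""
    relates_to_bd: bool            # study is related to bipolar disorder
    published_in_english: bool
    has_evaluation: bool           # evaluates MHA functionality and/or participant outcomes
    participants_have_bd: bool     # BD symptoms or an ICD-10/DSM-IV/DSM-5 diagnosis
    has_app: bool                  # False for web-based interventions with no MHA counterpart
    development_only: bool         # reports MHA development without evaluation
    is_systematic_review: bool     # reviews were excluded (reference lists hand searched)


def is_eligible(study: StudyRecord) -> bool:
    """Return True when a record meets the inclusion criteria and no exclusion criterion."""
    included = (
        study.relates_to_bd
        and study.published_in_english
        and study.has_evaluation
        and study.participants_have_bd
    )
    excluded = (
        not study.has_app
        or study.development_only
        or study.is_systematic_review
    )
    return included and not excluded


# Two made-up records: an app evaluation in BD and a development-only report.
evaluation_study = StudyRecord(True, True, True, True, True, False, False)
development_report = StudyRecord(True, True, False, True, True, True, False)
print(is_eligible(evaluation_study), is_eligible(development_report))
```

No age or start-date filter appears in the predicate because neither limit was applied.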

Selection Process

The search was completed by 2 researchers (IT and PK), who independently screened articles by the title and abstract against the inclusion criteria. Articles that fulfilled the inclusion criteria were then subjected to full-text screening by IT and PK. Conflicts were discussed with a third researcher (EBM) to reach consensus. Eight conflicts arose altogether, including 6 when screening the title, 1 when screening the title and abstract, and 1 when screening the full paper.

Data Collection Process

Characterization of the data and the results were exported into a customized data extraction form that was piloted in a subset of included studies. Data extracted included study name, authors, year, country, study design, MHA for BD, whether the MHA for BD was independent or adjunctive, sample size, mean age of the participants, gender of the participants, inclusion/exclusion criteria for the participants, results, tools used, measures used, time points, and whether it addressed any of the 4 objectives.

Data Synthesis and Quality Assessment

EC and IT analyzed the data using narrative synthesis and placed this in the context of the current literature to formulate conclusions. The studies were assessed using the Mixed Methods Appraisal Tool (MMAT) [21].

Results

Overview

The original database search yielded 4051 articles. Hand searching of relevant review articles was conducted, which yielded a further 5 articles. After duplicates were removed, 3730 articles remained. Screening of the title and abstract resulted in 3642 articles being excluded. The remaining 88 articles were then subjected to full-text screening, and 76 articles were excluded (71 due to a lack of focus on BD, 2 could not be located, 1 only focused on app development, 1 did not diagnose participants according to our specified criteria, and 1 had no app evaluation).
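The study flow described above can be checked with simple arithmetic. The sketch below uses only the counts reported in the text; the variable names are illustrative, and the derived number of duplicates assumes that the 5 hand-searched articles were added before deduplication.

```python
# Counts as reported in the text.
database_hits = 4051
hand_search_hits = 5
after_deduplication = 3730
excluded_title_abstract = 3642
full_text_exclusions = {
    "lack of focus on BD": 71,
    "could not be located": 2,
    "app development only": 1,
    "diagnosis not according to specified criteria": 1,
    "no app evaluation": 1,
}

# Derived values.
identified = database_hits + hand_search_hits
duplicates_removed = identified - after_deduplication  # assumption: hand-searched items pooled first
full_text_screened = after_deduplication - excluded_title_abstract
excluded_full_text = sum(full_text_exclusions.values())
included = full_text_screened - excluded_full_text

print(identified, duplicates_removed, full_text_screened, excluded_full_text, included)
```

Under these assumptions, the counts are internally consistent: 88 articles reached full-text screening, the 5 exclusion reasons sum to 76, and 12 studies remain.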

Study Characteristics

Overall, 12 studies were identified as part of this review, which evaluated 7 MHAs for BD. Multimedia Appendix 2 describes the characteristics and assessment compliance of each study [22-33], and Multimedia Appendix 3 describes each study's compliance with the standards set out in the MMAT [22-33]. Across all 12 studies, data from 435 participants were analyzed (343 with BD). Five of the 12 studies stated the type of BD for 167 participants (112 had bipolar I disorder, 52 had bipolar II disorder, and 3 had bipolar disorder not otherwise specified). Eleven of the 12 studies provided the mean age (37 years) of the participants with BD. All 12 studies provided information on the gender (M:F of 137:206) of the participants with BD.
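The participant figures above can be cross-checked as follows. This is a sketch using only the reported counts; the variable names and the derived proportions are illustrative and are not reported in the review itself.

```python
total_participants = 435
bd_participants = 343
bd_subtypes = {
    "bipolar I disorder": 112,
    "bipolar II disorder": 52,
    "bipolar disorder not otherwise specified": 3,
}
male, female = 137, 206

subtyped = sum(bd_subtypes.values())              # participants whose BD type was stated
non_bd = total_participants - bd_participants     # participants without BD
share_subtyped = subtyped / bd_participants       # proportion of BD participants with a stated type
share_female = female / (male + female)           # proportion of female participants with BD

print(subtyped, non_bd, f"{share_subtyped:.1%}", f"{share_female:.1%}")
```

The subtype counts sum to the 167 participants reported, and the gender counts sum to the 343 participants with BD.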

Assessment of the quality of all 12 studies (quantitative and mixed methods) was performed (IT and EBM) using the MMAT (version 2018) [21]. The results of their respective compliance with the standards set out in the MMAT can be found in Multimedia Appendix 3. Scores ranged from 20% to 100%. However, low-quality studies were not excluded in order to summarize the small pool of available literature.

Process Evaluation

Six studies examined the self-perceived participant usability and functionality of the MHAs for BD. This examination ranged from detailed feedback questionnaires given to the participants [22,23] to participant feedback suggesting that a reminder prompt from the MHA for BD increased completion rates. Only 1 study mentioned functionality problems in MHAs for BD. Bardram et al [23] commented that MONARCA worked on only 63 of the 69 days of the study period and that the information quality score was lower due to unresponsive error messages. The authors also noted that the Android market locked the app during the study period, negatively affecting the pattern of usage during that time.

Two studies recognized that technical problems were likely to arise and so implemented a system to solve these problems. Hidalgo-Mazzei et al [34] supplied participants with technical support via telephone, so they could contact the researcher for further assistance. A similar system was put in place by Schärer et al [24]. Subjects were able to report errors and receive immediate assistance by phone or personal communication. However, it was found that the MHA for BD required a certain amount of knowledge as a prerequisite, which restricted its use in comparison to a text message equivalent [24].

Outcome Evaluation

A variety of validated outcome measures were used to evaluate the selected MHAs for BD. The most widely employed measure was the Young Mania Rating Scale (n=8) [35]. The Hamilton Depression Rating Scale [36] was applied 3 times, and the Altman Self-Rating Mania Scale [37], Quick Inventory of Depressive Symptomatology [38], and Functional Assessment Staging Test [39] were used on 2 occasions. The Coping Inventory for Stressful Situations [40], EuroQoL 5-Dimension Health Questionnaire [41], Generalized Anxiety Disorder Scale-7 [41], Inventory of Depressive Symptomatology [42], Mindfulness Attention Awareness Scale [43], Major Depression Index [44], Morisky-Green 8-item [45], Perceived Stress Scale [46], and World Health Organization Quality of Life-BREF [47] were all utilized just once. Other measures were assessed in 9 different studies, 6 of which used MONARCA. These measures included mood, sleep length, medication taken, activity level, irritability, mixed mood, cognitive problems, alcohol consumption, stress level, menstruation, individualized early warning sign, energy level, anxiety, elation, sadness, anger, speed of thoughts, and impulsivity. Mood and energy level were the most commonly utilized measures, being used 4 times each.
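The frequencies above can be tallied in a few lines. The sketch below simply encodes the counts stated in the text using Python's `collections.Counter`; it adds no data beyond what is reported, and the variable names are illustrative.

```python
from collections import Counter

# Number of times each validated outcome measure was used, as reported in the text.
measure_uses = Counter({
    "Young Mania Rating Scale": 8,
    "Hamilton Depression Rating Scale": 3,
    "Altman Self-Rating Mania Scale": 2,
    "Quick Inventory of Depressive Symptomatology": 2,
    "Functional Assessment Staging Test": 2,
    "Coping Inventory for Stressful Situations": 1,
    "EuroQoL 5-Dimension Health Questionnaire": 1,
    "Generalized Anxiety Disorder Scale-7": 1,
    "Inventory of Depressive Symptomatology": 1,
    "Mindfulness Attention Awareness Scale": 1,
    "Major Depression Index": 1,
    "Morisky-Green 8-item": 1,
    "Perceived Stress Scale": 1,
    "World Health Organization Quality of Life-BREF": 1,
})

distinct_measures = len(measure_uses)
total_uses = sum(measure_uses.values())
most_used, most_used_count = measure_uses.most_common(1)[0]
used_once = sorted(name for name, n in measure_uses.items() if n == 1)

print(distinct_measures, total_uses, most_used, most_used_count, len(used_once))
```

The tally shows how concentrated usage is: 14 distinct validated measures account for 26 uses in total, and 9 of the 14 appear only once.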

Outcome Measure Methods

Eleven of the 12 studies discussed the confounding factors affecting the efficacy of MHAs for BD. These confounding factors and the number of studies in which each was mentioned are shown in Table 1.

Table 1

Identified potential confounders for mental health app efficacy.

Potential confounder	Number of studies in which mentioned
Participants were mainly stable or euthymic	4
Participant insight when experiencing a manic phase varies	3
Sample size was too small	3
Length of study was too short	3
Low retention or adherence rate	2
Method of objective data collection was not robust enough	2
Patients were found to be capable of experiencing both manic and depressive symptoms concurrently	1
Questionnaires given were too simplistic	1
Opportunity of free-text input not given	1
Error in the app	1
Change in mobile phone communication habits	1
Low prevalence of manic symptoms	1
Potential sampling bias	1
Order of questions in the questionnaire did not vary and so was open to mindless input	1
Scales not delivered often enough	1
Participants switched the mental health app (MHA) off during the study	1
Participants may have chosen not to complete the surveys due to their mood	1
Subjective scales	1
The MHA gave daily confrontation with depressive symptoms	1
The MHA was not sensitive enough to manic or depressive symptoms	1
Participants were already involved in a medication adherence intervention	1

Evaluation of MHAs for BD

Only 5 of the 12 studies commented on the future of the evaluation of MHAs for BD. Streicher et al [48] suggested that instead of measuring relapse or recurrence of affective episodes, a more sensitive measure would be assessment of mood instability or subsyndromal symptoms. They also commented that future research should include patients with bipolar disorder not otherwise specified, as this patient group may represent a large proportion of patients with BD. Hidalgo-Mazzei et al [34] acknowledged the low retention and adherence rates of MHAs for BD, and stated that researchers should focus on developing new approaches to motivate and engage patients with the intervention in the long term. The authors suggested adopting a user-centered design approach or incorporating gamification elements into a formal psychoeducation process.

Osmani et al [25] commented on the personalization of MHAs for BD, with a focus on physical activity; however, the authors found it difficult to generalize their results to the wider population. This was due to substantial variations between patients for both overall physical activity levels and physical activity levels within daily intervals. Therefore, the authors considered that an adaptive approach to user modeling would be better suited to detect early warning signs of the onset of episodes of BD and facilitate timely intervention. This involves personalizing goals and achievements around each patient’s individual needs. This has been evidenced in previous conference proceedings [48].

Schwartz et al [26] found that the generalizability of the results was limited due to a lack of a comparison group with a differing psychiatric diagnosis, which may exhibit overlapping symptoms. They recommended that future research should use additional comparison groups to better differentiate between symptoms.

Faurholt-Jepsen et al [27] proposed that emphasis should be placed on the differentiation between day-to-day difficulties and depressive symptoms. A positive reinforcing feedback mechanism may help minimize the negative processing bias and so, in theory, relieve the sustained depressive symptoms. They addressed the notion that it can be difficult for an intervention to have an effect on both depressive and manic symptoms, given the complexity of BD [27]. These suggestions are in keeping with the existing literature [49].

Discussion

Principal Findings and Comparison With Prior Work

The aim of this scoping review was to better understand how MHAs for BD are being evaluated, particularly in terms of the process of use and outcome measures. Due to the scarcity of studies evaluating MHAs for BD specifically, inferences for discussion have been drawn from studies evaluating general MHAs. This relies on the assumption that the functions are similar.

The need for effective and diligent evaluation of MHAs for BD is well established in the literature; no credentialing is currently required for their development and release. Karcher et al [50] warned that the “questionable content” and sparse evidence base of the myriad of current MHAs available warrant careful consideration. Effective, robust evaluation systems are required to aid patients and practitioners in making appropriate individualized decisions regarding their role and treatment options in patient care. The NHS in the United Kingdom made considerable progress in this area when it launched its Digital Technology Assessment Criteria (DTAC) in February 2021. This provides a “simpler and faster assessment process to help give staff, patients, and the public confidence that the digital health tools they use meet NHS standards” [51]. The DTAC bring together legislation and good practice into a core document [52] that all digital health technologies have to meet in order to be recommended by clinical policy teams within NHS England and NHS Improvement. Although this goes some way toward providing the public with centrally regulated technologies, clinicians may lack the knowledge and skills required for the effective recommendation of an MHA [25-27,43-54]. Mindfulness and meditation MHAs are the most commonly recommended by general practitioners in Australia [54]. However, the clinical presentation of BD and its specialist management may deter general practitioners from researching or recommending MHAs for its monitoring or management. Therefore, training health care professionals to identify and select high-quality MHAs for BD would be beneficial for patients. If, as the literature suggests, the use of MHAs decreases service use [10], the financial benefit may outweigh the cost of the additional training required.

One barrier in the development of MHA evaluation systems is the adherence rate. O’Connell [55] reported that 74% of users stop engaging with an MHA after only 10 uses. Low adherence reduces the confidence with which researchers can generalize their results [56]. The aforementioned personalization and gamification of MHAs for BD have been recommended; however, engagement can tail off once the initial novelty effect of the feature has subsided. It has been suggested that a change in the communication approach may help to solve this problem. Kenny et al [57] surmised that in studies where participants (specifically young people) are aware of the importance of their input to achieve the research objective, engagement levels may be higher. This brings into focus the importance of participatory co-design and co-production of MHAs for BD. Eight of our 12 studies mentioned adherence rates (adherence rates were not applicable in 2 studies, and another 2 studies failed to mention the rates), with the average rate being 84%. In fact, Tsanas et al [28] experienced 91% adherence over the first 3 months of the study period and 81.9% in the following 9 months. The variability between the studies by O’Connell [55] and Tsanas et al [28] illustrates that there is still work to be done to achieve successful and reliable compliance with such apps.
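To illustrate how adherence reported over unequal periods might be combined into a single figure, the sketch below computes a time-weighted mean of the two rates reported for Tsanas et al [28]. Weighting each month equally is an assumption made here for illustration; the result is not a figure reported by that study or by this review, and it is not how the 84% average across studies was derived.

```python
# (duration in months, adherence in %) as reported for Tsanas et al [28].
periods = [
    (3, 91.0),   # first 3 months of the study period
    (9, 81.9),   # following 9 months
]

total_months = sum(months for months, _ in periods)
# Assumption: every month carries equal weight.
weighted_adherence = sum(months * rate for months, rate in periods) / total_months

# Accounting of the 12 included studies with respect to adherence reporting.
reported, not_applicable, not_mentioned = 8, 2, 2
studies_accounted_for = reported + not_applicable + not_mentioned

print(total_months, f"{weighted_adherence:.1f}%", studies_accounted_for)
```

A time-weighted figure gives the longer 9-month period proportionally more influence than a simple mean of the two rates would.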

Interestingly, one reason for the low utilization of MHAs for BD may be decreased motivation, which is often a key feature of depressive episodes [57]. Previous literature suggests that “communities of practice” around an MHA can improve long-term concordance, with social interaction and communal use (whether in person or digitally), encouraging users to continue to access it [49,50]. Integrating a “days since last updated” screening tool would help identify early relapse in the usage of MHAs for BD, and aid in assessing clinical usability [18].
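A “days since last updated” check of the kind suggested above could be sketched as follows. The function names, the user identifiers, and the 7-day threshold are assumptions made for illustration; the cited work does not specify an implementation or a cutoff.

```python
from datetime import date


def days_since_last_update(last_entry: date, today: date) -> int:
    """Number of whole days between the most recent app entry and today."""
    return (today - last_entry).days


def flag_lapsed_users(last_entries: dict[str, date], today: date,
                      threshold_days: int = 7) -> list[str]:
    """Return the IDs of users whose last entry is at least threshold_days old.

    threshold_days=7 is an arbitrary illustrative default, not a clinical recommendation.
    """
    return sorted(
        user_id
        for user_id, last_entry in last_entries.items()
        if days_since_last_update(last_entry, today) >= threshold_days
    )


# Made-up example data.
today = date(2021, 1, 31)
last_entries = {
    "p01": date(2021, 1, 30),  # 1 day ago
    "p02": date(2021, 1, 20),  # 11 days ago
    "p03": date(2021, 1, 24),  # 7 days ago
}
lapsed = flag_lapsed_users(last_entries, today)
print(lapsed)
```

In practice, a suitable threshold would need to be chosen with clinicians and patients, since a lapse in app use may itself reflect a change in mood.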

Torous et al [58] interviewed adolescent patients to identify which factors would be useful to bear in mind for MHA development. The results included safety, engagement, functionality, social interaction, awareness, gender, and participative engagement by young people. One study strongly suggested the abandonment of randomized controlled trials as a method of evaluating and improving apps, and instead called for iterative participatory research or single case designs [59]. As such, MHA developers would work in collaboration with patients throughout the design and development process in order to gain regular, ecologically valid feedback so that relevant and appealing prototypes are established. This could take the form of consumer-used tools or accreditation portals [60]. Then, when pilot and nonpilot studies are performed, both qualitative and quantitative data could be obtained in order to receive valuable feedback on how to further improve the app. The role of randomized controlled trials can then be established in validating the MHAs at later stages. Torous et al [58] theorized further reasons for low engagement, including poor usability, lack of a user-centric design, privacy concerns, and lack of trust. Another study found that MHA efficiency, effectiveness, memorability, learnability, and cognitive load were major usability barriers to continuing MHA use [61]. This evidence lends further strength to the argument that streamlining of the usability of MHAs should be at the core of future iterative development stages of MHAs in order to increase adherence rates and improve the ecological validity and reliability of evaluations.

Torous et al [58] also recognized accessibility as a factor to consider when developing MHAs. The Office for National Statistics stated that in 2018, 10% of the UK adult population were internet “nonusers” [62], meaning they had never used the internet or had not used it in the last 3 months. This brings the idea of the digital divide into the spotlight and shows that it involves complexities beyond mere accessibility. The digital divide cannot be solved by just providing patients with devices, as they will also need digital literacy skills to use the device to its full potential. Even though Torous et al [58] interviewed adolescents, their study does illustrate the need to consider the skill level of the intended audience. Ennis et al [63] found that lack of technological skills was the reason for nonengagement with computers and mobile devices. Furthermore, Ennis et al found that only a quarter of their 121 participants reported familiarity with and easy access to smartphones. Therefore, throughout the MHA evaluation process, patients’ skill sets, in addition to access, should be taken into consideration.

As well as employing participative engagement in MHA development, improving user awareness can come from creative measures, such as describing or advertising the MHA appropriately. Over a quarter of MHAs for depression failed to mention depression in the title or description [64].

Moving to the user-identified priority of safety [29,58], third parties obtaining confidential information is considered the greatest threat to MHA use [65]. Tools are available that can increase device security [50]; however, threats to privacy are continually emerging and endangering data security. For example, identity cannot be confirmed unless a video-calling app is used, and personal devices are easily lost or stolen, leaving data vulnerable [50]. Karcher et al [50] determined that the greatest threat to patient privacy in MHA use was the possibility of confidential information being shared with third parties, whether via patient or clinician devices. Hacking of secure devices and new viruses were also identified as challenges to a secure patient database on MHAs. The complex legal and ethical considerations involved in MHA use are not only a challenge in evaluating the clinical effectiveness of MHAs but also a consideration for clinicians themselves [50]. This has been highlighted in recent news, where contact tracing apps used to help curb the spread of COVID-19 have been the subject of widespread debate.

From a more pragmatic standpoint, MHA cost may be a factor in choosing the right MHA for BD, with 76% of people surveyed reporting interest in using their mobile phones for mental health monitoring and self-management if the MHA was free of charge [15]. Moreover, Larsen et al [66] reported that an MHA clinically relevant for depression is being removed from the market every 2.9 days. This furthers the challenge faced by both patients and clinicians in trying to identify a relevant and appropriate MHA for BD. If the MHA for BD is to be paid for and could be removed from the market without warning, it is difficult to justify its recommendation and purchase.

Limitations

This review has limitations. Only 7 MHAs for BD were evaluated, which limits the generalizability of the results. Furthermore, only 5 studies commented on the future of the development and evaluation of MHAs for BD.

Conclusion

The studies in our review focused on patient monitoring as an indicator for process and outcome evaluation in MHAs for BD. They based their conclusions on whether the app improved assessment scores rather than on interviews with patients about their experience of using the app. Although this is suggested to be a reliable way of measuring process and outcome values, as modern medicine shifts to holistic patient-centered care, more emphasis should be put on users’ experiences rather than quantitative outcomes. This will make patients feel respected and involved in the design of MHAs for BD, increasing adherence rates in both the short and long term.

Personalized medicine is a rapidly emerging movement in the field of health care. It is defined as a move away from the “one size fits all” approach to treatment, with new approaches and targeted therapies allowing for flexibility in the management of diseases. With this in mind, more MHAs for BD should be easily available so that patients are free to choose the MHA best suited to them. At the moment, MONARCA dominates the market, reducing the range and scope of MHAs for BD. As NHS England suggests [60], it can be difficult for an intervention to address both depressive and manic symptoms given the complexity of BD. This is all the more reason to develop a wider variety of apps, with some apps perhaps focusing only on mania or only on depression.

The field of MHAs for BD shows promise in both improving patient care and creating a more cost-effective health care service [10]. However, as with any new development in health care, it must be appropriately evaluated and regulated. By encouraging patient co-design and co-evaluation, we can develop a new frontier in personalized digital health, while improving patient experience and care.

Multimedia Appendix 1

MEDLINE and PsycINFO searches.

Multimedia Appendix 2

Study characteristics.

Multimedia Appendix 3

Study designs and outcomes.

Abbreviations

BD: bipolar disorder

DTAC: Digital Technology Assessment Criteria

MHA: mental health app

MMAT: Mixed Methods Appraisal Tool

NHS: National Health Service

Authors' Contributions

IT: acquisition of data, analysis and interpretation of data, drafting the article, and final approval of the version to be published. EC: acquisition of data, analysis and interpretation of data, drafting the article, and final approval of the version to be published. KG: design of the study, revising the manuscript critically for important intellectual content, and final approval of the version to be published. PK: design of the study, acquisition of data, analysis and interpretation of data, revising the manuscript critically for important intellectual content, and final approval of the version to be published. JS: concept of the study, revising the manuscript critically for important intellectual content, and final approval of the version to be published. EBM: concept and design of the study, interpretation of the data, revising the manuscript critically for important intellectual content, and final approval of the version to be published. ANS: concept and design of the study, revising the manuscript critically for important intellectual content, and final approval of the version to be published.

Conflicts of Interest

None declared.

References

1. Medici, Videbech, Gustafsson, Munk-Jørgensen. Mortality and secular trend in the incidence of bipolar disorder. J Affect Disord 2015 Sep 01;183:39-44. doi: 10.1016/j.jad.2015.04.032. PMID: 26001661.

2. Drancourt, Etain, Lajnef, Henry, Raust, Cochet, Mathieu, Gard, Mbailara, Zanouy, Kahn, Cohen, Wajsbrot-Elgrabli, Leboyer, Scott, Bellivier. Duration of untreated bipolar disorder: missed opportunities on the long road to optimal treatment. Acta Psychiatr Scand 2013 Feb;127(2):136-44. doi: 10.1111/j.1600-0447.2012.01917.x. PMID: 22901015.

3. Dagani, Signorini, Nielssen, Bani, Pastore, Girolamo, Large. Meta-analysis of the Interval between the Onset and Management of Bipolar Disorder. Can J Psychiatry 2017 Apr;62(4):247-258. doi: 10.1177/0706743716656607. PMID: 27462036. PMCID: PMC5407546.

4. 65th World Health Assembly. Global burden of mental disorders and the need for a comprehensive, coordinated response from health and social sectors at the country level: report by the Secretariat. World Health Organisation. 2012. URL: https://apps.who.int/iris/handle/10665/78898 [accessed 2021-03-10]

5. Patel, Shetty, Jackson, Broadbent, Stewart, Boydell, McGuire, Taylor. Delays before Diagnosis and Initiation of Treatment in Patients Presenting to Mental Health Services with Bipolar Disorder. PLoS One 2015;10(5):e0126530. doi: 10.1371/journal.pone.0126530. PMID: 25992560. PMCID: PMC4439113.

6. Poushter. Smartphone Ownership and Internet Usage Continues to Climb in Emerging Economies. Pew Research Center. 2016. URL: https://www.pewresearch.org/global/2016/02/22/smartphone-ownership-and-internet-usage-continues-to-climb-in-emerging-economies/ [accessed 2021-03-10]

7. Children and parents: media use and attitudes report: 2013. Ofcom. 2013. URL: https://www.ofcom.org.uk/research-and-data/media-literacy-research/childrens/children-parents-oct-2013 [accessed 2021-03-10]

8. Rosenfeld, Pendse, Nugent. How mobile health applications can help treat depression. The Brown University Child and Adolescent Behavior Letter 2017 Aug 22;33(9):1-6. doi: 10.1002/cbl.30236.

9. Chandrashekar. Do mental health mobile apps work: evidence and recommendations for designing high-efficacy mental health mobile apps. Mhealth 2018;4:6. doi: 10.21037/mhealth.2018.03.02. PMID: 29682510. PMCID: PMC5897664.

10. Ben-Zeev, Buck, Hallgren, Drake. Effect of Mobile Health on In-person Service Use Among People With Serious Mental Illness. Psychiatr Serv 2019 Jun 01;70(6):507-510. doi: 10.1176/appi.ps.201800542. PMID: 30947636.

11. Roberts, Chan, Torous. New tests, new tools: mobile and connected technologies in advancing psychiatric diagnosis. NPJ Digit Med 2018;1:20176. doi: 10.1038/s41746-017-0006-0. PMID: 31304350. PMCID: PMC6550288.

12. Lazar, Pan, Ragguett, Lee, Subramaniapillai, Mansur, Rodrigues, McIntyre. Digital revolution in depression: A technologies update for clinicians. Personalized Medicine in Psychiatry 2017 Dec;4-6:1-6. doi: 10.1016/j.pmip.2017.09.001.

13. Tal, Torous. The digital mental health revolution: Opportunities and risks. Psychiatr Rehabil J 2017 Sep;40(3):263-265. doi: 10.1037/prj0000285. PMID: 28891658.

14. Das Gupta, Guest. Annual cost of bipolar disorder to UK society. Br J Psychiatry 2002 Mar;180:227-33. doi: 10.1192/bjp.180.3.227. PMID: 11872515.

15. Proudfoot. The future is in our hands: the role of mobile phones in the prevention and management of mental disorders. Aust N Z J Psychiatry 2013 Feb;47(2):111-3. doi: 10.1177/0004867412471441. PMID: 23382507.

16. Wisniewski, Liu, Henson, Vaidyam, Hajratalli, Onnela, Torous. Understanding the quality, effectiveness and attributes of top-rated smartphone health apps. Evid Based Ment Health 2019 Feb;22(1):4-9. doi: 10.1136/ebmental-2018-300069. PMID: 30635262. PMCID: PMC7061529.

17. O'Brien, Colquhoun, Levac, Baxter, Tricco, Straus, Wickerson, Nayar, Moher, O'Malley. Advancing scoping study methodology: a web-based survey and consultation of perceptions on terminology, definition and methodological steps. BMC Health Serv Res 2016 Jul 26;16:305. doi: 10.1186/s12913-016-1579-z. PMID: 27461419. PMCID: PMC4962390.

18. Anderson, Allen, Peckham, Goodwin. Asking the right questions: scoping studies in the commissioning of research on the organisation and delivery of health services. Health Res Policy Syst 2008 Jul 09;6:7. doi: 10.1186/1478-4505-6-7. PMID: 18613961. PMCID: PMC2500008.

19. Arksey, O'Malley. Scoping studies: towards a methodological framework. International Journal of Social Research Methodology 2005 Feb;8(1):19-32. doi: 10.1080/1364557032000119616.

20. Levac, Colquhoun, O'Brien. Scoping studies: advancing the methodology. Implement Sci 2010 Sep 20;5:69. doi: 10.1186/1748-5908-5-69. PMID: 20854677. PMCID: PMC2954944.

21. Hong, Fàbregues, Bartlett, Boardman, Cargo, Dagenais, Gagnon, Griffiths, Nicolau, O’Cathain, Rousseau, Vedel, Pluye. The Mixed Methods Appraisal Tool (MMAT) version 2018 for information professionals and researchers. EFI 2018 Dec 18;34(4):285-291. doi: 10.3233/efi-180221.

22. Hidalgo-Mazzei, Mateu, Reinares, Murru, Del Mar Bonnín, Varo, Valentí, Undurraga, Strejilevich, Sánchez-Moreno, Vieta, Colom. Psychoeducation in bipolar disorder with a SIMPLe smartphone application: Feasibility, acceptability and satisfaction. J Affect Disord 2016 Aug;200:58-66. doi: 10.1016/j.jad.2016.04.042. PMID: 27128358.

23. Bardram, Frost, Szántó, Faurholt-Jepsen, Vinberg, Kessing. Designing mobile health technology for bipolar disorder: a field trial of the monarca system. In: CHI '13: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2013. Presented at: SIGCHI Conference on Human Factors in Computing Systems; April 27-May 2, 2013; Paris, France. p. 2627-2636. doi: 10.1145/2470654.2481364.

24. Schärer, Krienke, Graf, Meltzer, Langosch. Validation of life-charts documented with the personal life-chart app - a self-monitoring tool for bipolar disorder. BMC Psychiatry 2015 Mar 14;15:49. doi: 10.1186/s12888-015-0414-0. PMID: 25885225. PMCID: PMC4367878.

25. Osmani, Maxhuni, Grünerbl, Lukowicz, Haring, Mayora. Monitoring activity of patients with bipolar disorder using smart phones. In: MoMM '13: Proceedings of International Conference on Advances in Mobile Computing & Multimedia. 2013. Presented at: International Conference on Advances in Mobile Computing & Multimedia; December 2-4, 2013; Vienna, Austria. p. 85-92. doi: 10.1145/2536853.2536882.

26. Schwartz, Schultz, Reider, Saunders EFH. Daily mood monitoring of symptoms using smartphones in bipolar disorder: A pilot study assessing the feasibility of ecological momentary assessment. J Affect Disord 2016 Feb;191:88-93. doi: 10.1016/j.jad.2015.11.013. PMID: 26655117. PMCID: PMC4799837.

27. Faurholt-Jepsen, Frost, Ritz, Christensen, Jacoby, Mikkelsen, Knorr, Bardram, Vinberg, Kessing. Daily electronic self-monitoring in bipolar disorder using smartphones - the MONARCA I trial: a randomized, placebo-controlled, single-blind, parallel group trial. Psychol Med 2015 Oct;45(13):2691-704. doi: 10.1017/S0033291715000410. PMID: 26220802.

28. Tsanas, Saunders KEA, Bilderbeck, Palmius, Osipov, Clifford, Goodwin, De Vos. Daily longitudinal self-monitoring of mood variability in bipolar disorder and borderline personality disorder. J Affect Disord 2016 Nov 15;205:225-233. doi: 10.1016/j.jad.2016.06.065. PMID: 27449555. PMCID: PMC5296237.

29. Faurholt-Jepsen, Ritz, Frost, Mikkelsen, Margrethe Christensen, Bardram, Vinberg, Kessing. Mood instability in bipolar disorder type I versus type II-continuous daily electronic self-monitoring of illness activity using smartphones. J Affect Disord 2015 Nov 01;186:342-9. doi: 10.1016/j.jad.2015.06.026. PMID: 26277270.

30. Faurholt-Jepsen, Vinberg, Frost, Christensen, Bardram, Kessing. Smartphone data as an electronic biomarker of illness activity in bipolar disorder. Bipolar Disord 2015 Nov;17(7):715-28. doi: 10.1111/bdi.12332. PMID: 26395972.

31. Guidi, Salvi, Ottaviano, Gentili, Bertschy, de Rossi, Scilingo, Vanello. Smartphone Application for the Analysis of Prosodic Features in Running Speech with a Focus on Bipolar Disorders: System Performance Evaluation and Case Study. Sensors (Basel) 2015 Nov 06;15(11):28070-87. doi: 10.3390/s151128070. PMID: 26561811. PMCID: PMC4701269.

32. Faurholt-Jepsen, Frost, Vinberg, Christensen, Bardram, Kessing. Smartphone data as objective measures of bipolar disorder symptoms. Psychiatry Res 2014 Jun 30;217(1-2):124-7. doi: 10.1016/j.psychres.2014.03.009. PMID: 24679993.

33. Beiwinkel, Kindermann, Maier, Kerl, Moock, Barbian, Rössler. Using Smartphones to Monitor Bipolar Disorder Symptoms: A Pilot Study. JMIR Ment Health 2016 Jan 06;3(1):e2. doi: 10.2196/mental.4560. PMID: 26740354. PMCID: PMC4720836.

34. Hidalgo-Mazzei, Mateu, Reinares, Matic, Vieta, Colom. Internet-based psychological interventions for bipolar disorder: Review of the present and insights into the future. J Affect Disord 2015 Dec 01;188:1-13. doi: 10.1016/j.jad.2015.08.005. PMID: 26342885.

35. Young, Biggs, Ziegler, Meyer. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry 1978 Nov;133:429-35. doi: 10.1192/bjp.133.5.429. PMID: 728692.

36. Hamilton. A rating scale for depression. J Neurol Neurosurg Psychiatry 1960 Feb;23:56-62. doi: 10.1136/jnnp.23.1.56. PMID: 14399272. PMCID: PMC495331.

37. Altman, Hedeker, Peterson, Davis. The Altman Self-Rating Mania Scale. Biological Psychiatry 1997 Nov 15;42(10):948-955. doi: 10.1016/S0006-3223(96)00548-3. PMID: 9359982.

38. Rush, Trivedi, Ibrahim, Carmody, Arnow, Klein, Markowitz, Ninan, Kornstein, Manber, Thase, Kocsis, Keller. The 16-Item Quick Inventory of Depressive Symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry 2003 Sep 01;54(5):573-83. doi: 10.1016/s0006-3223(02)01866-8. PMID: 12946886.

39. Madera, Such, Zhang, Baker, Grande. Use of the Functioning Assessment Short Test (FAST) in defining functional recovery in bipolar I disorder. Post-hoc analyses of long-term studies of aripiprazole once monthly as maintenance treatment. Neuropsychiatr Dis Treat 2019;15:2325-2338. doi: 10.2147/NDT.S209700. PMID: 31616148. PMCID: PMC6699506.

40. Endler, Parker JDA. Assessment of multidimensional coping: Task, emotion, and avoidance strategies. Psychological Assessment 1994 Mar;6(1):50-60. doi: 10.1037/1040-3590.6.1.50.

41. Spitzer, Kroenke, Williams JBW, Löwe. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med 2006 May 22;166(10):1092-7. doi: 10.1001/archinte.166.10.1092. PMID: 16717171.

42. Rush, Carmody, Reimitz. The Inventory of Depressive Symptomatology (IDS): Clinician (IDS-C) and Self-Report (IDS-SR) ratings of depressive symptoms. Int. J. Method. Psychiat. Res 2006 Jun;9(2):45-59. doi: 10.1002/mpr.79.

43. Brown, Ryan. The benefits of being present: mindfulness and its role in psychological well-being. J Pers Soc Psychol 2003 Apr;84(4):822-48. doi: 10.1037/0022-3514.84.4.822. PMID: 12703651.

44. Olsen, Jensen, Noerholm, Martiny, Bech. The internal and external validity of the Major Depression Inventory in measuring severity of depressive states. Psychol Med 2003 Feb;33(2):351-6. doi: 10.1017/s0033291702006724. PMID: 12622314.

45. Tan, Patel, Chang. Review of the four item Morisky Medication Adherence Scale (MMAS-4) and eight item Morisky Medication Adherence Scale (MMAS-8). Innov Pharm 2014 Jan 01;5(3):347. doi: 10.24926/iip.v5i3.347.

46. Cohen. Perceived stress in a probability sample of the United States. In: Spacapan, Oskamp, editors. The social psychology of health. Thousand Oaks, CA: Sage Publications, Inc; 1988:31-67.

47. No authors listed. The World Health Organization Quality of Life assessment (WHOQOL): position paper from the World Health Organization. Soc Sci Med 1995 Nov;41(10):1403-9. doi: 10.1016/0277-9536(95)00112-k. PMID: 8560308.

48. Streicher, Smeddinck. Personalized and Adaptive Serious Games. In: Dörner, Göbel, Kickmeier-Rust, Masuch, Zweig, editors. Entertainment Computing and Serious Games. Lecture Notes in Computer Science, vol 9970. Cham: Springer; 2016:332-377.

49. Smeddinck, Herrlich, Malaka. Exergames for Physiotherapy and Rehabilitation: A Medium-term Situated Study of Motivational Aspects and Impact on Functional Reach. In: CHI '15: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 2015. Presented at: 33rd Annual ACM Conference on Human Factors in Computing Systems; April 18-23, 2015; Seoul, Republic of Korea. p. 4143-4146. doi: 10.1145/2702123.2702598.

50. Karcher, Presser. Ethical and Legal Issues Addressing the Use of Mobile Health (mHealth) as an Adjunct to Psychotherapy. Ethics & Behavior 2016 Sep 03;28(1):1-22. doi: 10.1080/10508422.2016.1229187.

51. New simpler and faster assessment process for digital health technologies launched for the NHS and social care. NHS. 2021. URL: https://www.nhsx.nhs.uk/news/new-simpler-and-faster-assessment-process-for-digital-health-technologies-launched-for-the-nhs-and-social-care/ [accessed 2021-06-24]

52. How to use the DTAC. NHS. URL: https://www.nhsx.nhs.uk/key-tools-and-info/digital-technology-assessment-criteria-dtac/how-to-use-the-dtac/ [accessed 2021-06-24]

53. NHS Apps Library. NHS. URL: https://www.nhs.uk/apps-library/ [accessed 2021-06-24]

54. Byambasuren, Beller, Glasziou. Current Knowledge and Adoption of Mobile Health Apps Among Australian General Practitioners: Survey Study. JMIR Mhealth Uhealth 2019 Jun 03;7(6):e13199. doi: 10.2196/13199. PMID: 31199343. PMCID: PMC6592476.

55. O'Connell. 23% of Users Abandon an App After One Use. DZone. 2016. URL: https://dzone.com/articles/23-of-users-abandon-an-app-after-one-use [accessed 2021-03-01]

56. Arean, Hallgren, Jordan, Gazzaley, Atkins, Heagerty, Anguera. The Use and Effectiveness of Mobile Apps for Depression: Results From a Fully Remote Clinical Trial. J Med Internet Res 2016 Dec 20;18(12):e330. doi: 10.2196/jmir.6482. PMID: 27998876. PMCID: PMC5209607.

57. Kenny, Dooley, Fitzgerald. Ecological Momentary Assessment of Adolescent Problems, Coping Efficacy, and Mood States Using a Mobile Phone App: An Exploratory Study. JMIR Ment Health 2016 Nov 29;3(4):e51. doi: 10.2196/mental.6361. PMID: 27899340. PMCID: PMC5155083.

58. Torous, Nicholas, Larsen, Firth, Christensen. Clinical review of user engagement with mental health smartphone apps: evidence, theory and improvements. Evid Based Ment Health 2018 Aug 05;21(3):116-119. doi: 10.1136/eb-2018-102891. PMID: 29871870.

59. Nicholas, Boydell, Christensen. mHealth in psychiatry: time for methodological change. Evid Based Ment Health 2016 May;19(2):33-4. doi: 10.1136/eb-2015-102278. PMID: 27044849.

60. Improving Outcomes Through Personalised Medicine. NHS. URL: https://www.england.nhs.uk/wp-content/uploads/2016/09/improving-outcomes-personalised-medicine.pdf [accessed 2021-06-24]

61. Blankenhagel. Identifying Usability Challenges of eHealth Applications for People with Mental Disorders: Errors and Design Recommendations. In: PervasiveHealth'19: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare. 2019. Presented at: 13th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 20-23, 2019; Trento, Italy. p. 91-100. doi: 10.1145/3329189.3329195.

62. Exploring the UK’s digital divide. Office for National Statistics. 2019. URL: https://www.ons.gov.uk/peoplepopulationandcommunity/householdcharacteristics/homeinternetandsocialmediausage/articles/exploringtheuksdigitaldivide/2019-03-04 [accessed 2021-06-04]

63. Ennis, Rose, Denis, Pandit, Wykes. Can't surf, won't surf: the digital divide in mental health. J Ment Health 2012 Aug 19;21(4):395-403. doi: 10.3109/09638237.2012.689437. PMID: 22712756. PMCID: PMC3433178.

64. Shen, Levitan, Johnson, Bender, Hamilton-Page, Jadad AAR, Wiljer. Finding a depression app: a review and content analysis of the depression app marketplace. JMIR Mhealth Uhealth 2015 Feb 16;3(1):e16. doi: 10.2196/mhealth.3713. PMID: 25689790. PMCID: PMC4376135.

65. Guidelines for the Practice of Telepsychology. American Psychological Association. URL: https://www.apa.org/practice/guidelines/telepsychology [accessed 2021-03-01]

66. Larsen, Nicholas, Christensen. Quantifying App Store Dynamics: Longitudinal Tracking of Mental Health Apps. JMIR Mhealth Uhealth 2016 Aug 09;4(3):e96. doi: 10.2196/mhealth.6020. PMID: 27507641. PMCID: PMC4995352.