Original Paper
Abstract
Background: First responders, military personnel, and veterans face disproportionate risk for mental health and wellness issues. Stigma and confidentiality are common barriers to traditional services. Mobile health interventions offer anonymous, convenient, and cost-effective alternatives.
Objective: This study aimed to evaluate the impact of GUIDE, a mobile health intervention, on wellness, emotional well-being, mental health, social connectedness, and personal growth in first responders, military personnel, and veterans.
Methods: In this unblinded, randomized, waitlist-controlled trial, 115 participants recruited online and offline were virtually enrolled and allocated into 3 groups: GUIDE with financial incentives (GUIDE+incentives, n=37), GUIDE-only (n=39), or waitlist control (n=39). Web-based surveys assessed baseline and posttrial wellness (PERMA [Positive Emotion, Engagement, Relationships, Meaning, Accomplishment] Overall, World Health Organization 5-item Well-being Index, Personal Well-being Score, and PERMA Health), emotional well-being (PERMA Positive Emotion, Negative Emotion, and Happiness; and Difficulties in Emotion Regulation Scale), mental health (Patient Health Questionnaire-8 [depression] and Generalized Anxiety Disorder-7 [anxiety]), social connectedness (PERMA Relationships and Loneliness), and personal growth (PERMA Accomplishment, Meaning, and Engagement). App engagement and technical merit were also evaluated.
Results: Among the participants, 93.0% (107/115) completed the posttrial assessment and were included in the main analysis. In repeated measures ANOVAs, there were no significant group×time interactions for wellness (ηp2=0.02-0.04), emotional well-being (ηp2=0.01-0.12), mental health (ηp2=0.01-0.02), social connectedness (ηp2=0.03-0.04), and personal growth (ηp2=0.01-0.02) (P≥.05). In post hoc pairwise comparisons of pre-post deltas per group, the GUIDE-only group showed significant improvements over the waitlist control group in wellness (d=0.05-0.31), emotional well-being (d=0.06-0.31), depression (d=–0.30), and anxiety (d=–0.29) (P<.05). No group differences were significant for social connectedness (d=–0.20 to 0.20) or personal growth (d=–0.04 to 0.06) (P≥.05). Among participants allocated to GUIDE interventions, 67% (51/76) completed at least three activities weekly. Compared to the GUIDE-only group, the GUIDE+incentives group completed significantly more activities (t72=2.01; P=.05), posts (t72=2.15; P=.04), and replies (t72=3.40; P=.001) but not lessons (t72=0.73; P=.47), likes (t72=1.22; P=.23), or mood surveys (t72=0.48; P=.63). App engagement correlated with improvements in wellness, emotional well-being, mental health, and personal growth measures, driven by educational and peer-support features. Participants who engaged more felt more accomplished, regardless of the features used (r=0.23-0.36). GUIDE had good appropriateness (mean 4.01) and feasibility (mean 4.01) scores, and satisfactory acceptability (mean 3.81) and usability (mean 70.62) scores. Exploratory subgroup analyses suggested that GUIDE may be most beneficial to military-affiliated and male individuals.
Conclusions: GUIDE is a feasible and appropriate intervention with the potential to improve the mental health and well-being of first responders, veterans, and military personnel. Financial incentives increased engagement with peer-support features but did not lead to significant improvements over waitlist controls. Future research should assess whether improvements are sustained in the long term.
Trial Registration: ClinicalTrials.gov NCT06336967; https://clinicaltrials.gov/study/NCT06336967
doi:10.2196/71155
Keywords
Introduction
Background
There is an urgent need to address mental suffering, isolation, and burnout among first responders, active-duty military personnel, and veterans. Due to the nature of their occupations, they are disproportionately exposed to trauma, stress, and sleep disturbances, which increase their risk of mental health conditions, including depression [-], anxiety [-], posttraumatic stress disorder [-], and substance abuse [-]. In the wake of the COVID-19 pandemic, professional stress has only compounded for first responders, who report that heightened workloads, isolation, increased intensity, and negative public sentiment have come at emotional, physical, and mental costs [,]. These conditions in turn put both military personnel and first responders at an elevated risk for suicide as compared to the general civilian population [-], to the extent that police officers and firefighters are more likely to die by suicide than in the line of duty []. This crisis comes with urgent costs to the public, since mental health and suicidality can result in absenteeism, problems with retention, higher error rates, and decreased job performance, which can lead to security risks at the national and local levels [,].
Despite the severity and prevalence of mental health issues, and the growing awareness of the importance of mental health from military personnel and first responders [], services to address this crisis continue to go underutilized, including by those who need them the most []. First responders and military personnel share concerns about stigma, and they fear being ostracized by peers, family members, and the leadership for seeking treatment or asking for help [,]. They pride themselves on strength and do not want to be perceived as weak or soft, which leads them to neglect symptoms until they are severe [,]. Additionally, the time and cost associated with traditional mental health care can be prohibitive for shift workers and those with busy schedules [].
Research indicates that education and training about mental health and wellness can play an important role in reducing stigma, increasing awareness, and cultivating a more resilient mindset [,,,,]. The challenge comes from making mental health and self-care training accessible and appealing to first responders and military personnel specifically, and importantly, emphasizing its confidentiality to address any privacy concerns. Peer support, another key strategy for building resilience and promoting well-being among “warriors,” faces similar challenges, since stigma and concerns about professional ramifications inhibit first responders and military personnel from reaching out to one another about their concerns, especially face-to-face [,-].
Overcoming Barriers With a Digital Solution
Existing solutions for mental health and wellness are not adequate in addressing the needs of first responders, veterans, and active-duty military personnel. These individuals are in need of interventions that take their unique needs and barriers into consideration. Mobile health (mHealth) interventions offer a promising and scalable alternative to existing solutions, with several advantages over traditional clinical models. Namely, they can be anonymous, confidential, convenient, cost-effective, and asynchronous, which in combination can counteract stigma, time, and cost barriers [,]. While digital interventions are relatively new, studies indicate that they can be efficacious in reducing depressive symptoms, anxiety levels, and isolation [-], and that they can play a proactive role in promoting well-being and preventing potential mental health issues from becoming severe [-].
One digital mental health intervention built explicitly for the “warrior” community is the GUIDE app []. GUIDE is a smartphone app that uses educational videos, prompts, small discussion groups, and surveys to build resilience and promote wellness among first responders and military personnel. The GUIDE team has conducted extensive research with stakeholders from across first responder and military communities to build a compelling platform that hinges upon 2 evidence-based approaches to wellness: education and peer support. Crucially, it does so with complete anonymity. The platform does not store personal identifiable information, so users can engage with the app without confidentiality concerns.
GUIDE provides wellness education through its learning management system of courses that focus on topics relevant to first responders and military personnel (eg, financial wellness, communication, stress reduction, physical wellness, and meditation). Each course is made up of 1 to 4 micro-lessons. GUIDE users can discuss the content of their lessons, as well as anything that is on their mind, in an assigned small group chat, which is the peer-support element of the GUIDE app. The small group chat is comprised of 15-30 peers from across the country and is moderated by GUIDE employees, so members can get the support they need in a positive environment, from people who understand.
In a 4-week quasiexperimental pilot study with 16 participants from a US police department, GUIDE with financial incentives showed promising results in promoting well-being, reducing anxiety, and encouraging personal growth []. Paired-samples pre-post t tests on the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) overall score (PERMA Overall) and Personal Well-being Score (PWS) showed that users experienced statistically significant improvements in overall well-being (P=.007) and accomplishment (P<.001), and reductions in anxiety (P=.002).
Current Study
In this study, we aimed to test the impact of GUIDE with and without financial incentives on the wellness, emotional well-being, mental health, social connectedness, and personal growth of first responders, military personnel, and veterans through a fully powered randomized waitlist-controlled trial. For our theory of change model, see Figure S1 in . Additional objectives included exploring app engagement, assessing the influence of financial incentives on app usage, and reporting on implementation outcomes and technical merit (eg, feasibility and user friendliness). Subgroup analyses were conducted to explore whether there were variations in outcomes for first responders versus those with military affiliation, as well as by sex.
Methods
Study Design
This study is a fully powered, unblinded, randomized, waitlist-controlled trial designed to evaluate how the GUIDE app, a digital health intervention for first responders, military personnel, and veterans, impacts wellness, emotional well-being, mental health, social connectedness, and personal growth. Participants were allocated to 1 of 3 groups (GUIDE access with financial incentives [“GUIDE+incentives”], GUIDE access without financial incentives [“GUIDE-only”], and waitlisted GUIDE access [“waitlist control”]) for the 4-week trial.
Recruitment
Recruitment for this study took place online and in-person through social media posts, email lists, existing GUIDE partnerships, and local events for first responders and military personnel. Recruitment materials invited potential participants to take an eligibility screener survey on REDCap, a Health Insurance Portability and Accountability Act (HIPAA)-compliant web application for building and managing online surveys and databases. To be eligible, respondents had to be first responders, veterans, or military personnel in the United States with full-time employment; had to be willing to download the GUIDE app on their personal device; had to not be receiving regular compensation from Nlyten Corp or GUIDE; and had to not have previously used GUIDE. Respondents who previously used GUIDE; received regular compensation from GUIDE or Nlyten Corp; expressed disinterest in downloading GUIDE on their phone; were employed less than full time; or were not first responders, veterans, or military personnel were considered ineligible.
Because this was an app-based study with web-mediated interactions, technical literacy was also a “de facto” eligibility criterion. To avoid selection bias against nondigital or older first responders, we used in-person recruitment meetings where GUIDE personnel helped first responders, veterans, and military personnel sign up for the study via an iPad.
mHealth Intervention: The GUIDE App
GUIDE is a digital wellness app built with “warriors” in mind. It uses a small group chat, a learning management system, a “drivers of behavior change” model, and an anonymous user experience to promote social connectedness, personal growth, resilience, and well-being in a way that is accessible to the target population. See for a summary of the GUIDE model. GUIDE has been implemented in several use cases and care settings, including police departments, fraternal orders, veterans’ nonprofits, veteran service organizations, government departments (eg, military), and other organizations in support of the delivery of mental health programming. As of December 2024, GUIDE has over 18,000 users, who are referred to as members.
During app onboarding, members personalize GUIDE by answering questions about their wellness interests and experiences, as well as their backgrounds. To ease concerns about confidentiality, GUIDE does not collect personal identifying information and instead safeguards these data with Okta, an industry leader in identity and login privacy. The anonymity of GUIDE is a key strength of the app, since it gives first responders, military personnel, and veterans predisposed to stigma concerns a way to access mental health and health interventions, without risking exposing their vulnerabilities to family members, peers, or leaders.
After set-up, members reach the app homepage, where they fill out a brief questionnaire to calculate their baseline wellness score, which increases and decreases depending on whether assigned tasks are completed and how wellness questions are answered. Members may retake the initial questionnaire at any point to see how their wellness score changes over time. Underneath, the homepage contains a personalized daily plan, designed to take 15 minutes or less, which aggregates recommended content from across the app (eg, lessons, surveys, and peer posts) into a single list to help members get the most out of GUIDE. Beyond the homepage, GUIDE is organized into 4 tabs: group, courses, practices, and profile (Table S1 in ). Micro-lessons were developed by a team of experienced professionals specialized in social work, trauma-informed care, and product design. See for a full list of micro-lessons, organized by category.
GUIDE recommends that members complete 3 lessons and 3 mood tracker surveys per week, as well as 3 posts, 3 replies, and 3 likes in their small group chat, to get the most benefits from the app. Members may turn on notifications to promote sustained engagement.

Primary Outcome Measures
Wellness
PERMA Overall, our primary outcome variable, is a composite score from the PERMA-Profiler designed to measure self-reported flourishing in adults across 5 factors or subscales: Positive Emotion, Engagement, Relationships, Meaning, and Accomplishment []. It contains 15 items rated on an 11-point Likert scale (from 0 to 10). Scores are calculated as the average of the items comprising each factor or subscale, with higher scores indicating higher flourishing. A score of ≤6.4 is considered suboptimal, while a score of ≥8.0 is considered high []. It has demonstrated high internal consistency across settings (α=.92-.95) [].
The PWS is a self-reported measure based on the Office of National Statistics 4 subjective well-being questions (ONS4) and thresholds. It contains 4 items, rated on a scale of 0 to 4 (0, “Disagree;” 1, “Neutral;” 2, “Agree;” and 3, “Strongly agree”). The summary score is calculated as an aggregate of the 4 items, transformed to a 0-100 scale, with higher scores indicating higher well-being. It shows good internal reliability (α=.90) [].
The World Health Organization 5-item Well-being Index (WHO-5) is a self-reported measure of current mental well-being []. It consists of 5 statements rated on a 5-point scale in relation to the previous 2 weeks (5, “All of the time;” 4, “Most of the time;” 3, “More than half of the time;” 2, Less than half of the time;” 1, “Some of the time;” and 0, “At no time”). Scores are calculated by adding up each item and multiplying the total by 4, with 0 representing the worst imaginable well-being and 100 representing the best imaginable well-being. A clinical cutoff score of ≤50 indicates low well-being, while ≤28 indicates clinical depression []. It has high internal consistency and reliability (α=.81-.90), is valid in screening for depression, has strong construct validity among adolescents and adults, and is sensitive to change in interventions [,].
The PERMA Health subscale is a 3-item filler measure from the larger PERMA-Profiler, used to assess perceived physical health and vitality. Each item is rated from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating better perceived health. It is not a part of the main PERMA model and is instead used to provide additional context. It has high reliability across settings and contexts (α=.86-.93) [].
Emotional Well-Being
The PERMA Positive Emotion subscale is a 3-item subscale from the PERMA-Profiler, designed to measure positive emotions, including joy, hope, pleasure, happiness, and contentment []. Each item is ranked from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating greater positivity. A score of ≤6.4 is considered suboptimal, while ≥8.0 is considered high []. It has demonstrated good internal consistency across settings (α=.71-.89) [].
The PERMA Happiness subscale is a single filler item from the PERMA-Profiler, used alongside the primary PERMA model to provide additional context: “Taking all things together, how happy would you say that you are?” It is rated from 0 to 10, with higher scores indicating more happiness [].
The PERMA Negative Emotion subscale is a 3-item filler measure from the PERMA-Profiler, used to assess tendencies toward feeling sad, anxious, or angry. Each item is rated from 0 to 10. Scores are calculated as the average of item scores. Lower scores are better (ie, less negativity). It is not a part of the main PERMA model and is instead used to provide additional context. It has acceptable reliability and has been shown to be negatively correlated with well-being measures (α=.63-.75) [].
The Difficulties in Emotion Regulation Scale–Short Form (DERS-SF; emotion dysregulation) is an 18-item self-reported measure that assesses emotion dysregulation. It includes 6 subscales with 3 items each, including strategies, nonacceptance, impulse, goals, awareness, and clarity. Each item is rated on a 5-point scale of frequency ranging from 1 (“Almost never 0-10%”) to 5 (“Almost always 91-100%”). Items are summed (awareness items are reverse coded) to create a total score of 18 to 90, with higher scores indicating greater difficulties with emotion regulation []. It is a validated measure for assessing emotion regulation problems and possesses high internal consistency (α=.89-.91) [].
Mental Health
Patient Health Questionnaire-8 (PHQ-8) is an 8-item diagnostic and severity measure for assessing depressive disorders in large clinical settings []. Statements about the previous 2 weeks are rated on a 4-point scale ranging from 0 (“Not at all”) to 3 (“Nearly every day”). Scores are reported on a scale from 0 to 24, with higher scores indicating greater severity in depressive symptoms. Scores above 10 indicate clinically significant depressive symptoms. It has high internal consistency, reliability (α=.82), and validity, with 96.5% of those who score above the clinical cutoff having a depression diagnosis [,].
Generalized Anxiety Disorder-7 (GAD-7) consists of 7 items rated on a 4-point Likert scale ranging from 0 (“Not at all”) to 3 (“Nearly every day”), with the same options as in the PHQ-8 []. Items are summed to create a total severity score ranging from 0 to 21, with higher scores indicating greater symptom severity. Scores above 10 indicate clinically significant anxiety symptoms. It has high internal consistency, reliability (α=.89), and validity, with moderate correlations with related constructs such as depression and self-esteem [].
Social Connectedness
The PERMA Relationships subscale consists of 3 items from the larger PERMA-Profiler that measure whether a person feels cared about, authentically connected, and securely connected to others []. Each item is rated on a scale from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating higher social connectedness. A score of ≤6.4 is considered suboptimal, while ≥8.0 is considered high []. It has demonstrated high internal consistency across settings (α=.75-.85) [].
The PERMA Loneliness subscale is a single filler item from the PERMA-Profiler, used alongside the primary PERMA model to provide additional context: “How lonely do you feel in your daily life?” It is rated from 0 to 10, with lower scores indicating less loneliness. It is negatively correlated with the PERMA well-being measures [].
Personal Growth
The PERMA Accomplishment subscale contains 3 items from the PERMA-Profiler that measure drive toward personal goals []. Each item is rated from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating higher accomplishment. A score of ≤6.4 is considered suboptimal, while ≥8.0 is considered high []. It has demonstrated good internal consistency across settings (α=.70-.86) [].
The PERMA Meaning subscale contains 3 items from the PERMA-Profiler that measure sense of purpose []. Each item is rated from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating a greater sense of meaning. A score of ≤6.4 is considered suboptimal, while ≥8.0 is considered high []. It has demonstrated high internal consistency across settings (α=.85-.92) [].
The PERMA Engagement subscale contains 3 items from the PERMA-Profiler that measure absorption, interest, and involvement in the world []. Each item is rated from 0 to 10. Scores are calculated as the average of item scores, with higher scores indicating greater engagement. A score of ≤6.4 is considered suboptimal, while ≥8.0 is considered high []. It has acceptable internal consistency (α=.60-.81) [].
Secondary Outcome Measures
App Engagement
App engagement is assessed by the total and weekly sums of activities completed in the app during the trial period, calculated as the sum of lessons, posts, replies, likes, and mood surveys.
Weekly sums are assessed relative to recommended benchmarks from GUIDE (≥3 lessons, posts, replies, likes, and mood surveys), as well as minimum usage standards (≥3 activities).
Implementation Outcomes
The Acceptability of Intervention Measure (AIM) assesses how agreeable or likable an intervention is to stakeholders. It contains 4 items rated on a 5-point Likert Scale ranging from 1 (“Completely disagree”) to 5 (“Completely agree”). Scores are calculated as the average of item scores, with higher scores indicating greater acceptability. It has been shown to have good internal consistency and test-retest reliability (α=.83-.85) [].
The Intervention Appropriateness Measure (IAM) assesses the perceived fit of an intervention to a specific setting. It contains 4 items rated on a 5-point Likert Scale ranging from 1 (“Completely disagree”) to 5 (“Completely agree”). Scores are calculated as the average of item scores, with higher scores indicating greater appropriateness. It has been shown to have high internal consistency and test-retest reliability (α=.87-.91) [].
The Feasibility of Intervention Measure (FIM) assesses how easily an intervention can be carried out within a given setting. It contains 4 items rated on a 5-point Likert Scale ranging from 1 (“Completely disagree”) to 5 (“Completely agree”). Scores are calculated as the average of item scores, with higher scores indicating greater feasibility. It has been shown to have good internal consistency and test-retest reliability (α=.88-.89) [].
Technical Merit
The System Usability Scale (SUS) assesses the perceptions of usability of a given system (in this case, GUIDE) []. It contains 10 items rated on a 5-point Likert Scale ranging from 1 (“Strongly disagree”) to 5 (“Strongly agree”). Responses are transformed to get a total range of possible values from 0 to 100, with higher scores indicating better usability. It has high internal consistency and reliability (α=.91) []. Scores above 68 are regarded as satisfactory or above average, while scores above 80 are considered good [].
Procedure
Potential participants from across the United States were instructed to take an eligibility survey on REDCap. Respondents deemed eligible according to the study criteria were immediately directed to a copy of an Institutional Review Board (IRB)-approved informed consent form and HIPAA authorization for virtual signature. According to institutional guidelines, the informed consent stressed that this study would not influence their employment status and that they could opt out at any time without consequence.
After providing consent, REDCap automatically emailed the baseline assessment survey link, which included demographic questions and our self-report outcome measures (PERMA-Profiler, WHO-5, PWS, GAD-7, PHQ-8, and DERS-SF). Consented respondents received a separate welcome email sent manually by the research team, instructing them to schedule a virtual intake meeting. Participants were asked to complete the baseline assessment prior to their scheduled intake meeting.
Prior to intake, participants were randomly assigned to 1 of 3 study arms (GUIDE+incentives, GUIDE-only, or waitlist control). The clinical research coordinator conducted a manual block randomization procedure in Microsoft Excel, using the RAND() function to generate predetermined blocks at the beginning of the study. Participants were assigned to blocks and treatment groups sequentially based on the order in which they signed up for an intake meeting. If a participant missed their intake meeting, their assigned spot was given to the next enrolled participant to maintain the allocation balance. Since app onboarding and financial incentives were explained to participants during the intake meeting, this study was unblinded to both participants and study personnel.
Intake meetings were conducted by the clinical research coordinator over Zoom (Zoom Communications). The coordinator ensured that each participant understood what the study entailed and answered outstanding questions before informing them of their assigned treatment group. Regardless of the assigned group, all participants used this time to schedule a posttrial follow-up meeting for the end of the trial period, approximately 4 weeks out from the intake meeting. Afterward, participants from the waitlist control group were informed that they would receive access to GUIDE during the posttrial meeting and were dismissed.
Participants assigned to the GUIDE+incentives and GUIDE-only groups were instructed to download and set up the app on their personal device. They were provided with an invitation code associated with the research study, since GUIDE is an invite-only platform. After setting up the app, participants were given a brief overview of GUIDE features, including the home, group, courses, and practices tabs. Participants were told that if a higher level of care was required while using the app, they should navigate to the in-app SOS button and use the available crisis hotlines and resources.
Next, participants were informed about weekly usage benchmarks recommended by the GUIDE team (at least 3 lessons, small group posts, replies to peers, likes to peer posts, and mood surveys), while those assigned to the GUIDE+incentives group were additionally told that they would receive compensation for reaching these recommended benchmarks. Participants in the GUIDE+incentives group earned Amazon gift cards of increasing value for each week they met GUIDE’s criteria: US $10 for week 1 of the trial, US $20 for week 2, US $30 for week 3, and US $40 for week 4. Gift cards were emailed to participants by GUIDE on the Tuesday after they were earned.
After finishing this intake meeting, participants were considered enrolled in the study. All enrolled participants received US $50 from the research team for completing their intake meeting, regardless of treatment group. The day of the intake meeting was considered day 1 of the 28-day trial period. During the trial, participants in the GUIDE+incentives and GUIDE-only groups were asked to use GUIDE as recommended to support their wellness. Participants in these conditions received text message reminders from the GUIDE team if they did not meet the minimum weekly usage by day 5 of each trial week.
On the last day of the trial, participants automatically received a link from REDCap to their posttrial assessment. The posttrial assessment included the same primary outcome measures from the baseline assessment (PERMA-Profiler, WHO-5, PWS, GAD-7, PHQ-8, and DERS-SF). Additionally, assessments for participants assigned to the GUIDE+incentives and GUIDE-only groups included SUS, AIM, IAM, and FIM measures. After completing the posttrial assessment, participants received a US $50 payment from the research team for their time.
During the virtual posttrial follow-up meeting with the clinical research coordinator, participants in the waitlist control group received access to the GUIDE app and were provided the same overview that the participants in the GUIDE+incentives and GUIDE-only groups received during intake. Participants in the GUIDE+incentives and GUIDE-only groups provided feedback about their experiences using the app, which the research team aggregated and provided to the GUIDE team.
There were no changes to our methods after trial commencement, and deviations to the registered protocol did not occur; however, periodic bugs impacting participant access and use of GUIDE were mentioned in open-ended survey responses and during posttrial meetings.
Ethical Considerations
The study was conducted in accordance with the CONSORT (Consolidated Standards of Reporting Trials) guidelines for digital health interventions () []. Ethical approval was obtained from the University of Pennsylvania IRB (IRB number: 855282), and the study has been registered on ClinicalTrials.gov (NCT06336967). We obtained informed consent and HIPAA authorization via virtual signature for all participants. The informed consent stressed that this study would not influence their employment status and that they could opt out at any time without consequence. At the conclusion of the trial, data were anonymized and deidentified for future analysis.
Participants were compensated US $100 for their time and effort (US $50 for their completed baseline assessment and intake meeting, and US $50 for completing their posttrial assessment).
Data Analysis
To test how GUIDE access impacted our primary outcomes of wellness, emotional well-being, mental health, social connectedness, and personal growth, we ran a 3-way repeated measures ANOVA (3 groups: GUIDE+incentives, GUIDE-only, and waitlist control) for each measure. Qualification for this analysis was based on participant completion of both the baseline and posttrial assessments (and therefore did not include participants lost to follow-up).
We applied Bonferroni correction to control for type I errors (false positives) arising from multiple comparisons, given the large number of outcome measures in this study. Bonferroni was chosen for its strict alpha control requirements, ensuring that the risk of any false positive findings remained below the conventional significance threshold of .05 when performing multiple statistical tests, as was required for this analysis []. However, it is acknowledged that Bonferroni correction is a conservative method, which increases the risk of type II errors (false negatives), particularly when many tests are conducted.
To test how GUIDE app engagement was related to wellness, emotional well-being, mental health, social connectedness, and personal growth, we conducted an exploratory analysis of the relationships among app engagement (total, weekly, and by feature) for each of our outcome measures (change between pre and post) using Pearson correlation coefficients. To assess the extent to which participants engaged with GUIDE, we ran descriptive statistics for each of our app engagement outcome variables. To test whether financial incentives influenced engagement, we ran an independent samples t test to assess how financial incentives impacted engagement with GUIDE features. To assess GUIDE for implementation outcomes and technical merit, we reported on descriptive statistics.
In addition to our main analysis, we conducted post hoc exploratory subgroup analyses to assess whether first responder status, military affiliation, or sex impacted outcomes by rerunning 3-way repeated measures ANOVA (3 groups: GUIDE+incentives, GUIDE-only, and waitlist control) for select primary outcome measures.
We included all randomized participants who completed their posttrial assessment in our analysis, regardless of app usage during the trial period. App engagement, implementation outcomes, and technical merit analyses included fewer participants, since only the GUIDE+incentives and GUIDE-only groups completed these measures. Participants who did not complete their posttrial assessments were included in app engagement analyses but excluded from analyses of primary outcomes and implementation outcomes, and subgroup analyses. All analyses were conducted in SPSS 30 (IBM Corp), while data visualizations were created in Excel (Microsoft Corp).
To achieve 80% power for 3 groups based on the lowest effect size (d=0.40) and correlation (r=0.47) of the pre-post t tests from a pilot study [], our trial required at least 68 participants (approximately 23 per group) to 108 participants (approximately 36 per group). Based on these parameters, there was appropriate power for the main analysis, as well as the first responder and male subgroup analyses, but not for the military and female subgroup analyses. The results from underpowered subgroups should be interpreted with caution.
Results
Demographics
Data were analyzed for skewness, kurtosis, and outliers by calculating z-scores. Based on the established 3.29 cutoff [], the PHQ-8, GAD-7, and DERS-SF scores were positively skewed; therefore, these variables were log transformed for our main analysis. App engagement metrics (total and by feature) were also positively skewed and therefore log transformed for later analysis. After transformation, our outcome measures were normally distributed, and therefore, parametric tests were used. Next, data were assessed for differences between treatment groups. We found that there were differences between groups at baseline for some of our outcome measures (). In our analyses, we controlled for baseline, where differences between groups were significant (P<.05).
All 115 enrolled participants were first responders, active-duty military personnel, or veterans and were block randomized (GUIDE+incentives, GUIDE-only, or waitlist control). Treatment groups were matched on all participant demographic variables (age, race, sex, ethnicity, education, years with employer, responder type, and receipt of behavioral health care), indicating that the randomization to the treatment versus control groups was successful ().
| Baseline outcome measure | GUIDE+incentives group (n=39), mean (SD) | GUIDE-only group (n=37), mean (SD) | Waitlist control group (n=39), mean (SD) | One-way ANOVA P value | |
| Wellness | |||||
| PERMA Overalla | 6.57 (1.43) | 7.16 (1.65) | 6.45 (1.43) | .09b | |
| PWSc,d | 67.52 (22.52) | 77.25 (21.57) | 60.90 (24.04) | .01e | |
| WHO-5d,f | 53.03 (21.32) | 57.3 (21.87) | 52.21 (20.66) | .54 | |
| PERMA Healthg | 6.01 (1.69) | 6.80 (1.83) | 5.54 (1.77) | .01e | |
| Emotional well-being | |||||
| PERMA Positive Emotion | 6.07 (1.81) | 6.61 (2.09) | 5.89 (1.64) | .21 | |
| PERMA Happiness | 6.64 (1.91) | 6.89 (1.90) | 6.31 (1.81) | .40 | |
| PERMA Negative Emotion | 4.45 (2.18) | 3.81 (1.93) | 4.22 (1.74) | .36 | |
| DERS-SFh,i | 36.54 (12.59) | 31.19 (10.17) | 39.00 (12.46) | .02e | |
| Mental health | |||||
| GAD-7j,k | 13.36 (5.73) | 11.00 (4.34) | 13.54 (4.32) | .04e | |
| PHQ-8l,m | 7.36 (5.15) | 5.14 (5.05) | 7.51 (5.19) | .08b | |
| Social connectedness | |||||
| PERMA Relationships | 6.56 (2.07) | 7.43 (2.10) | 6.14 (1.94) | .02e | |
| PERMA Loneliness | 3.69 (2.99 | 3.57 (2.77) | 5.13 (3.10) | .04e | |
| Personal growth | |||||
| PERMA Accomplishment | 6.83 (1.60) | 7.60 (1.44) | 6.74 (1.60) | .03e | |
| PERMA Meaning | 6.91 (2.03) | 7.77 (1.95) | 6.74 (1.88) | .06e | |
| PERMA Engagement | 6.46 (1.56) | 6.39 (1.55) | 6.72 (1.39) | .60 | |
aPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
bP=.05-.10.
cPWS: Personal Well-being Score.
dPWS and WHO-5 assess well-being.
eP<.05.
fWHO-5: World Health Organization 5-item Well-being Index.
gPERMA Health is a measure of perceived health.
hDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
iDERS-SF assesses emotion dysregulation.
jGAD-7: Generalized Anxiety Disorder-7.
kGAD-7 assesses anxiety symptoms.
lPHQ-8: Patient Health Questionnaire-8.
mPHQ-8 assesses depression symptoms.
| Characteristic | GUIDE+incentives group (n=39) | GUIDE-only group (n=37) | Waitlist control group (n=39) | Comparison statisticsa | ||||||||||
| P value | Chi-square (df) | |||||||||||||
| Age (years), mean (SD) | 42.77 (10.28) | 42.22 (11.43) | 43.69 (10.82) | .84 | —b | |||||||||
| Race, n (%) | ||||||||||||||
| White | 34 (87) | 32 (87) | 36 (92) | — | 0.68 (2) | |||||||||
| Black or African American | 4 (10) | 3 (8) | 1 (3) | — | 0.39 (2) | |||||||||
| American Indian or Alaska Native | 1 (3) | 0 (0) | 0 (0) | — | 0.37 (2) | |||||||||
| Hispanic or Latino | 2 (5) | 4 (11) | 0 (0) | — | 0.11 (2) | |||||||||
| Asian | 0 (0) | 0 (0) | 2 (5) | — | 0.14 (2) | |||||||||
| Refused to answer | 0 (0) | 1 (3) | 0 (0) | — | 0.35 (2) | |||||||||
| Sex, n (%) | — | 0.78 (2) | ||||||||||||
| Male | 27 (69) | 28 (76) | 27 (69) | |||||||||||
| Female | 12 (31) | 9 (24) | 12 (31) | |||||||||||
| Ethnicity, n (%) | — | 0.11 (2) | ||||||||||||
| Hispanic or Latino | 2 (5) | 4 (11) | 0 (0) | |||||||||||
| Education, n (%) | — | 0.47 (6) | ||||||||||||
| High school diploma | 3 (8) | 3 (8) | 3 (8) | |||||||||||
| Some college | 6 (16) | 9 (23) | 12 (31) | |||||||||||
| College graduate | 17 (46) | 21 (54) | 13 (33) | |||||||||||
| Graduate or professional training | 11 (30) | 6 (15) | 11 (28) | |||||||||||
| Years with employer, n (%) | — | 0.49 (12) | ||||||||||||
| <1 | 6 (16) | 7 (18) | 6 (15) | |||||||||||
| 1-2 | 1 (3) | 2 (5) | 5 (13) | |||||||||||
| 2-5 | 5 (14) | 6 (15) | 6 (15) | |||||||||||
| 5-10 | 8 (22) | 10 (26) | 10 (26) | |||||||||||
| 10-15 | 3 (8) | 7 (18) | 6 (15) | |||||||||||
| 15-20 | 4 (11) | 2 (5) | 0 (0) | |||||||||||
| >20 | 10 (27) | 5 (13) | 6 (15) | |||||||||||
| Responder type, n (%) | — | 0.29 (4) | ||||||||||||
| Responder only | 21 (57) | 22 (56) | 27 (69) | |||||||||||
| Military only | 9 (24) | 10 (26) | 3 (8) | |||||||||||
| Military and responder | 7 (19) | 7 (18) | 9 (23) | |||||||||||
| Household income (US$), n (%) | — | 0.59 (14) | ||||||||||||
| 25,000-34,999 | 1 (3) | 1 (3) | 1 (3) | |||||||||||
| 35,000-49,999 | 1 (3) | 2 (5) | 1 (3) | |||||||||||
| 50,000-74,999 | 6 (15) | 3 (8) | 4 (10) | |||||||||||
| 75,000-99,999 | 4 (10) | 7 (19) | 4 (10) | |||||||||||
| 100,000-149,999 | 13 (33) | 9 (24) | 16 (41) | |||||||||||
| 150,000-199,999 | 7 (18) | 3 (8) | 6 (15) | |||||||||||
| ≥200,000 | 3 (8) | 10 (27) | 4 (10) | |||||||||||
| Prefer not to answer | 4 (10) | 2 (5) | 3 (8) | |||||||||||
| Behavioral health care | — | 0.29 (2) | ||||||||||||
| Yes | 18 (46) | 17 (46) | 24 (62) | |||||||||||
| No | 20 (51) | 21 (57) | 15 (39) | |||||||||||
aOne-way ANOVA or chi-square test.
bNot applicable.
Enrollment and Dropout Analysis
Of the 115 participants enrolled, 107 (93.0%) completed the posttrial assessment. A CONSORT flow diagram of enrollment and dropout rates by assigned treatment group is presented in .
Changes to self-reported primary outcome measures could not be assessed for participants who did not complete the posttrial assessment (n=8), and therefore, these participants were excluded from the analysis of our primary outcomes. While imputing data or using pairwise deletions may have been a stronger approach [,], we used listwise deletions in this instance since our investigation was fully powered without the participants lost to follow-up. To determine whether participants who dropped out were significantly different from those who completed their posttrial assessments, Welch t tests and chi-square tests were performed to compare demographic features, baseline outcome measures, and overall app activity levels. Only 2 significant differences were found. First, participants who dropped out were less likely to have received behavioral health care in the past (χ21=5.18; P=.02). Second, among participants assigned to the GUIDE+incentives and GUIDE-only treatment groups, those who dropped out used the app significantly less over the 4-week trial period than those who completed their posttrial assessment (t72=–8.18; P<.001). These differences suggest that dropout may have been influenced by baseline characteristics and engagement levels, which could introduce bias by overestimating intervention effects, since participants who are less likely to seek help for mental health and less likely to use the intervention were not included in our analysis of primary outcomes.

Primary Outcomes
To assess how GUIDE influenced wellness, emotional well-being, mental health, social connectedness, and personal growth, repeated measures ANOVAs were conducted to examine the effect of time and treatment group on each outcome measure. Post hoc pairwise comparisons were included to explore between-group and within-group differences. See for a complete summary of statistics from all the tests.
Overall, the main effect of time was highly significant and had moderate to large effect sizes for all measures (P<.05), except engagement (P=.26). Time×group interactions were not significant for any outcome measures (all P≥.05); however, time×group was borderline significant for DERS-SF (P=.06). This result was driven by the GUIDE+incentives group, which showed a significant reduction in DERS-SF (ie, emotion dysregulation) over time with a large effect size (P=.003), while the GUIDE-only (P=.71) and waitlist control (P=.29) groups did not show a reduction.
In post hoc comparisons by group, the GUIDE+incentives group showed greater improvement than the waitlist control group in 93% of measures (14/15), though none of these differences were significant (all P≥.05). The GUIDE-only group showed greater improvement than the waitlist control group in 87% of measures (13/15), which were significant for wellness (PERMA Overall: P=.048), emotional well-being (PERMA Positive Emotion: P=.03; PERMA Happiness: P=.04), and mental health (PHQ-8 for depression: P=.04; GAD-7 for anxiety: P=.048). While differences between the GUIDE+incentives and GUIDE-only groups were not significant in any between-group pairwise comparisons (all P≥.05), overall, the GUIDE-only group outperformed the GUIDE+incentives group in 67% of measures (10/15) (particularly emotional well-being and mental health).
| Outcome measure | ANOVA effects | Pairwise comparisons (group×time) | Pairwise group comparisons of pre-post deltas | ||||||||||||||||||||||||||||
| Time | Time×group | GUIDE+incentives | GUIDE-only | Waitlist | GUIDE+incentives vs waitlist | GUIDE-only vs waitlist | GUIDE+incentives vs GUIDE-only | ||||||||||||||||||||||||
| ηp2a | ηp2a | ηp2a | MDb | ηp2a | MD | ηp2a | MD | dc | MD | dc | MD | dc | MD | ||||||||||||||||||
| Wellness | |||||||||||||||||||||||||||||||
| PERMA Overalld | 0.07e | 0.02 | 0.07e | 0.38e | 0.02 | 0.19 | 0.00 | 0.09 | 0.12 | 0.35 | 0.31f | 0.91f | –0.19 | –0.56 | |||||||||||||||||
| PWSg,h,i | 0.16j | 0.02 | 0.04k | 5.12k | 0.00 | 1.57 | 0.00 | –0.10 | 0.17 | 2.61 | 0.05 | 0.83 | 0.11 | 0.09 | |||||||||||||||||
| WHO-5h,l | 0.08e | 0.02 | 0.03k | 4.24k | 0.07e | 6.51e | 0.00 | 1.47 | 0.08 | 3.08 | 0.27k | 10.25k | –0.19 | –0.15 | |||||||||||||||||
| PERMA Healthi,m | 0.16j | 0.04 | 0.05f | 0.41f | 0.07e | 0.47e | 0.00 | 0.00 | 0.20 | 0.20 | 0.22 | 0.23 | –0.03 | –0.03 | |||||||||||||||||
| Emotional well-being | |||||||||||||||||||||||||||||||
| PERMA Positive Emotion | 0.10j | 0.02 | 0.06e | 0.48e | 0.05f | 0.43f | 0.01 | 0.14 | 0.15 | 0.51 | 0.31f | 1.09f | –0.17 | –0.13 | |||||||||||||||||
| PERMA Happiness | 0.04f | 0.12 | 0.01 | 0.24 | 0.06f | 0.53f | 0.00 | –0.05 | 0.16 | 0.55 | 0.30f | 1.06f | –0.14 | –0.51 | |||||||||||||||||
| PERMA Negative Emotion | 0.15j | 0.01 | 0.10e | –0.76e | 0.03k | –0.42k | 0.04f | –0.46f | 0.04 | 0.15 | –0.15 | –0.51 | 0.19 | 0.66 | |||||||||||||||||
| ln_DERS-SFi,n,o | 0.11j | 0.05k | 0.08e | –0.10e | 0.00 | 0.01 | 0.01 | –0.03 | –0.17 | –0.04 | 0.06 | 0.01 | –0.23 | –0.05 | |||||||||||||||||
| Mental health | |||||||||||||||||||||||||||||||
| ln_PHQ-8p,q | 0.14j | 0.01 | 0.08e | –0.29e | 0.04f | –0.19f | 0.04k | –0.17k | –0.05 | –0.07 | –0.30f | –0.44f | 0.25 | 0.37 | |||||||||||||||||
| ln_GAD-7i,r,s | 0.12j | 0.02 | 0.02 | –0.06 | 0.04f | –0.07f | 0.00 | 0.00 | –0.23 | –0.06 | –0.29f | –0.07f | 0.06 | 0.01 | |||||||||||||||||
| Personal growth | |||||||||||||||||||||||||||||||
| PERMA Accomplishmenti | 0.16j | 0.01 | 0.03 | 0.35 | 0.02 | 0.24 | 0.01 | 0.13 | 0.10 | 0.11 | 0.05 | 0.06 | 0.05 | 0.04 | |||||||||||||||||
| PERMA Meaningi | 0.08e | 0.02 | 0.04f | 0.37f | 0.00 | 0.09 | 0.00 | –0.03 | 0.16 | 0.20 | 0.06 | 0.06 | 0.13 | 0.10 | |||||||||||||||||
| PERMA Engagement | 0.01 | 0.01 | 0.05 | 0.16 | 0.13 | 0.25 | 0.00 | 0.00 | 0.00 | 0.01 | –0.04 | –0.11 | –0.04 | –0.11 | |||||||||||||||||
| Social connectedness | |||||||||||||||||||||||||||||||
| PERMA Relationshipsi | 0.13j | 0.04 | 0.04f | 0.41f | 0.04f | 0.39f | 0.00 | –0.08 | 0.22 | 0.25 | 0.20 | 0.24 | 0.01 | 0.01 | |||||||||||||||||
| PERMA Lonelinessi | 0.07e | 0.03 | 0.02 | –0.49 | 0.07e | –0.97e | 0.00 | –0.10 | –0.09 | –0.19 | –0.20 | –0.43 | 0.11 | 0.24 | |||||||||||||||||
aInterpretation of ηp2: 0.01=small effect, 0.06=moderate effect, 0.11=large effect [,].
bMD: mean difference; based on adjusted means.
cInterpretation of Cohen d: 0.2=small effect, 0.5=moderate effect, 0.8=large effect [,].
dPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
eP<.01 (Bonferroni adjustment for multiple comparisons).
fP<.05 (Bonferroni adjustment for multiple comparisons).
gPWS: Personal Well-being Score.
hPWS and WHO-5 assess well-being.
iControlled for baseline.
jP<.001 (Bonferroni adjustment for multiple comparisons).
kP=.05-.10 (Bonferroni adjustment for multiple comparisons).
lWHO-5: World Health Organization 5-item Well-being Index.
mPERMA Health is a measure of perceived health.
nDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
oDERS-SF assesses emotion dysregulation.
pPHQ-8: Patient Health Questionnaire-8.
qPHQ-8 assesses depression symptoms.
rGAD-7: Generalized Anxiety Disorder-7.
sGAD-7 assesses anxiety symptoms.
Wellness
For our primary outcome variable, PERMA Overall, the main effect of time was significant and had a moderate effect size (F1,104=7.78; P=.006; ηp2=0.07). The time×group interaction was not significant (F2,104=1.15; P=.32; ηp2=0.02). Post hoc pairwise comparisons revealed that the GUIDE-only group experienced a significantly greater improvement in PERMA Overall when compared to the waitlist control group (raw mean=0.11; P=.03; d=0.31), a difference that translates to a 1% greater percentile point increase in the raw PERMA Overall score or a 9.1% greater increase when controlling for time. Raw scores for all 3 groups fell in the moderate well-being range for all time points (A).
For the PWS, the main effect of time was significant and had a large effect size (F1,103=19.22; P<.001; ηp2=0.16) when controlling for the baseline score. The time×baseline interaction effect was also significant and had a large effect size (F2,103=17.19; P<.001; ηp2=0.14). The time×group interaction was not significant (P=.35). See B for raw scores by treatment group over time.
For the WHO-5, the main effect of time was significant and had a moderate effect size (F1,104=9.09; P=.003; ηp2=0.08). The time×group interaction was not significant (P=.31). In post hoc pairwise comparisons, there was a borderline significant mean difference (MD) between the GUIDE-only and waitlist control groups, with the GUIDE-only group on average improving more (raw MD=5.03; P=.07; d=0.27), a difference that translates to a 5.04% greater percentile point increase in the raw score or a 10.25% greater increase when controlling for time. The raw mean scores were in the moderate well-being range for the treatment groups across time (C).
For perceived health, the main effect of time was significant and had a large effect size (F1,103=19.48; P<.001; ηp2=0.16). The time×baseline interaction was significant and had a large effect size (F2,103=14.03; P<.001; ηp2=0.12). The time×group interaction was not significant (P=.12). In post hoc pairwise comparisons, there were no significant differences (all P≥.05). See D for raw mean scores by group over time.

Emotional Well-Being
For PERMA Positive Emotion, the main effect of time was significant and had a moderate effect size (F1,104=11.88; P<.001; ηp2=0.10). The time×group interaction was not significant (P=.33). In post hoc pairwise comparisons, the GUIDE-only group showed a significant improvement compared to the waitlist control group (raw MD=0.29; P=.03; d=0.31), translating to a 10.9% increase when controlling for time. See A for raw mean scores by group over time.
For PERMA Happiness, the main effect of time was significant and had a small effect size (F1,104=4.14; P=.04; ηp2=0.04). The time×group interaction was not significant (P=.12). In post hoc pairwise comparisons, the GUIDE-only group showed significant improvement in this item relative to the waitlist control group (raw MD=1.24; P=.04; d=0.30), translating to a 12.4% greater percentile point improvement or a 10.6% greater improvement when controlling for time. See B for raw scores by group over time.
For PERMA Negative Emotion, the main effect of time was significant and had a large effect size (F1,104=18.02; P<.001; ηp2=0.16). The time×group interaction was not significant (P=.48). All 3 treatment groups showed a decrease in negative emotion over time (C).
For emotion dysregulation (DERS-SF), the main effect of time was significant and had a moderate effect size (F1,103=13.14; P<.001; ηp2=0.11) when controlling for baseline. The time×baseline interaction was also significant and had a large effect size (F2,103=20.31; P<.001; ηp2=0.17). The time×group interaction was borderline significant and had a moderate effect size (F2,103=2.88; P=.06; ηp2=0.05). In group×time post hoc pairwise comparisons, only the GUIDE+incentives group showed a significant reduction in emotion dysregulation over time, with a moderate effect size (raw MD=–4.56; P<.001; ηp2=0.09), indicating that a 6.33% greater percentile point reduction in raw emotion dysregulation for this group drove the borderline significant time×group interaction effect. Post hoc pairwise comparisons were not significant (all P≥.05). See D for raw scores by treatment group over time.

Mental Health
For depression symptoms (PHQ-8), the main effect of time was significant and had a large effect size (F1,104=16.99; P<.001; ηp2=0.14). The time×group interaction was not significant (P=.64). In post hoc pairwise comparisons, the GUIDE-only group showed a significantly greater reduction in depression symptoms compared to the waitlist control group (raw mean=–1.34; P=.04; d=–0.30), translating to a 5.58% greater percentile point reduction in the raw depression score for the GUIDE-only group. As shown in A, the average depression score for all groups fell below clinical levels for both time periods; however, a substantial portion of our participants did meet the clinical criteria: 29.0% (31/107) at baseline and 16.8% (18/107) at posttrial. A majority of participants (68/115, 59.1%) had symptoms of subclinical or clinical depression (≥5) at baseline, while nearly half (53/107, 49.5%) had these symptoms at posttrial. See Figure S2 in for participant depression severity across time.
For anxiety symptoms (GAD-7), the main effect of time was significant and had a large effect size when controlling for baseline (F1,103=14.33; P<.001; ηp2=0.12). The time×baseline interaction was also significant and had a larger effect size (F2,103=23.92; P<.001; ηp2=0.19). The time×group interaction was not significant (P=.29). In post hoc pairwise comparisons, the GUIDE-only group showed a significant relative reduction in anxiety compared to the waitlist control group (raw MD=–0.5; P=.05; d=–0.29). Mean raw anxiety scores exceeded the clinical cutoff for all 3 groups over time (B), indicating a high level of anxiety present in our sample. A majority of participants met the criteria for clinical anxiety at both baseline (82/115, 71.3%) and posttrial (70/107, 65.4%), and 100% of participants met the criteria for mild anxiety at both time periods. See Figure S3 of for participant anxiety level across time.

Social Connectedness
For PERMA Relationships, the main effect of time was significant and had a large effect size when controlling for baseline (F1,103=14.86; P<.001; ηp2=0.13). The time×baseline interaction was also significant and had a moderate effect size (F1,103=11.40; P=.001; ηp2=0.10). The time×group interaction was not significant (P=.12). Post hoc pairwise comparisons were not significant (all P≥.05). See A for raw mean scores by treatment group over time.
For PERMA Loneliness, the main effect of time was significant and had a moderate effect size (F1,103=7.55; P=.007; ηp2=0.07). The time×baseline interaction was significant and had a large effect size (F1,103=26.78; P<.001; ηp2=0.21). The time×group interaction was not significant (P=.22). See B for raw mean scores by treatment group over time.

Personal Accomplishment
For PERMA Accomplishment, the main effect of time was significant and had a large effect size when controlling for baseline (F1,103=19.16; P<.001; ηp2=0.16). The time×baseline interaction was also significant and had a large effect size (F2,103=16.00; P<.001; ηp2=0.13). The time×group interaction was not significant (P=.69). Post hoc pairwise comparisons were not significant (all P≥.05). See A for raw mean scores by treatment group over time.
For PERMA Meaning, the main effect of time was significant and had a moderate effect size when controlling for baseline (F1,103=9.33; P=.003; ηp2=0.08). The time×baseline interaction was significant and had a moderate effect size (F2,103=7.86; P=.006; ηp2=0.07). The time×group interaction was not significant (P=.29). Post hoc pairwise comparisons were not significant (all P≥.05). See B for raw mean scores by treatment group over time.
For PERMA Engagement, neither time (P=.26) nor time×group (P=.68) was significant. Mean scores for all 3 groups across both time periods fell near the suboptimal cutoff (C).

Subgroup Analyses
First Responders
We explored outcomes for active first responders (n=93). See Table S2 in for a breakdown by responder type. We reran our repeated measures ANOVAs for select outcome measures using data from active first responders only. We excluded “filler” items from the PERMA-Profiler (ie, Negative Emotion, Happiness, Health, and Loneliness), as well as measures without significant differences in our main analysis (ie, Engagement). See for a summary of our findings.
Time was significant and had moderate to large effect sizes for all outcome measures (P<.05), except PERMA Overall which was borderline significant and showed a small effect size (P=.08). There were no significant time×group interactions; however, 2 interactions were borderline significant and had moderate effect sizes: emotion dysregulation (DERS-SF; F2,82=2.35; P=.10; ηp2=0.05) and depression symptoms (PHQ-8; F2,82=2.68; P=.07; ηp2=0.06). In group×time pairwise comparisons for these outcome measures, the GUIDE+incentives group showed the most significant improvement, with a 4.63% reduction in emotion dysregulation (P=.03) and a 9.00% reduction in depression symptoms (P=.001) based on adjusted means and showing moderate and large effect sizes. Depression symptoms decreased by 5.13% for the waitlist control group (P=.03), but the effect sizes and MDs were half as large as those in the GUIDE+incentives group.
In pairwise comparisons by group, the GUIDE+incentives group performed better than the waitlist control group for all outcomes, but the MDs between these groups were not significant (all P≥.05). The GUIDE-only group improved more than the waitlist control group across wellness, emotional well-being, mental health, and social connectedness measures; however, the difference between groups was only significant for wellness (PERMA Overall: P=.03) and borderline significant for emotional well-being (PERMA Positive Emotion: P=.08). The GUIDE-only group did not improve significantly over the waitlist control group on any personal growth measure (all P≥.05). When comparing the GUIDE+incentives and GUIDE-only groups, the results were mixed, indicating that half of our outcomes improved more for participants who received financial incentives, while the other half improved more for participants who did not receive the incentives, although none of the differences between the 2 interventions were statistically significant (all P≥.05). The GUIDE+incentives group showed greater improvement in personal growth measures, while the GUIDE-only group showed greater improvement in wellness and emotional well-being measures.
During our intake meetings, many participants disclosed that they were leaders in their organization or involved in wellness for their community. We did not include an item to capture this in our demographic survey, so we selected participants with less education (college or less, n=63) and fewer years of employment (<15 years, n=66) as proxies for less leadership or wellness experience in a sensitivity analysis. See Tables S3 and S4 in for our findings.
While these sensitivity analyses were not powered, pairwise comparisons of less experienced first responders suggested that GUIDE with financial incentives may have led to reductions in emotion dysregulation (P=.009), depression (P=.002), and anxiety (P=.04). However, these differences were not significant when comparing group pre-post deltas (all P≥.05). Among first responders with less education, the GUIDE-only group showed borderline significant improvements compared to the waitlist control group for wellness (P=.06) and emotional well-being (P=.08) in pairwise group comparisons.
| Outcome measure | ANOVA effects | Pairwise comparisons (group×time) | Pairwise group comparisons of pre-post deltas | ||||||||||||||||||||||||||||||||
| Time | Time×group | GUIDE+incentives | GUIDE-only | Waitlist | GUIDE+incentives vs waitlist | GUIDE-only vs waitlist | GUIDE+incentives vs GUIDE-only | ||||||||||||||||||||||||||||
| ηp2a | ηp2a | ηp2a | MDb | ηp2a | MD | ηp2a | MD | dc | MD | dc | MD | dc | MD | ||||||||||||||||||||||
| Wellness | |||||||||||||||||||||||||||||||||||
| PERMA Overalld | 0.04e | 0.01 | 0.04 | 0.28 | 0.00 | 0.06 | 0.01 | 0.12 | 0.17 | 0.48 | 0.33f | 0.94f | –0.16 | –0.46 | |||||||||||||||||||||
| PWSg,h,i | 0.14j | 0.04 | 0.05f | 6.36f | 0.00 | 0.20 | 0.00 | –0.41 | 0.22 | 3.39 | 0.02 | 0.30 | 0.21 | 3.08 | |||||||||||||||||||||
| WHO-5h,k | 0.06f | 0.03 | 0.03 | 4.32 | 0.05f | 5.54f | 0.00 | 0.34 | 0.13 | 4.71 | 0.25 | 9.34 | –0.13 | –4.63 | |||||||||||||||||||||
| Emotional well-being | |||||||||||||||||||||||||||||||||||
| PERMA Positive Emotion | 0.11l | 0.02 | 0.08f | 0.53f | 0.02 | 0.26 | 0.02 | 0.18 | 0.18 | 0.61 | 0.29e | 0.97e | –0.11 | –0.36 | |||||||||||||||||||||
| DERS-SFi,m,n | 0.08l | 0.05e | 0.06f | –3.33f | 0.01 | 1.18 | 0.00 | –0.47 | –0.19 | –1.43 | 0.10 | 0.82 | –0.30 | –2.26 | |||||||||||||||||||||
| Mental health | |||||||||||||||||||||||||||||||||||
| PHQ-8o,p | 0.12l | 0.06e | 0.12f | –2.16l | 0.00 | –0.08 | 0.06f | –1.23f | –0.13 | –1.05 | –0.33 | –2.65 | 0.20 | 1.60 | |||||||||||||||||||||
| GAD-7i,q,r | 0.10l | 0.02 | 0.03 | –0.87 | 0.01 | –0.14 | 0.00 | 0.00 | –0.16 | –0.44 | –0.02 | –0.07 | –0.13 | –0.36 | |||||||||||||||||||||
| Personal growth | |||||||||||||||||||||||||||||||||||
| PERMA Accomplishmenti | 0.11l | 0.01 | 0.02 | 0.28 | 0.00 | 0.10 | 0.00 | 0.10 | 0.08 | 0.09 | 0.00 | 0.00 | 0.08 | 0.09 | |||||||||||||||||||||
| PERMA Meaningi | 0.05f | 0.02 | 0.02 | 0.26 | 0.00 | –0.04 | 0.00 | –0.02 | 0.14 | 0.14 | –0.01 | –0.01 | 0.15 | 0.15 | |||||||||||||||||||||
| Social connectedness | |||||||||||||||||||||||||||||||||||
| PERMA Relationshipsi | 0.08l | 0.01 | 0.01 | 0.23 | 0.03 | 0.31 | 0.03 | 0.03 | 0.09 | 0.10 | 0.12 | 0.14 | –0.04 | –0.04 | |||||||||||||||||||||
aInterpretation of ηp2: 0.01=small effect, 0.06=moderate effect, 0.11=large effect [,].
bMD: mean difference; based on adjusted means.
cInterpretation of Cohen d: 0.2=small effect, 0.5=moderate effect, 0.8=large effect [,].
dPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
eP=.05-.1 (Bonferroni adjustment for multiple comparisons).
fP<.05 (Bonferroni adjustment for multiple comparisons).
gPWS: Personal Well-being Score.
hPWS and WHO-5 assess well-being.
iControlled for baseline.
jP<.001 (Bonferroni adjustment for multiple comparisons).
kWHO-5: World Health Organization 5-item Well-being Index.
lP<.01 (Bonferroni adjustment for multiple comparisons).
mDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
nDERS-SF assesses emotion dysregulation.
oPHQ-8: Patient Health Questionnaire-8.
pPHQ-8 assesses depression symptoms.
qGAD-7: Generalized Anxiety Disorder-7.
rGAD-7 assesses anxiety symptoms.
Military Affiliation
To explore outcomes for participants with military affiliation, we conducted repeated measures ANOVAs for veterans and active-duty military personnel (n=45). We excluded “filler” items from the PERMA-Profiler, as well as the PERMA Engagement subscale, as in the responder analysis. The findings are presented in .
The main effect of time was significant for all wellness (P<.01), emotional well-being (P<.05), mental health (P<.001), social connectedness (P<.001), and personal growth (P<.05) measures, except for PERMA Overall, which was borderline significant (P=.10).
Unlike our main analysis, 2 measures had significant time×group interactions and showed large effect sizes: GAD-7 for anxiety (F2,40=3.85; P=.03; ηp2=0.16) and PERMA Relationships (F2,40=3.85; P=.03; ηp2=0.16).
In post hoc group×time pairwise comparisons of depression symptoms, the GUIDE+incentives group showed a significant improvement over time, with a large effect size (raw MD=–1.44; P=.001; ηp2=0.25), translating to a 6.86% greater percentile point reduction in the raw anxiety score or a 13.76% greater reduction when controlling for baseline. In post hoc between-group pairwise comparisons, the GUIDE-only group showed a significantly greater reduction in anxiety symptoms compared to the waitlist control group (raw MD=–1.69; P=.03; d=–0.31), a difference that translates to an 8.05% greater percentile point reduction in the raw anxiety score or a 7.62% greater reduction when controlling for baseline.
For PERMA Relationships, post hoc group×time pairwise comparisons showed that both the GUIDE+incentives group (raw MD=0.81, SD 0.76; P=.02) and GUIDE-only group (raw MD=0.33; P=.05; ηp2=0.13) showed significant improvements over time, with large and moderate effect sizes, respectively, while the waitlist control group (raw MD=–0.22; P=.02; ηp2=0.09) did not show such an improvement. In post hoc between-group pairwise comparisons, both the GUIDE+incentives (P=.04) and GUIDE-only (P=.08) groups showed significantly greater improvements on this measure when compared to the waitlist control group. This indicates that social connectedness improved for both groups with GUIDE access but did not improve in the waitlist control group.
Beyond anxiety and social connectedness, the GUIDE-only group showed significant improvements over the waitlist control group in wellness (WHO-5; P=.049), emotional well-being (PERMA Positive Emotion; P=.02), and mental health (PHQ-8 for depression; P=.051), although the effect sizes were small. Wellness, as measured by PERMA Overall, was borderline significant in the GUIDE-only group compared with the waitlist control group (P=.09). The GUIDE-only group also had significantly reduced depression symptoms compared to the GUIDE+incentives group (P=.047) and borderline significant relative improvement in well-being (WHO-5; P=.07). The GUIDE+incentives group showed a significant improvement over the waitlist control group for social connectedness (P=.04). Together, these results suggest that GUIDE alone may perform better without financial incentives for military personnel and veterans.
In our analysis of participants without military affiliation (ie, first responders only), none of the time×group interactions were significant (all P≥.05). Details are provided in Table S5 in .
| Outcome measure | ANOVA effects | Pairwise comparisons (group×time) | Pairwise group comparisons of pre-post deltas | ||||||||||||||||||||||||||||||
| Time | Time×group | GUIDE+incentives | GUIDE-only | Waitlist | GUIDE+incentives vs waitlist | GUIDE-only vs waitlist | GUIDE+incentives vs GUIDE-only | ||||||||||||||||||||||||||
| ηp2a | ηp2a | ηp2a | MDb | ηp2a | MD | ηp2a | MD | dc | MD | dc | MD | dc | MD | ||||||||||||||||||||
| Wellness | |||||||||||||||||||||||||||||||||
| PERMA Overalld | 0.06e | 0.11e | 0.13f | 0.52f | 0.05 | 0.29 | 0.02 | –0.19 | 0.08 | 0.40 | 0.27e | 1.32e | 0.20 | –0.93 | |||||||||||||||||||
| PWSg,h,i | 0.16j | 0.02 | 0.02 | 3.02 | 0.03 | 5.04 | 0.00 | –0.33 | 0.07 | 1.68 | 0.10 | 2.68 | –0.04 | –1.01 | |||||||||||||||||||
| WHO-5h,k | 0.19j | 0.02 | 0.07e | 6.00e | 0.14f | 8.75f | 0.03 | 4.33 | 0.04 | 2.42 | 0.29f | 18.29f | –0.28e | –15.88e | |||||||||||||||||||
| Emotional well-being | |||||||||||||||||||||||||||||||||
| PERMA Positive Emotion | 0.11f | 0.07 | 0.10f | 0.63f | 0.12f | 0.56f | 0.00 | –0.06 | 0.13 | 0.74 | 0.33f | 1.92f | 0.22 | –1.18 | |||||||||||||||||||
| DERS-SFi,l,m | 0.29n | 0.08 | 0.24j | –5.17j | 0.06 | –2.33 | 0.01 | –1.33 | –0.20 | –1.92 | –0.05 | –0.50 | –0.16 | –1.42 | |||||||||||||||||||
| Mental health | |||||||||||||||||||||||||||||||||
| PHQ-8o,p | 0.30n | 0.04 | 0.24j | –3.00j | 0.10f | –1.81f | 0.06 | –1.58 | –0.02 | –0.27 | 0.29f | –4.37f | 0.30f | 4.09f | |||||||||||||||||||
| GAD-7i,q,r | 0.26n | 0.16f | 0.02 | –0.60 | 0.25j | –2.89j | 0.00 | 0.32 | –0.09 | –0.46 | 0.32f | –1.60f | 0.23 | 1.14 | |||||||||||||||||||
| Personal growth | |||||||||||||||||||||||||||||||||
| PERMA Accomplishmenti | 0.19j | 0.06 | 0.03 | 0.30 | 0.03 | 0.33 | 0.02 | –0.25 | 0.16 | 0.28 | 0.16 | 0.29 | –0.01 | –0.02 | |||||||||||||||||||
| PERMA Meaningi | 0.15f | 0.07 | 0.04 | 0.38 | 0.01 | 0.15 | 0.03 | –0.40 | 0.20 | 0.39 | 0.13 | 0.27 | 0.06 | 0.12 | |||||||||||||||||||
| Social connectedness | |||||||||||||||||||||||||||||||||
| PERMA Relationshipi | 0.31n | 0.16f | 0.13f | 0.72f | 0.09f | 0.61f | 0.04 | –0.46 | 0.31f | 0.60f | 0.27f | 0.54f | 0.03 | 0.05 | |||||||||||||||||||
aInterpretation of ηp2: 0.01=small effect, 0.06=moderate effect, 0.11=large effect [,].
bMD: mean difference; based on adjusted means.
cInterpretation of Cohen d: 0.2=small effect, 0.5=moderate effect, 0.8=large effect [,].
dPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
eP=.05-.1 (Bonferroni adjustment for multiple comparisons).
fP<.05 (Bonferroni adjustment for multiple comparisons).
gPWS: Personal Well-being Score.
hPWS and WHO-5 assess well-being.
iControlled for baseline.
jP<.01 (Bonferroni adjustment for multiple comparisons).
kWHO-5: World Health Organization 5-item Well-being Index.
lDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
mDERS-SF assesses emotion dysregulation.
nP<.001 (Bonferroni adjustment for multiple comparisons).
oPHQ-8: Patient Health Questionnaire-8.
pPHQ-8 assesses depression symptoms.
qGAD-7: Generalized Anxiety Disorder-7.
rGAD-7 assesses anxiety symptoms.
Sex Differences
To explore how sex influenced participant outcomes, we conducted exploratory analyses of select outcome measures by rerunning our repeated measures ANOVAs for male (n=76) and female (n=31) participants, since previous research has shown significant variation between sexes on self-report measures related to health and mental health []. The results for male participants were powered, and the findings are reported in .
The main effect of time was significant across all wellness (P<.05), emotional well-being (P<.05), mental health (P<.001), personal growth (P<.01), and social connectedness (P<.001) measures included in the analysis. For PERMA Relationships, the time×group interaction was significant (P=.04) and had a moderate effect size when controlling for baseline. In post hoc group×time pairwise comparisons, only the GUIDE+incentives group showed a significant improvement (P=.005). Post hoc pairwise comparisons by group corroborated these findings and indicated that the GUIDE+incentives group performed significantly better than the waitlist control group on this measure (P=.03). Altogether, these findings indicate that male participants may feel more socially connected when using GUIDE with financial incentives, rather than without.
For emotion dysregulation (DERS-SF), the time×group interaction was borderline significant (P=.10). In post hoc group×time pairwise comparisons, only the GUIDE+incentives group showed a significant reduction in emotion dysregulation over time when controlling for baseline (P=.006), although in post hoc pairwise comparisons by group, there were no significant differences (all P≥.05). GUIDE with financial incentives may improve emotion regulation for male users, but these findings are not as strong as for PERMA Relationships.
The results for females are reported in Table S6 in . The main effect of time was only significant for the included emotional well-being (P<.01) and mental health (P<.05) measures. None of the time×group interactions were significant (all P≥.05), but unlike the male sample, these analyses were not powered and thus should be interpreted cautiously. In post hoc pairwise comparisons by group, we found that the GUIDE-only group showed a significant reduction in depression symptoms (PHQ-8) relative to the waitlist control group (P=.02), indicating that GUIDE usage without financial incentives may have helped reduce these symptoms relative to the control group. There was also a borderline significant difference between the GUIDE-only and waitlist control groups for PERMA Positive Emotion in post hoc pairwise comparisons (P=.08).
In summary, the effect of GUIDE and financial incentives on wellness, emotional well-being, and mental health may differ by sex. Male participants benefited more from the GUIDE+incentives condition, while female participants benefited from the GUIDE-only condition, indicating that males may be more motivated by financial incentives than females.
| Outcome measure | ANOVA effects | Pairwise comparisons (group×time) | Pairwise group comparisons of pre-post deltas | |||||||||||||||||||||||||||||||||||
| Time | Time×group | GUIDE+incentives | GUIDE-only | Waitlist | GUIDE+incentives vs waitlist | GUIDE-only vs waitlist | GUIDE+incentives vs GUIDE-only | |||||||||||||||||||||||||||||||
| ηp2a | ηp2a | ηp2a | MDb | ηp2a | MD | ηp2a | MD | dc | MD | dc | MD | dc | MD | |||||||||||||||||||||||||
| Wellness | ||||||||||||||||||||||||||||||||||||||
| PERMA Overalld | 0.08e | 0.05 | 0.10e | 0.51e | 0.02 | 0.19 | 0.00 | 0.04 | 0.11 | 0.37 | 0.18 | 0.63 | –0.07 | –0.26 | ||||||||||||||||||||||||
| PWSf,g,h | 0.20i | 0.03 | 0.32 | 4.82 | 0.07 | 1.26 | 0.07 | –1.41 | 0.17 | 3.10 | 0.07 | 1.34 | 0.10 | 1.78 | ||||||||||||||||||||||||
| WHO-5j | 0.12k | 0.03 | 0.08e | 6.78e | 0.07e | 5.93e | 0.01 | 1.85 | 0.06 | 2.76 | 0.16 | 7.04 | –0.09 | –4.28 | ||||||||||||||||||||||||
| Emotional well-being | ||||||||||||||||||||||||||||||||||||||
| PERMA Positive Emotion | 0.07e | 0.05 | 0.08e | 0.52e | 0.04l | 0.33l | 0.00 | –0.03 | 0.16 | 0.70 | 0.04 | 0.16 | –0.04 | –0.16 | ||||||||||||||||||||||||
| DERS-SFh,m,n | 0.15k | 0.06l | 0.10k | –4.45k | 0.00 | 0.23 | 0.01 | –1.14 | –0.18 | –1.65 | 0.08 | 0.69 | –0.26 | –2.34 | ||||||||||||||||||||||||
| Mental health | ||||||||||||||||||||||||||||||||||||||
| PHQ-8o,p | 0.18i | 0.04 | 0.14k | –2.70k | 0.03 | –0.96 | 0.05 | –1.35 | –0.25 | –3.27 | –0.34e | –5.02e | 0.12 | 1.76 | ||||||||||||||||||||||||
| GAD-7h,q,r | 0.19i | 0.03 | 0.02 | –0.82 | 0.05e | –1.18e | 0.00 | 0.02 | –0.11 | –0.42 | –0.17 | –0.60 | 0.05 | 0.18 | ||||||||||||||||||||||||
| Personal growth | ||||||||||||||||||||||||||||||||||||||
| PERMA Accomplishmenth | 0.17i | 0.02 | 0.04 | 0.41 | 0.02 | 0.23 | 0.00 | 0.08 | 0.12 | 0.17 | 0.06 | 0.08 | 0.07 | 0.09 | ||||||||||||||||||||||||
| PERMA Meaningh | 0.09k | 0.06 | 0.07e | 0.58e | 0.00 | 0.11 | 0.00 | –0.12 | 0.25 | 0.34 | 0.16 | 0.12 | 0.09 | 0.22 | ||||||||||||||||||||||||
| Social connectedness | ||||||||||||||||||||||||||||||||||||||
| PERMA Relationshipsh | 0.18i | 0.09e | 0.10k | 0.70k | 0.02 | 0.28 | 0.00 | –0.17 | 0.30e | 0.44e | 0.23 | 0.23 | 0.15 | 0.21 | ||||||||||||||||||||||||
aInterpretation of ηp2: 0.01=small effect, 0.06=moderate effect, 0.11=large effect [,].
bMD: mean difference; based on adjusted means.
cInterpretation of Cohen d: 0.2=small effect, 0.5=moderate effect, 0.8=large effect [,].
dPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
eP<.05 (Bonferroni adjustment for multiple comparisons).
fPWS: Personal Well-being Score.
gPWS and WHO-5 assess well-being.
hControlled for baseline.
iP<.001 (Bonferroni adjustment for multiple comparisons).
jWHO-5: World Health Organization 5-item Well-being Index.
kP<.01 (Bonferroni adjustment for multiple comparisons).
lP=.05-.10 (Bonferroni adjustment for multiple comparisons).
mDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
nDERS-SF assesses emotion dysregulation.
oPHQ-8: Patient Health Questionnaire-8.
pPHQ-8 assesses depression symptoms.
qGAD-7: Generalized Anxiety Disorder-7.
rGAD-7 assesses anxiety symptoms.
Secondary Outcomes
App Engagement
App Engagement by Minimum Usage and Recommended Benchmarks
Among the participants allocated to the GUIDE+incentives and GUIDE-only groups, 97% (74/76) were included in the app engagement analysis. Excluded participants experienced technical difficulties and were unable to use the app (and thus were considered lost to follow-up). By week 4, 16% (6/38) of participants from the GUIDE+incentives group and 11% (4/36) of participants from the GUIDE-only group had stopped using GUIDE, for a combined attrition rate of 13% (10/76).
To assess for the adherence criterion as recommended by the literature [], we assessed app engagement based on minimum usage standards (≥3 weekly activities) versus recommended usage benchmarks (≥3 weekly lessons, posts, replies, likes, or mood surveys) across all weeks. Overall, 67% (51/76) of participants met the minimum usage standards for all 4 weeks, while only 8% (6/76) met the recommended benchmarks for all 4 weeks. To compare how financial incentives influenced whether a given participant met the minimum versus recommended benchmarks, we conducted an independent samples t test. There was no significant difference (P=.13) in average weeks meeting the minimum usage standards between the GUIDE+incentives group (mean 3.58, SD 0.72) and GUIDE-only group (mean 3.44, SD 1.03), indicating that financial incentives did not impact engagement at this minimum; however, when it came to the number of weeks meeting the more rigorous recommended usage benchmarks, the GUIDE+incentives group completed the recommended usage significantly more often than the GUIDE-only group (mean 1.61, SD 1.46 vs mean 0.44, SD 0.81; t72=4.19; P<.001).
This pattern held in the analysis of weekly app engagement (A, B). There were no significant differences between intervention groups in minimum usage for any week (all P≥.05), but differences between the GUIDE+incentives and GUIDE-only groups were highly significant for the recommended benchmark across week 1 (P<.001), week 2 (P=.001), and week 3 (P=.004) of the trial and borderline significant for week 4 (P=.06), indicating that financial incentives impacted whether participants met recommended benchmarks but that the influence of these incentives decreased over time.

Impact of Treatment Group on App Engagement
To assess whether financial incentives influenced app engagement, t tests were conducted to compare app usage (total and by feature) between the GUIDE+incentives and GUIDE-only groups. The descriptive statistics and t test results are reported in . Participants in the GUIDE+incentives group had significantly higher total app activity compared to the GUIDE-only group (P=.048), driven by more small group posts (P=.04) and replies (P=.001), indicating that financial incentives influenced engagement with the peer support features of GUIDE. There was no significant difference between lessons completed (P=.47), likes (P=.23), or mood surveys (P=.64).
| App metric | GUIDE+incentives group, mean (SD) | GUIDE-only group, mean (SD) | t test (df) | P value |
| Lessons | 27.53 (27.81) | 22.61 (30.47) | 0.73 (72) | .52 |
| Posts | 83.66 (71.58) | 52.92 (48.50) | 2.15 (72) | .02a |
| Replies | 19.05 (18.04) | 7.75 (8.71) | 3.40 (72) | .001b |
| Likes | 27.05 (30.93) | 18.83 (27.01) | 1.22 (72) | .22 |
| Mood surveys | 13.47 (7.86) | 12.50 (9.69) | 0.48 (72) | .16 |
| Total activity | 170.76 (133.12) | 114.61 (104.75) | 2.01 (72) | .04a |
aP<.05.
bP<.01.
Correlations Between App Engagement and Primary Outcome Measures
To explore how app engagement influenced pre-post scores for wellness, emotional well-being, mental health, social connectedness, and personal growth, Pearson correlation coefficients were calculated. The findings are presented in .
Total activity was weakly correlated with improvements in 2 wellness measures (WHO-5 and PWS). In terms of emotional well-being, there was a borderline significant positive correlation between total activity and PERMA Positive Emotion (P=.08). There were no significant correlations between total app activity and any mental health, personal growth, or social connectedness measures (all P≥.05).
The metric lessons completed was positively correlated with 2 wellness measures (WHO-5 and PWS). It was also weakly correlated with an improvement in emotional well-being measures (PERMA Positive Emotion and DERS-SF). For personal growth, more lessons completed was moderately correlated with an improvement in PERMA Accomplishment. Mental health and social connectedness were not correlated with lessons completed.
The metric total posts was moderately and positively correlated with wellness (PWS) and personal growth (PERMA Accomplishment). Additionally, posts were moderately and negatively correlated with emotion dysregulation (DERS-SF).
The metric repliesto peer posts was weakly correlated with an improvement in wellness (PWS). There was a moderate and negative correlation between replies and emotion dysregulation (DERS-SF). In terms of mental health, an increase in replies was weakly correlated with a reduction in depression symptoms (PHQ-8). There was also a weak and borderline significant correlation with a reduction in anxiety symptoms (GAD-7; P=.06). Moreover, there was a weak and borderline significant correlation between replies and personal growth (PERMA Accomplishment; P=.05). Replies were not significantly correlated with changes in social connectedness (P=.24).
The metric likes was significantly and moderately correlated with improvements in wellness (PWS), mental health (PHQ-8 for depression), and personal growth (PERMA Accomplishment). There were also borderline significant and weak correlations between total likes and improvements in well-being (WHO-5; P=.06), emotion dysregulation (DERS-SF; P=.07), and anxiety symptoms (GAD-7; P=.07). Likes to peer posts was not correlated with changes in social connectedness (P=.22).
The metric mood surveys completed was only correlated with an improvement in personal growth (PERMA Accomplishment).
| Pre-post outcome measure | Score difference, mean (SD) | App metric (correlation coefficienta) | ||||||
| Lessons | Posts | Replies | Likes | Mood surveys | Total activity | |||
| Wellness | ||||||||
| PERMA Overallb | 0.21 (0.81) | 0.19 | 0.14 | 0.17 | 0.16 | –0.04 | 0.16 | |
| WHO-5c,d | 4.00 (13.99) | 0.26e | 0.13 | 0.16 | 0.23f | 0.03 | 0.25e | |
| PWSd,g | 2.10 (16.59) | 0.36h | 0.32h | 0.28e | 0.31e | 0.19 | 0.25e | |
| PERMA Healthi | 0.28 (1.05) | 0.07 | 0.06 | 0.06 | 0.09 | –0.05 | 0.07 | |
| Emotional well-being | ||||||||
| PERMA Positive Emotion | 0.34 (1.05) | 0.26e | 0.13 | 0.16 | 0.23 | 0.03 | 0.21f | |
| PERMA Happiness | 0.23 (1.24) | 0.13 | 0.04 | –0.14 | 0.21 | –0.12 | 0.09 | |
| PERMA Negative Emotion | –0.55 (1.34) | –0.23 | –0.33h | –0.23 | –0.21 | –0.17 | –0.35h | |
| DERS-SFj,k | –1.73 (8.60) | –0.26e | –0.39h | –0.30e | 0.22f | –0.19 | –0.14 | |
| Mental health | ||||||||
| PHQ-8l,m | –1.55 (3.54) | –0.19 | –0.22f | –0.27e | –0.35h | 0.09 | –0.12 | |
| GAD-7n,o | –0.65 (3.45) | –0.12 | –0.18 | –0.23f | –0.22f | –0.01 | –0.05 | |
| Social connectedness | ||||||||
| PERMA Relationships | 0.23 (1.18) | 0.11 | 0.09 | 0.14 | 0.15 | –0.04 | 0.11 | |
| PERMA Loneliness | –0.50 (2.29) | 0.05 | 0.01 | –0.09 | –0.17 | –0.04 | –0.03 | |
| Personal growth | ||||||||
| PERMA Meaning | 0.13 (1.12) | 0.18 | 0.09 | 0.19 | 0.15 | –0.10 | 0.12 | |
| PERMA Accomplishment | 0.23 (1.16) | 0.34h | 0.33h | 0.23f | 0.36h | 0.30e | 0.14 | |
| PERMA Engagement | 0.13 (1.22) | –0.03 | 0.00 | –0.08 | –0.10 | –0.04 | –0.01 | |
aCorrelation coefficient: <0.1=negligible correlation, 0.10=weak correlation, 0.40=moderate correlation [].
bPERMA Overall is the total score of human flourishing from the PERMA (Positive Emotion, Engagement, Relationships, Meaning, Accomplishment) model [].
cWHO-5: World Health Organization 5-item Well-being Index.
dPWS and WHO-5 assess well-being.
eP<.05.
fP=.05-.10.
gPWS: Personal Well-being Score.
hP<.01.
iPERMA Health is a measure of perceived health.
jDERS-SF: Difficulties in Emotion Regulation Scale–Short Form.
kDERS-SF assesses emotion dysregulation.
lPHQ-8: Patient Health Questionnaire-8.
mPHQ-8 assesses depression symptoms.
nGAD-7: Generalized Anxiety Disorder-7.
oGAD-7 assesses anxiety symptoms.
Implementation Outcomes and Technical Merit
To assess implementation outcomes and technical merit, we relied on descriptive statistics for acceptability (mean 3.81, SD 0.89; minimum 1, maximum 5), appropriateness (mean 4.01, SD 0.75; minimum 2, maximum 5), feasibility (mean 4.01, SD 0.77; minimum 1, maximum 5), and usability (mean 70.62, SD 18.82; minimum 7.5, maximum 100).
While there are no agreed-upon cutoffs for acceptability, appropriateness, and feasibility, for the purpose of this investigation, we classified a mean score of ≥4 as good, since it indicates that the participant, on average, agreed with each item. With this classification in mind, the appropriateness and feasibility measures were above the cutoff, indicating that GUIDE is well-suited for the population and feasible to implement. The mean acceptability score was 3.81, which was below our threshold, indicating that participants on average rated GUIDE somewhere between the neutral and agree options on a 5-point Likert scale. This likely means that the app is adequately acceptable but that there is room for improvement.
Mean usability was satisfactory or slightly above the average cutoff score of 68, indicating that the app is generally usable; however, like with acceptability, there is room for improvement, as 39% of participants (n=27) rated the app below this cutoff.
Overall, GUIDE received scores above industry standards for appropriateness, feasibility, and usability, suggesting strong performance on these indicators; however, the acceptability score was slightly below the desired threshold, indicating that there may be room for improvement in terms of likability.
Discussion
Population
The prevalence of clinical depression and anxiety symptoms in our sample indicates that we managed to recruit a population in need of an app like GUIDE. In the 2022 National Health Interview Survey, depression (PHQ-8) and anxiety (GAD-7) symptoms met the clinical criteria for 7.5% and 6.7% of the general US population, respectively []. The rates of clinical depression (16.80%-26.96%) and clinical anxiety (65.42%-71.30%) in our sample were substantially higher and even exceeded rates reported in a recent systematic review of the mental health of first responders []. The severity of reported depression and anxiety becomes more apparent when including those with mild or subclinical levels. These figures indicate cause for concern and corroborate the notion that there is a mental health crisis among first responders, veterans, and active-duty military personnel.
Primary Outcomes
Overall, the main effect of time was highly significant and had moderate to large effect sizes for wellness, emotional well-being, mental health, social connectedness, and personal growth primary outcome measures. While this cross-measure improvement over time aligns with the reported pre-post t tests from the quasiexperimental GUIDE pilot study [], our findings indicate that, since time×group interactions were not significant, pre-post t test results may have been greatly influenced by time.
When controlling for time, there was a borderline significant time×group interaction for emotion dysregulation (DERS-SF). This interaction effect was driven by the GUIDE+incentives group, which showed a significant reduction in emotion dysregulation over time with a large effect size, while the GUIDE-only and waitlist control groups did not, indicating that GUIDE plus financial incentives may be most effective as an intervention for improving emotion regulation. This result is encouraging, since emotion regulation plays an important role in ethical decision-making and acts as a protective factor in stressful contexts like those encountered by first responders and military personnel [,].
Across most wellness, emotional well-being, mental health, social connectedness, and personal growth measures, the GUIDE+incentives and GUIDE-only groups showed larger and more consistent improvements over time compared to the waitlist control group. In terms of post hoc pairwise group×time comparisons, active intervention groups showed significant improvement in 53% (8/15) of measures (GUIDE+incentives: PERMA Overall, PERMA Positive Emotion, DERS-SF, PERMA Negative Emotion, PHQ-8 [depression], PERMA Health, PERMA Meaning, and PERMA Relationships; GUIDE-only: WHO-5, PERMA Positive Emotion, PERMA Happiness, PHQ-8 [depression], GAD-7 [anxiety], PERMA Health, PERMA Relationships, and Loneliness), while the waitlist control group only showed significant improvement in a single (1/15, 7%) measure (PERMA Negative Emotion). This ratio suggests that participants in the active intervention groups, on average, improved significantly over time, while participants in the waitlist control group improved at a rate just above random chance (ie, P<.05).
While both the GUIDE+incentives and GUIDE-only groups showed improvement over time in a majority of well-being measures when compared to the waitlist control group, in between-group post hoc pairwise comparisons, only the GUIDE-only group showed statistically significant improvements over the waitlist control group for wellness (PERMA Overall), emotional well-being (PERMA Positive Emotion and PERMA Happiness), and mental health (PHQ-8 depression and GAD-7 anxiety symptoms). The MD for WHO-5 well-being was also near significance. These findings suggest that GUIDE has a positive influence on wellness, emotional well-being, and mental health among “warriors,” without financial incentives. This result is encouraging, since it suggests that financial incentives are not required to see improvements among GUIDE members. This aligns with previous research indicating that financial incentives did not impact mental health or wellness measures over time for a sample of college students using a digital mental health intervention [].
Given the significant improvements the GUIDE-only group showed over the waitlist control group, GUIDE appears to be just as, if not more, effective than interventions included in a recent systematic review of well-being and resilience programs for first responders (including in-person interventions), half of which reported no significant differences within or between treatment groups on outcome measures []. While the percentile point improvements for our sample were modest with small effect sizes, they are still encouraging since these mental health and wellness changes were observable after only 4 weeks and were aligned with other randomized controlled trials testing digital mental health apps considered effective [].
Observable trends and directionality of between-group post hoc pairwise comparisons were also encouraging, even if they were not all statistically significant. The GUIDE+incentives group showed greater improvement than the waitlist control group in 93% (14/15) of measures, although none of the improvements were significant. On the other hand, the GUIDE-only group showed greater improvement than the waitlist control group in 87% (13/15) of measures, and 33% (5/15) were significant. The directionality of these results suggests that GUIDE (with and without financial incentives) can positively influence wellness and mental health for first responders, veterans, and military personnel over 4 weeks, a result that may become more apparent over a longer time horizon, given research indicating how sustained engagement leads to larger improvements for digital health users [].
Subgroup Analyses
Active First Responders
Like in our main analysis, the main effect of time for active first responders was significant and had moderate to large effect sizes for all reported outcome measures, except PERMA Overall, which was borderline significant and had a small effect size. There were no significant time×group interactions; however, the interaction effects for emotion dysregulation and depression symptoms were borderline significant. The GUIDE+incentives group showed the most significant improvement in these measures, with a 4.63% greater percentile point reduction in emotion dysregulation and 9.00% greater percentile point reduction in anxiety symptoms across the 2 time periods based on adjusted means, with moderate and large effect sizes, respectively. Depression symptoms also decreased by 5.13% in the waitlist control group, but the effect sizes and MDs were half as large as those in the GUIDE+incentives group.
In between-group comparisons, the GUIDE+incentives group performed better than the waitlist control group for all outcomes, but the MDs between these groups were not significant. The GUIDE-only group improved more than the waitlist control group across wellness, emotional well-being, mental health, and social connectedness measures; however, the difference between groups was only significant for wellness (PERMA Overall) and borderline significant for emotional well-being (Positive Emotion). Notably, the GUIDE-only group did not improve significantly over the waitlist control group in personal growth measures. When comparing the GUIDE+incentives and GUIDE-only groups, the results were mixed, with half of the outcomes improving more for responders who received financial incentives and the other half improving more without incentives, although the differences were not statistically significant, indicating that financial incentives are not integral to efficacy.
In sensitivity analyses selecting for first responders with less education and experience, group×time pairwise comparisons showed that the GUIDE+incentives group had more significant improvements and larger effect sizes compared to the findings in the analysis of all responders. While these subgroup sensitivity analyses were not powered, they imply that financial incentives may act as a more powerful motivator for first responders with less education and experience, particularly for reducing emotion dysregulation, depression, and anxiety. In terms of wellness and emotional well-being, however, the GUIDE-only approach still performed significantly better, indicating that these outcomes may be less influenced by financial incentives, regardless of experience or education.
Military Affiliation
Veterans and active-duty military personnel who used GUIDE showed a greater improvement in outcome measures over time when compared to participants without military affiliation. These improvements were more significant and had larger effect sizes than those for participants overall, indicating that GUIDE may be more effective in improving wellness, emotional well-being, mental health, and social connectedness for veterans and active-duty military personnel. In between-group post hoc pairwise comparisons, the GUIDE-only group showed significant improvements over time compared to the waitlist control group for wellness (WHO-5), emotional well-being (PERMA Positive Emotion), mental health (GAD-7 anxiety and PHQ-8 depression symptoms), and social connectedness (PERMA Relationships), while the GUIDE+incentives group showed a significant improvement over the waitlist control group for only social connectedness (PERMA Relationships). Of note, the GUIDE-only group showed a significant and borderline significant improvement in PHQ-8 depression symptoms and WHO-5 well-being, respectively, when compared to the GUIDE+incentives group, strengthening the case that GUIDE alone leads to better wellness outcomes than GUIDE with financial incentives for military-affiliated participants. These findings are extremely encouraging and suggest that GUIDE may be just as, if not more, effective than existing mental health-related apps created by the Veterans Affairs (VA) or Department of Defense (DoD) []. While these subgroup results are promising, they are underpowered and must be interpreted with caution. Future research should include a larger number of veterans and military personnel to validate our findings.
Sex Differences
Our analyses by sex revealed that GUIDE may be an especially good fit for males, which is promising since males make up about 75% of first responders [], 82.5% of active-duty military personnel [], and 89% of veterans []. Additionally, males are more likely to commit suicide [] and are less likely to seek help or avoid treatment due to stigma [].
Male participants allocated to the GUIDE+incentives group improved significantly over the waitlist control group in terms of social connectedness (PERMA Relationships) and had the most significant improvements over time for our selected outcome measures (PERMA Overall, WHO-5, PERMA Positive Emotion, DERS-SF [emotion dysregulation], depression [PHQ-8], anxiety [GAD-7], PERMA Meaning, and PERMA Relationships), indicating that they may have responded more to financial incentives than females, a finding that aligns with previous research []. This suggests that incorporating financial incentives may yield more positive outcomes and app engagement for male GUIDE users.
Secondary Outcomes
App Engagement
Overall, 67% (51/76) of participants met our minimum usage standards for all weeks, while only 8% (6/76) of participants met the recommended benchmarks for all weeks. This indicates that the recommended benchmarks may be high for this population, since individuals willing to volunteer for a clinical trial are likely more motivated to persist than average.
The app attrition rate of 13% (n=10) was higher than the trial attrition rate of 6% (n=7), both of which were substantially lower than the weighted meta-analytic attrition rate of 24.7% from a recent study of mindfulness apps, a promising sign about the uptake of GUIDE among first responders and military personnel []. Attrition was higher in the GUIDE intervention groups than in the waitlist control group, which is typical for digital mental health interventions, and was the highest in the GUIDE+incentives group, indicating that app usage and financial incentives may have been considered burdensome by some participants.
Financial incentives significantly increased the average number of posts and replies in the small group chat of GUIDE. This suggests that financial incentives may drive engagement with the peer-support element of GUIDE or that the peer support feature has room to improve on usability or acceptability to keep users engaged. Financial incentives did not lead to significantly more lessons completed, indicating that the education feature keeps users engaged and that they are motivated to complete these lessons with or without the opportunity for compensation.
When assessing how app engagement (overall and by feature) was correlated with outcome measures, lessons, posts, and replies were associated with improvements in wellness and emotional well-being measures. Additionally, all features were associated with an improvement in accomplishment, indicating that participants who engaged with the app more felt more accomplished, regardless of the features used. Replies and likes, indicative of peer-to-peer interaction, were significantly correlated with a reduction in depression symptoms, indicating that these interactions may be beneficial for mental health. This finding is important to keep in mind, since the number of replies per participant was highly dependent on whether they received financial incentives. Figuring out a way to increase engagement and interaction between users may lead to an improvement across these outcomes, especially for mental health measures. Financial incentives may be an effective way to encourage use for some participants.
While financial incentives effectively increased app engagement, particularly in small group chats, they did not lead to corresponding improvements in primary outcomes. This suggests that intrinsic motivation may be essential for first responders, veterans, and military personnel to benefit from an app like GUIDE. According to the self-determination theory, individuals respond differently to financial incentives based on their intrinsic values and beliefs. Because financial materialism is a less reliable motivator than financial altruism, especially in service-oriented professions, gamification strategies that foster intrinsic motivation may be more effective in driving sustained app engagement []. This explanation is supported by our dropout analysis, which found that participants with no prior behavioral health care experience and lower app usage during the trial (ie, those less intrinsically motivated to engage with mental health interventions) were significantly less likely to complete the posttrial assessment, regardless of financial incentives.
Implementation Outcomes and Technical Merit
Appropriateness and feasibility, which represent the suitability of an intervention for a given population and the practicality of implementation in real-world conditions, were good according to our participants. This indicates that GUIDE is a good fit for first responders and military personnel and that it would be reasonable to implement given available resources.
Acceptability, which represents the degree to which an intervention is viewed positively, was satisfactory. High acceptability is important for digital mental health interventions, since it has been identified as a factor that increases both uptake and adherence among individuals with depression and anxiety []. Exploring opportunities to increase acceptability may improve outcomes for GUIDE members.
Usability was above average for GUIDE, but the scores indicate that there is room for improvement in this measure as well. Usability is critical when designing an effective digital mental health intervention, since users are more likely to stop using an app due to usability issues (eg, bugs, poor user interaction design, data loss, battery and memory usage, and lack of explanation or guidance) []. Usability scores for this trial may have been impacted by app outages experienced during the second half of the trial.
Overall, GUIDE performed above average across implementation outcomes and technical merit. Focusing on acceptability and usability can help take this app to the next level, which in turn could help drive adherence, engagement, and, ultimately, further improvement in wellness and mental health outcomes.
Limitations
Our study has several limitations. First, we relied exclusively on subjective self-report data in the trial. Although we considered using objective measures related to work performance to complement our results, we avoided this for the sake of confidentiality so that participants could join the trial without informing their employers. In the future, a study like this could limit recruitment to a specified number of agencies and gather workplace metrics for all employees to avoid a breach of confidentiality. Second, our sample included only those people motivated enough to voluntarily participate in a clinical trial and was limited to those interested in testing a mental health and wellness app, which may not be representative of the larger first responder, veteran, and military populations. Third, several aspects unique to our sample may limit the generalizability and external validity of our findings. Participants showed higher than average anxiety and depression symptoms compared with the findings in other studies of first responders and military personnel, indicating that generalizability to a larger “warrior” population may be difficult. Additionally, participation required a certain amount of digital literacy and accessibility, meaning that these results may not be generalizable to nondigital populations, especially older veterans and first responders. Demographic features, including a mostly male sample, limit the generalizability to underrepresented groups like female first responders. Fourth, we did not include an item about leadership status or rank in our demographic survey, a factor that may have influenced outcomes. Future research should include an item that assesses whether a participant is in a leadership or management position. Fifth, 4 weeks may not have been enough time to see substantial changes in our measures. Research on other digital wellness apps suggests that efficacy improves over time []. Without assessing long-term adherence, the study could not establish whether GUIDE fosters sustainable behavior change. Future studies should assess outcome measures across a longer time horizon and should incorporate a follow-up assessment 3 or 6 months after the intervention. Sixth, we relied on listwise deletions in our primary outcome analysis for participants who did not complete their posttrial assessment, which is not the most robust method of estimation. Research indicates that listwise deletion may lead to an overestimation of treatment effectiveness and bias results. Future research should consider pairwise deletions or imputation methods for assessment. Seventh, this study used a waitlist control design rather than adopting a direct comparator. Since this was the first randomized controlled trial of GUIDE, we chose a nonactive comparison to assess the effects of GUIDE use relative to no intervention. As a result, the outcomes may have been influenced by the expectancy effects of participants anticipating treatment. Additionally, owing to this design choice, the effectiveness of GUIDE cannot be directly compared to similar interventions. Future research should include a comparator to evaluate these effects against a similar mHealth product. Eighth, the participants and research team were not blinded to the assignment. Future iterations of this study should include a comparator, so that at least the participants are not aware of their assigned treatment group. Finally, this study relied on a multiplicity of analyses, which may have increased the likelihood of type I error. We addressed this risk using Bonferroni correction, although this conservative method may have increased the risk of type II error. Future research should use the Benjamini-Hochberg procedure in a sensitivity analysis to assess the effects of type II error.
Conclusions
The findings of this trial indicate that the GUIDE app is a feasible and appropriate intervention with the potential to improve the mental health and well-being of first responders, veterans, and military personnel. Participants in the GUIDE-only group showed significant improvements compared with waitlist controls in wellness (PERMA Overall), emotional well-being (PERMA Positive Emotion), and mental health (PHQ-8 for depression and GAD-7 for anxiety symptoms). Financial incentives increased engagement with the peer-support aspects of GUIDE but did not lead to significant improvements over the waitlist control group in between-group comparisons. More lessons, posts, and replies were correlated with improvements in wellness and emotional well-being. Increased use of any feature was correlated with increased personal growth. Exploratory subgroup analyses suggested that the use of GUIDE as a mental health and wellness intervention may be effective for active first responders, those with military affiliation, and male individuals. Future research should assess the effect of longer-term GUIDE use and whether improvements are sustained in the long term.
Acknowledgments
We would like to acknowledge the incredible staff managing the GUIDE app, who supported the implementation of this trial, including but not limited to Shawn Carey, Andy Roke, and Patrick Sandone. The authors disclose receipt of financial support for the research, authorship, and publication of this article from Nlyten Corp (developer of the GUIDE app).
Data Availability
Deidentified data supporting the findings of this study are available as a supplementary file accompanying this manuscript (). Additional details can be provided upon request.
Authors' Contributions
Conceptualization: HJN
Data curation: MKD
Formal analysis: MKD, HJN
Funding acquisition: HJN
Investigation: MKD
Methodology: HJN
Project administration: MKD
Supervision: HJN
Validation: HJN
Visualization: MKD
Writing – original draft: MKD
Writing – review & editing: HJN, MKD
All authors reviewed and edited the manuscript and approved the final version of the manuscript.
Conflicts of Interest
This study was funded by Nlyten Corp, the developer of the GUIDE app. While Nlyten Corp provided financial support for the research, it had no role in the study design, data collection, analysis, interpretation of data, or manuscript preparation. The authors declare no other conflicts of interest related to this study.
Additional study information.
DOCX File , 499 KBGUIDE course inventory.
PDF File (Adobe PDF File), 168 KBCONSORT-EHEALTH (V 1.6.1) checklist.
PDF File (Adobe PDF File), 2874 KBDeidentified data supporting the study findings.
XLSX File (Microsoft Excel File), 227 KBReferences
- Fullerton CS, Ursano RJ, Wang L. Acute stress disorder, posttraumatic stress disorder, and depression in disaster or rescue workers. Am J Psychiatry. Aug 01, 2004;161(8):1370-1376. [CrossRef] [Medline]
- Moradi Y, Dowran B, Sepandi M. The global prevalence of depression, suicide ideation, and attempts in the military forces: a systematic review and Meta-analysis of cross sectional studies. BMC Psychiatry. Oct 15, 2021;21(1):510. [FREE Full text] [CrossRef] [Medline]
- Cardozo B, Holtz T, Kaiser R, Gotway C, Ghitis F, Toomey E, et al. The mental health of expatriate and Kosovar Albanian humanitarian aid workers. Disasters. Jun 2005;29(2):152-170. [CrossRef] [Medline]
- Thériault FL, Gardner W, Momoli F, Garber B, Kingsbury M, Clayborne Z, et al. Mental health service use in depressed military personnel: a systematic review. Mil Med. Aug 14, 2020;185(7-8):e1255-e1262. [CrossRef] [Medline]
- Courtney JA, Francis AJP, Paxton SJ. Caring for the country: fatigue, sleep and mental health in Australian rural paramedic shiftworkers. J Community Health. Feb 29, 2013;38(1):178-186. [CrossRef] [Medline]
- Macdonald-Gagnon G, Stefanovics EA, Potenza MN, Pietrzak RH. Generalized anxiety and mild anxiety symptoms in U.S. military veterans: Prevalence, characteristics, and functioning. J Psychiatr Res. Mar 2024;171:263-270. [CrossRef] [Medline]
- Black DW, Carney CP, Peloso PM, Woolson RF, Schwartz DA, Voelker MD, et al. Gulf War veterans with anxiety: prevalence, comorbidity, and risk factors. Epidemiology. Mar 2004;15(2):135-142. [CrossRef] [Medline]
- Bennett P, Williams Y, Page N, Hood K, Woollard M. Levels of mental health problems among UK emergency ambulance workers. Emerg Med J. Mar 2004;21(2):235-236. [FREE Full text] [CrossRef] [Medline]
- Aldrich RS, Cerel J. Occupational suicide exposure and impact on mental health: examining differences across helping professions. Omega (Westport). May 23, 2022;85(1):23-37. [CrossRef] [Medline]
- Walker A, McKune A, Ferguson S, Pyne DB, Rattray B. Chronic occupational exposures can influence the rate of PTSD and depressive disorders in first responders and military personnel. Extrem Physiol Med. Jul 15, 2016;5(1):8. [FREE Full text] [CrossRef] [Medline]
- Spitzer A. First Responders and PTSD: A Literature Review. JEMS. 2020. URL: https://www.jems.com/mental-health-wellness/first-responders-and-ptsd-a-literature-review/ [accessed 2024-12-01]
- Whitworth J, Galusha J, Carbajal J, Ponder WN, Schuman DL. Affective depression mediates PTSD to suicide in a sample of treatment-seeking first responders. J Occup Environ Med. Oct 10, 2022;65(3):249-254. [CrossRef]
- Smith LJ, Zegel M, Bartlett BA, Lebeaut A, Vujanovic AA. Posttraumatic Stress and Alcohol Use Among First Responders. In: Bowers CA, Beidel DC, Marks MR, editors. Mental Health Intervention and Treatment of First Responders and Emergency Workers. Hershey, PA. IGI Global; 2019:32-48.
- Arble E, Daugherty AM, Arnetz BB. Models of first responder coping: Police officers as a unique population. Stress Health. Dec 2018;34(5):612-621. [FREE Full text] [CrossRef] [Medline]
- Beauchamp AM, Weerakoon SM, Ponder WN, Jetelina KK. Possible substance use disorders among first responders during the COVID-19 era: a quasi-experimental study of personal and residential vulnerability. Am J Drug Alcohol Abuse. Nov 02, 2022;48(6):724-733. [CrossRef] [Medline]
- Bonumwezi JL, Tramutola D, Lawrence J, Kobezak HM, Lowe SR. Posttraumatic stress disorder symptoms, work-related trauma exposure, and substance use in first responders. Drug Alcohol Depend. Aug 01, 2022;237:109439. [CrossRef] [Medline]
- Wagner TH, Harris KM, Federman B, Dai L, Luna Y, Humphreys K. Prevalence of substance use disorders among veterans and comparable nonveterans from the National Survey on Drug Use and Health. Psychological Services. Aug 2007;4(3):149-157. [CrossRef]
- Sleep Problems, Depression, Substance Use, Social Bonding, and Quality of Life in Professional Firefighters. University of Pennsylvania. URL: https://oce-ovid-com.proxy.library.upenn.edu/article/00043764-201108000-00015/HTML [accessed 2024-11-20]
- Teeters J, Lancaster C, Brown D, Back S. Substance use disorders in military veterans: prevalence and treatment challenges. SAR. Aug 2017;Volume 8:69-77. [CrossRef]
- Lan C, Fiellin DA, Barry DT, Bryant KJ, Gordon AJ, Edelman EJ, et al. The epidemiology of substance use disorders in US Veterans: A systematic review and analysis of assessment methods. Am J Addict. Jan 2016;25(1):7-24. [FREE Full text] [CrossRef] [Medline]
- Zolnikov TR, Furio F. Stigma on first responders during COVID-19. Stigma and Health. Nov 2020;5(4):375-379. [CrossRef]
- Fisher MP, Lavender CD. Ensuring optimal mental health programs and policies for first responders: opportunities and challenges in one U.S. state. Community Ment Health J. Oct 2023;59(7):1341-1351. [FREE Full text] [CrossRef] [Medline]
- Stanley IH, Hom MA, Joiner TE. A systematic review of suicidal thoughts and behaviors among police officers, firefighters, EMTs, and paramedics. Clin Psychol Rev. Mar 2016;44:25-44. [CrossRef] [Medline]
- Morral AR, Schell TL, Smart R. Comparison of suicide rates among US veteran and nonveteran populations. JAMA Netw Open. Jul 03, 2023;6(7):e2324191. [FREE Full text] [CrossRef] [Medline]
- Lawrence DS, Dockstader J. Law Enforcement Deaths by Suicide. CNA. URL: https://www.cna.org/reports/2024/03/Law-Enforcement-Deaths-By-Suicide.pdf [accessed 2025-08-04]
- The Ruderman White Paper Update on Mental Health and Suicide of First Responders. Ruderman Family Foundation. 2022. URL: https://rudermanfoundation.org/white_papers/the-ruderman-white-paper-update-on-mental-health-and-suicide-of-first-responders/ [accessed 2024-11-17]
- Kleim B, Westphal M. Mental health in first responders: A review and recommendation for prevention and intervention strategies. Traumatology. Dec 2011;17(4):17-24. [CrossRef]
- Pflanz SE, Ogle AD. Job stress, depression, work performance, and perceptions of supervisors in military personnel. Mil Med. Sep 2006;171(9):861-865. [CrossRef] [Medline]
- University of Phoenix Survey Finds 93 Percent of First Responders Say Mental Health is as Important as Physical Health. University of Phoenix. 2018. URL: https://www.phoenix.edu/press-release/university-of-phoenix-survey-finds-93-percent-of-first-responders-say-mental-health-is-as-important-as-physical-health.html [accessed 2024-12-01]
- Ben-Zeev D, Corrigan PW, Britt TW, Langford L. Stigma of mental illness and service use in the military. J Ment Health. Jun 17, 2012;21(3):264-273. [CrossRef] [Medline]
- Crowe A, Glass JS, Lancaster MF, Raines JM, Waggy MR. A content analysis of psychological resilience among first responders and the general population. Sage Open. Mar 20, 2017;7(1):2158244017698530. [CrossRef]
- Sharp M, Fear NT, Rona RJ, Wessely S, Greenberg N, Jones N, et al. Stigma as a barrier to seeking health care among military personnel with mental health problems. Epidemiol Rev. 2015;37:144-162. [CrossRef] [Medline]
- Bowers CA, Beidel DC, Marks MR. Obstacles to mental health treatment: Similarities and differences among first responder groups. Journal of Community Safety and Well-Being. Jun 16, 2022;7(2):42-46. [CrossRef]
- Jones S, Agud K, McSweeney J. Barriers and facilitators to seeking mental health care among first responders: "removing the darkness". J Am Psychiatr Nurses Assoc. Sep 11, 2020;26(1):43-54. [CrossRef] [Medline]
- Rafferty LA, Wessely S, Stevelink SAM, Greenberg N. The journey to professional mental health support: a qualitative exploration of the barriers and facilitators impacting military veterans' engagement with mental health treatment. Eur J Psychotraumatol. Jan 06, 2020;10(1):1700613. [FREE Full text] [CrossRef] [Medline]
- Horan KA, Marks M, Ruiz J, Bowers C, Cunningham A. Here for my peer: the future of first responder mental health. Int J Environ Res Public Health. Oct 22, 2021;18(21):11097. [FREE Full text] [CrossRef] [Medline]
- Mercier J, Hosseiny F, Rodrigues S, Friio A, Brémault-Phillips S, Shields D, et al. Peer support activities for veterans, serving members, and their families: results of a scoping review. Int J Environ Res Public Health. Feb 18, 2023;20(4):3628. [FREE Full text] [CrossRef] [Medline]
- Royle L, Keenan P, Farrell D. Issues of stigma for first responders accessing support for post traumatic stress. Int J Emerg Ment Health. 2009;11(2):79-85. [Medline]
- Hilty DM, Chan S, Hwang T, Wong A, Bauer AM. Advances in mobile mental health: opportunities and implications for the spectrum of e-mental health services. Focus (Am Psychiatr Publ). Jul 2018;16(3):314-327. [FREE Full text] [CrossRef] [Medline]
- Moghimi E, Knyahnytska Y, Omrani M, Nikjoo N, Stephenson C, Layzell G, et al. Benefits of digital mental health care interventions for correctional workers and other public safety personnel: a narrative review. Front Psychiatry. Jul 8, 2022;13:921527. [FREE Full text] [CrossRef] [Medline]
- Firth J, Torous J, Nicholas J, Carney R, Rosenbaum S, Sarris J. Can smartphone mental health interventions reduce symptoms of anxiety? A meta-analysis of randomized controlled trials. J Affect Disord. Aug 15, 2017;218:15-22. [FREE Full text] [CrossRef] [Medline]
- Moshe I, Terhorst Y, Philippi P, Domhardt M, Cuijpers P, Cristea I, et al. Digital interventions for the treatment of depression: A meta-analytic review. Psychol Bull. Aug 2021;147(8):749-786. [CrossRef] [Medline]
- Firth J, Torous J, Nicholas J, Carney R, Pratap A, Rosenbaum S, et al. The efficacy of smartphone-based mental health interventions for depressive symptoms: a meta-analysis of randomized controlled trials. World Psychiatry. Oct 2017;16(3):287-298. [FREE Full text] [CrossRef] [Medline]
- Linardon J, Cuijpers P, Carlbring P, Messer M, Fuller-Tyszkiewicz M. The efficacy of app-supported smartphone interventions for mental health problems: a meta-analysis of randomized controlled trials. World Psychiatry. Oct 09, 2019;18(3):325-336. [FREE Full text] [CrossRef] [Medline]
- Spijkerman MPJ, Pots WTM, Bohlmeijer ET. Effectiveness of online mindfulness-based interventions in improving mental health: A review and meta-analysis of randomised controlled trials. Clin Psychol Rev. Apr 2016;45:102-114. [FREE Full text] [CrossRef] [Medline]
- Seppälä EM, Bradley C, Moeller J, Harouni L, Nandamudi D, Brackett MA. Promoting mental health and psychological thriving in university students: a randomized controlled trial of three well-being interventions. Front Psychiatry. Jul 15, 2020;11:590. [FREE Full text] [CrossRef] [Medline]
- Groot J, MacLellan A, Butler M, Todor E, Zulfiqar M, Thackrah T, et al. The effectiveness of fully automated digital interventions in promoting mental well-being in the general population: systematic review and meta-analysis. JMIR Ment Health. Oct 19, 2023;10:e44658. [FREE Full text] [CrossRef] [Medline]
- The GUIDE App. URL: https://theguideapp.com/ [accessed 2025-03-09]
- Stewart V, Merron K. Pilot Study White Paper. The GUIDE App. URL: https://theguideapp.com/pilotstudy/ [accessed 2025-08-04]
- Seligman MEP. Flourish: A visionary new understanding of happiness and well-being. New York, NY. Free Press; 2011.
- Questionnaires. Peggy Kern. URL: https://www.peggykern.org/questionnaires.html [accessed 2024-12-01]
- Butler J, Kern ML. The PERMA-Profiler: A brief multidimensional measure of flourishing. Intnl J Wellbeing. Oct 13, 2016;6(3):1-48. [CrossRef]
- Benson T, Sladen J, Liles A, Potts HWW. Personal Wellbeing Score (PWS)-a short version of ONS4: development and validation in social prescribing. BMJ Open Qual. Apr 11, 2019;8(2):e000394. [FREE Full text] [CrossRef] [Medline]
- Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167-176. [FREE Full text] [CrossRef] [Medline]
- Sischka PE, Costa AP, Steffgen G, Schmidt AF. The WHO-5 well-being index – validation based on item response theory and the analysis of measurement invariance across 35 countries. Journal of Affective Disorders Reports. Dec 2020;1:100020. [CrossRef]
- Kaufman EA, Xia M, Fosco G, Yaptangco M, Skidmore CR, Crowell SE. The Difficulties in Emotion Regulation Scale Short Form (DERS-SF): validation and replication in adolescent and adult samples. J Psychopathol Behav Assess. Nov 23, 2015;38(3):443-455. [CrossRef]
- Burton AL, Brown R, Abbott MJ. Overcoming difficulties in measuring emotional regulation: Assessing and comparing the psychometric properties of the DERS long and short forms. Cogent Psychology. Apr 07, 2022;9(1):2060629. [CrossRef]
- Kroenke K, Strine TW, Spitzer RL, Williams JBW, Berry JT, Mokdad AH. The PHQ-8 as a measure of current depression in the general population. J Affect Disord. Apr 2009;114(1-3):163-173. [CrossRef] [Medline]
- Pressler SJ, Subramanian U, Perkins SM, Gradus-Pizlo I, Kareken D, Kim J, et al. Measuring depressive symptoms in heart failure: validity and reliability of the patient health questionnaire-8. Am J Crit Care. Mar 2011;20(2):146-152. [CrossRef] [Medline]
- Spitzer RL, Kroenke K, Williams JBW, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. May 22, 2006;166(10):1092-1097. [CrossRef] [Medline]
- Löwe B, Decker O, Müller S, Brähler E, Schellberg D, Herzog W, et al. Validation and standardization of the Generalized Anxiety Disorder Screener (GAD-7) in the general population. Med Care. Mar 2008;46(3):266-274. [CrossRef] [Medline]
- Weiner BJ, Lewis CC, Stanick C, Powell BJ, Dorsey CN, Clary AS, et al. Psychometric assessment of three newly developed implementation outcome measures. Implement Sci. Aug 29, 2017;12(1):108. [FREE Full text] [CrossRef] [Medline]
- Brooke J. SUS - A quick and dirty usability scale. Agency for Healthcare Research and Quality. URL: https://digital.ahrq.gov/sites/default/files/docs/survey/systemusabilityscale%2528sus%2529_comp%255B1%255D.pdf [accessed 2025-08-04]
- Bangor A, Kortum PT, Miller JT. An empirical evaluation of the system usability scale. International Journal of Human-Computer Interaction. Jul 30, 2008;24(6):574-594. [CrossRef]
- Lewis JR, Sauro J. Item benchmarks for the System Usability Scale. J User Exp. 2018;13(3):158-167. [FREE Full text]
- Eysenbach G, CONSORT-EHEALTH Group. CONSORT-EHEALTH: improving and standardizing evaluation reports of Web-based and mobile health interventions. J Med Internet Res. Dec 31, 2011;13(4):e126. [FREE Full text] [CrossRef] [Medline]
- Armstrong RA. When to use the Bonferroni correction. Ophthalmic Physiol Opt. Sep 2014;34(5):502-508. [CrossRef] [Medline]
- Kim H. Statistical notes for clinical researchers: assessing normal distribution (2) using skewness and kurtosis. Restor Dent Endod. Feb 2013;38(1):52-54. [FREE Full text] [CrossRef] [Medline]
- Nich C, Carroll KM. Intention-to-treat meets missing data: implications of alternate strategies for analyzing clinical trials data. Drug Alcohol Depend. Oct 01, 2002;68(2):121-130. [FREE Full text] [CrossRef] [Medline]
- Armijo-Olivo S, Warren S, Magee D. Intention to treat analysis, compliance, drop-outs and how to deal with missing data in clinical research: a review. Physical Therapy Reviews. Jul 19, 2013;14(1):36-49. [CrossRef]
- Richardson JT. Eta squared and partial eta squared as measures of effect size in educational research. Educational Research Review. Jan 2011;6(2):135-147. [CrossRef]
- Cohen J. Statistical Power Analysis for the Behavioral Sciences. New York, NY. Routledge; 2013.
- Fritz CO, Morris PE, Richler JJ. Effect size estimates: current use, calculations, and interpretation. J Exp Psychol Gen. Feb 2012;141(1):2-18. [CrossRef] [Medline]
- Boerma T, Hosseinpoor AR, Verdes E, Chatterji S. A global assessment of the gender gap in self-reported health with survey data from 59 countries. BMC Public Health. Jul 30, 2016;16:675. [FREE Full text] [CrossRef] [Medline]
- Lipschitz JM, Pike CK, Hogan TP, Murphy SA, Burdick KE. The engagement problem: A review of engagement with digital mental health interventions and recommendations for a path forward. Curr Treat Options Psychiatry. Sep 25, 2023;10(3):119-135. [FREE Full text] [CrossRef] [Medline]
- Schober P, Boer C, Schwarte LA. Correlation coefficients: appropriate use and interpretation. Anesth Analg. May 2018;126(5):1763-1768. [CrossRef] [Medline]
- Terlizzi EP, Zablotsky B. Symptoms of Anxiety and Depression Among Adults: United States, 2019 and 2022. CDC. 2024. URL: https://stacks.cdc.gov/view/cdc/164018 [accessed 2025-08-04]
- Jones S. Describing the mental health profile of first responders: a systematic review. J Am Psychiatr Nurses Assoc. May 2017;23(3):200-214. [CrossRef] [Medline]
- Kligyte V, Connelly S, Thiel C, Devenport L. The influence of anger, fear, and emotion regulation on ethical decision making. Human Performance. Sep 10, 2013;26(4):297-326. [CrossRef]
- Troy AS, Mauss IB. Resilience in the face of stress: emotion regulation as a protective factor. In: Southwick SM, Litz BT, Charney D, Friedman MJ, editors. Resilience and Mental Health. Cambridge, United Kingdom. Cambridge University Press; 2011:30-44.
- Chang C, Palermo E, Deswert S, Brown A, Nuske HJ. Money can't buy happiness: A randomized controlled trial of a digital mental health app with versus without financial incentives. Digit Health. 2023;9:20552076231170693. [FREE Full text] [CrossRef] [Medline]
- Wild J, El-Salahi S, Esposti MD. The effectiveness of interventions aimed at improving well-being and resilience to stress in first responders. European Psychologist. Oct 2020;25(4):252-271. [CrossRef]
- Witarto BS, Visuddho V, Witarto AP, Bestari D, Sawitri B, Melapi TAS, et al. Effectiveness of online mindfulness-based interventions in improving mental health during the COVID-19 pandemic: A systematic review and meta-analysis of randomized controlled trials. PLoS One. Sep 21, 2022;17(9):e0274177. [FREE Full text] [CrossRef] [Medline]
- Gould CE, Kok BC, Ma VK, Zapata AML, Owen JE, Kuhn E. Veterans Affairs and the Department of Defense mental health apps: A systematic literature review. Psychol Serv. May 2019;16(2):196-207. [CrossRef] [Medline]
- Men accounted for about 75 percent of workers in protective service occupations in 2020. U.S. Bureau of Labor Statistics. 2021. URL: https://www.bls.gov/opub/ted/2021/men-accounted-for-about-75-percent-of-workers-in-protective-service-occupations-in-2020.htm [accessed 2024-12-02]
- 2022 Demographics Profile Active-Duty Members. Military OneSource. URL: https://download.militaryonesource.mil/12038/MOS/Infographic/2022-demographics-active-duty-members.pdf [accessed 2025-08-04]
- Schaeffer K. The changing face of America’s veteran population. Pew Research Center. 2023. URL: https://www.pewresearch.org/short-reads/2023/11/08/the-changing-face-of-americas-veteran-population/ [accessed 2024-12-02]
- Carson LM, Marsh SM, Brown MM, Elkins KL, Tiesman HM. An analysis of suicides among first responders ─ Findings from the National Violent Death Reporting System, 2015-2017. J Safety Res. Jun 2023;85:361-370. [CrossRef] [Medline]
- Addis ME, Mahalik JR. Men, masculinity, and the contexts of help seeking. Am Psychol. Jan 2003;58(1):5-14. [CrossRef] [Medline]
- Czap NV, Czap HJ, Khachaturyan M, Burbach ME, Czap NV. Comparing female and male response to financial incentives and empathy nudging in an environmental context. Rev Behav Econ. Mar 26, 2018;5(1):61-84. [CrossRef]
- Linardon J. Rates of attrition and engagement in randomized controlled trials of mindfulness apps: Systematic review and meta-analysis. Behav Res Ther. Nov 2023;170:104421. [FREE Full text] [CrossRef] [Medline]
- Stone D, Bryant S, Wier B. Why are financial incentive effects unreliable? an extension of self-determination theory. Behav Res Account. 2010;22(2):105-132. [CrossRef]
- Gulliver A, Calear AL, Sunderland M, Kay-Lambkin F, Farrer LM, Batterham PJ. Predictors of acceptability and engagement in a self-guided online program for depression and anxiety. Internet Interv. Sep 2021;25:100400. [FREE Full text] [CrossRef] [Medline]
- Alqahtani F, Orji R. Insights from user reviews to improve mental health apps. Health Informatics J. Sep 10, 2020;26(3):2042-2066. [FREE Full text] [CrossRef] [Medline]
Abbreviations
| AIM: Acceptability of Intervention Measure |
| CONSORT: Consolidated Standards of Reporting Trials |
| DERS-SF: Difficulties in Emotion Regulation Scale–Short Form |
| FIM: Feasibility of Intervention Measure |
| GAD-7: Generalized Anxiety Disorder-7 |
| HIPAA: Health Insurance Portability and Accountability Act |
| IAM: Intervention Appropriateness Measure |
| IRB: Institutional Review Board |
| MD: mean difference |
| mHealth: mobile health |
| PERMA: Positive Emotion, Engagement, Relationships, Meaning, Accomplishment |
| PHQ-8: Patient Health Questionnaire-8 |
| PWS: Personal Well-being Score |
| SUS: System Usability Scale |
| WHO-5: World Health Organization 5-item Well-being Index |
Edited by T de Azevedo Cardoso; submitted 10.Jan.2025; peer-reviewed by K Horan, F Quazi; comments to author 24.Feb.2025; revised version received 17.Mar.2025; accepted 01.May.2025; published 17.Oct.2025.
Copyright©Morgan K Dunphy, Heather J Nuske. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 17.Oct.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

