Review
Abstract
Background: Patients with cancer experience complex physical, psychosocial, and behavioral challenges that require continuous support. This need has intensified with the rising cancer burden worldwide and the limited scalability of traditional care models. In response, conversational agents (CAs) have emerged as promising digital interventions for enhancing cancer care, but evidence regarding their feasibility and effectiveness remains limited.
Objective: This study aimed to evaluate the feasibility and effectiveness of CAs in supporting care for patients with cancer and to summarize the key characteristics of CA interventions to inform future design and implementation.
Methods: We systematically searched PubMed, Cochrane Library, Web of Science, and Embase databases from the index date through February 3, 2025, and screened reference lists and trial registries for gray literature. Eligible studies included randomized controlled trials (RCTs) and nonrandomized interventions (NRIs) evaluating CA-delivered interventions targeting health outcomes in patients with cancer. Two reviewers independently selected studies and extracted data. Study quality was then appraised using the Cochrane Risk of Bias 2.0 tool for RCTs and the Joanna Briggs Institute (JBI) Critical Appraisal Checklist for NRIs. Extracted data included study characteristics, CA features, and implementation outcomes, including feasibility, acceptability, and usability. Meta-analyses were conducted on physical activity, pain, anxiety, depression, psychological distress, and quality of life. Narrative synthesis was used for outcomes with inconsistent reporting across studies, including health information acquisition and treatment-related side effects.
Results: In total, 17 studies involving 1817 patients with cancer were included, with 10 (58.8%) studies being included in the meta-analysis. The meta-analysis showed significant improvements in physical activity (mean difference [MD]=1.44, 95% CI 0.36-2.52, P<.01), pain (MD=–0.91, 95% CI –1.44 to –0.38, P<.01), anxiety (SMD=–0.19, 95% CI –0.35 to –0.02, P=.02), and quality of life (SMD=0.35, 95% CI 0.03-0.67, P=.03). No significant effects were observed on depression (SMD=–0.07, 95% CI –0.42 to 0.27, P=.68) or psychological distress (SMD=–0.33, 95% CI –0.66 to 0.01, P=.06). Narrative synthesis suggested that CAs have the potential to improve patients’ acquisition of health information and help manage treatment-related side effects. Notably, CAs were generally found to be safe, feasible, acceptable, and usable among patients with cancer, particularly during the initial phase of use. However, user engagement tended to decline over time, underscoring the need for strategies to sustain long-term use.
Conclusions: This systematic review is the first comprehensive analysis to suggest that CAs are feasible, acceptable, usable, and effective interventions for patients with cancer. Nevertheless, the limited psychological benefits and suboptimal long-term user engagement indicate the need for further refinement. Future research should adopt theory-based designs and leverage emerging technologies to enhance personalization, empathy, and sustained engagement in CA interventions. Robust evidence from large-scale RCTs is needed to strengthen the evidence base.
Trial Registration: PROSPERO CRD42025645982; https://www.crd.york.ac.uk/PROSPERO/view/CRD42025645982
doi:10.2196/76968
Keywords
Introduction
Background
Cancer remains a leading cause of morbidity and mortality worldwide, placing a substantial burden on health care systems. In 2022, an estimated 20 million new cancer cases and 9.7 million cancer-related deaths occurred worldwide, with approximately 53.5 million individuals living with cancer []. A cancer diagnosis not only presents significant challenges to health care systems but also profoundly affects patients’ daily lives. Both cancer itself and its treatment can result in a wide range of physical symptoms and psychological distress []. These multifaceted burdens frequently disrupt patients’ overall well-being and significantly impair their quality of life [].
In this context, there has been increasing emphasis on developing effective cancer care that provides continuous, comprehensive support alongside standard oncological treatment, aiming to address patients’ physical and psychological needs and to promote behavioral change across the cancer care continuum []. However, implementing such care models through traditional face-to-face delivery remains difficult, as these approaches are often constrained by time, staffing, and scalability—particularly in resource-limited settings []. As the demand for personalized and continuous support grows and health care systems shift toward cost-efficient care, these limitations have become increasingly evident, highlighting the necessity for automated, scalable, and cost-effective solutions to enhance care for patients with cancer [].
In response to these limitations, there has been growing interest in leveraging mobile health (mHealth) technologies to enhance care delivery. In particular, conversational agents (CAs) have emerged as a promising mHealth innovation in supporting care for patients with cancer []. CAs are dialogue systems capable of understanding and generating human language to enable effective human-computer interactions []. Unlike conventional mHealth tools—which often rely on static content, unidirectional reminders, or resource-intensive human support—CAs provide automated, real-time dialogue capabilities that support dynamic, personalized interactions []. By simulating human-like conversations, CAs can adapt to patients’ evolving informational, emotional, and practical needs—an essential feature in the complex and often unpredictable trajectory of cancer care. These capabilities may not only improve the continuity and personalization of care but also offer a scalable means of support, potentially reducing the burden on health care providers [].
Although CA interventions have shown promise and are increasingly used in cancer care, evidence regarding their feasibility and effectiveness remains inconclusive []. Although some studies have reported positive outcomes—such as improved symptom management, psychological support, behavior change, and quality of life—others have shown minimal or no effects [,]. These inconsistencies highlight the need for a comprehensive synthesis of existing evidence to clarify the effectiveness of CAs in supporting cancer care []. In parallel, although feasibility is frequently cited as a strength of CA interventions—due to their ability to deliver personalized support, reduce access barriers, and provide timely responses—several concerns have also emerged, including risks of miscommunication, limited empathetic responsiveness, and challenges in sustaining use [,]. Together, these inconsistencies in both clinical and implementation outcomes point to the urgent need for a systematic synthesis to guide the design and integration of CA interventions in cancer care.
To date, no systematic review has comprehensively evaluated the effectiveness or feasibility of CA-based interventions in cancer care. Existing reviews remain limited in scope; notably, only 1 previous narrative review provides a broad overview of CA applications in oncology but does not quantitatively synthesize outcomes or systematically synthesize implementation outcomes []. Consequently, it remains unclear whether CAs lead to improvements in health outcomes of patients with cancer; what implementation-related patterns—particularly regarding feasibility, usability, and acceptability—have emerged; and what core features define current CA interventions for cancer care. A rigorous synthesis of the existing evidence is warranted to inform future CA development, facilitate clinical implementation, and support evidence-based decision-making.
Objectives
This systematic review and meta-analysis aimed to comprehensively evaluate the use of CA interventions in cancer care. Specifically, the objectives were to (1) assess their effectiveness in improving patient outcomes, including quality of life, pain, anxiety, depression, psychological distress, physical activity, treatment-related side effects, and health information acquisition; (2) evaluate implementation outcomes—such as feasibility, acceptability, and usability—to determine the practical applicability of CAs in oncology settings; and (3) synthesize the core characteristics of CA interventions, including intervention components, and CA delivery and interaction logic, as well as input and output modalities, to inform future development and implementation.
Methods
Study Protocol and Registration
This systematic review and meta-analysis was based on a prespecified protocol registered with PROSPERO (International Prospective Register of Systematic Reviews; CRD42025645982) and conducted and reported according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement []. The PRISMA checklist can be found in in Table S1 in .
Search Strategy
A comprehensive systematic literature search was conducted on 4 different electronic databases—PubMed, Embase, Web of Science, and Cochrane Library—from the index date through February 3, 2025. After study screening and selection, references for the identified studies and relevant websites, such as clinical trial registries, were manually searched to find potential related gray papers. The keywords of the search mainly revolved around “cancer” and “conversational agent.”. The full search strategies and the number of search results from each database are detailed in Table S2 in .
Inclusion and Exclusion Criteria
We considered eligible randomized controlled trials (RCTs) and nonrandomized interventions (NRIs) that examined the effects of CAs, defined as dialogue systems capable of understanding and generating human language to enable effective human-computer interactions. Eligible studies aimed to improve health-related outcomes in adult patients with cancer using CAs that were either specifically developed for oncology care or originally designed for general health purposes and subsequently applied to patients with cancer. The inclusion and exclusion criteria are reported in .
Inclusion criteria:
- Population: participants aged ≥18 years, of both sexes, diagnosed with cancer
- Intervention: online interventions delivered through conversational agents (CAs), which were defined as dialogue systems capable of understanding and generating human language to enable effective human-computer interactions
- Comparison: any comparator was acceptable (no intervention, usual care, active comparators, or a within-subject pre-post design)
- Outcomes: studies reporting health-related outcomes of patients with cancer
- Study design: studies reporting original data from randomized controlled trials (RCTs) and nonrandomized interventions (NRIs)
Exclusion criteria:
- Duplicate papers
- Literature for which full text is not available, such as conference abstracts
- Unrelated to the topic
Screening and Study Selection
All studies identified through database searches, reference lists, and relevant websites were imported into EndNote X9 (Clarivate Analytics), a reference management software program. EndNote was also used to detect and remove any duplicate entries. The selection of studies was conducted in 2 stages. Initially, 2 independent reviewers (authors JXH and YXH) screened the titles and abstracts of all identified studies in EndNote to exclude studies that did not meet the basic inclusion criteria. Subsequently, the full texts of the remaining studies were retrieved and assessed in detail against the inclusion and exclusion criteria by the same 2 reviewers. Any disagreements between reviewers were resolved through discussion or, if necessary, consultation with a third reviewer (author PJS).
Data Extraction
Data extraction using Microsoft Excel was first pilot-tested on 3 studies and revised with additional headings before being applied to all included studies. Two independent reviewers (JXH and YXH) conducted the data extraction process, with any disagreements resolved by a third reviewer (PJS). The following data were extracted: (1) basic study characteristics (author, title, country, and publication year), (2) sample characteristics (cancer type, sample size, age, and gender), (3) intervention components (consultable content, CA-only vs multicomponent interventions, frequency and duration, theoretical framework), (4) CA delivery and interaction logic (CA name, delivery platform, prompt sequence, whether the CA was cancer specific, clinician referral, algorithm type, and embodiment type), (5) input and output information (input data and format, input and output modality, and output source), and (6) outcome measures (health outcomes, feasibility, acceptability, and usability).
Quality Evaluation
Two independent authors (JXH and YXH) assessed the risk of bias for included studies using the revised Cochrane Risk of Bias Tool version 2.0 (ROB 2.0) for RCTs [] and the Joanna Briggs Institute (JBI) Critical Appraisal Checklist for NRIs []. When disagreements arose, a third reviewer (PJS) was consulted, and consensus was achieved through discussion.
Statistical Methods
The primary outcome was quality of life, selected for its direct relevance in assessing the overall effectiveness of CA interventions in cancer care. Secondary outcomes included effectiveness outcomes (physical activity, pain, anxiety, depression, psychological distress, treatment-related side effects, and health information acquisition) and implementation outcomes (feasibility, acceptability, and usability). Depending on data availability and consistency, either a meta-analysis or a narrative synthesis was conducted.
Effectiveness outcomes were synthesized using meta-analysis when the included studies demonstrated sufficient statistical homogeneity. Analyses were performed using Review Manager (RevMan) version 5.4 (Cochrane Collaboration). For each outcome, either change-from-baseline scores or postintervention scores were extracted, with a consistent approach was applied within each analysis. When necessary, missing summary statistics were estimated following the methods recommended in the Cochrane Handbook for Systematic Reviews of Interventions []. Effect sizes were reported as standardized mean differences (SMDs) with 95% CIs when different measurement scales were used, and as mean differences (MDs) when the same scale was applied. Heterogeneity was assessed using the chi-square test and the I² statistic. If P>.10 and I2 < 50%, it suggested that the heterogeneity among studies as small, and the fixed-effect model was selected. If P<.10 and I2> 50%, it suggested that there was obvious heterogeneity among the studies, and the random-effects model was adopted []. Sensitivity analysis was performed by comparing the results of the combined effect sizes of the fixed-effects model and random-effects model to ensure stability and reliability of the results []. Publication bias was examined using funnel plots.
For effectiveness outcomes not amenable to meta-analysis due to insufficient data or heterogeneity, a narrative synthesis was conducted. This synthesis was structured by outcome domain and described the direction and magnitude of reported effects. For implementation outcomes, a narrative synthesis was conducted and organized thematically according to key evaluation dimensions, including feasibility (retention rate, safety, interactions, duration of engagement, and engagement rate), acceptability (satisfaction, perceived helpfulness, and recommendation willingness), and usability (usability of content, ease of use, and user experience). In addition, intervention characteristics—including intervention components, CA delivery and interaction logic, and input and output information—were descriptively summarized.
Results
Results of Literature Search
The database search yielded 2087 results. A total of 1323 (63.4%) titles and abstracts were screened after removal of duplicates, and 1188 (56.9%) studies were excluded after screening. In total, 135 (6.5%) potential full-text papers were identified for further evaluation, of which 15 (11.1%) met our inclusion criteria. An additional 87 papers were identified from websites and relevant citations. After removing duplicates and assessing titles, abstracts, and full texts, 2 extra papers were identified and included. In total, 17 papers were included in this systematic review ().

Study and Sample Characteristics
The 17 included studies [,,,-] were published between 2016 and 2024 (please note that Bema [] contains the complete documentation of results from Mlakar et al’s [] study). There were a total of 1817 participants, and sample sizes ranged between 30 (1.7%) and 336 (18.5%). The mean age of the participants ranged between 25 (SD 2.9) and 64 (SD 11.8) years. Of the 17 studies, 12 (70.6%) were RCTs, and 5 (29.4%) were NRIs. Most studies were conducted in the United States (n=10, 58.8%) and primarily targeted patients with breast cancer (n=8, 47.1%). Overall, 69.7% (n=1266) of the participants were women. A summary of study characteristics is presented in .
| Study; country | Research type | Sample size (N) and participants | Age (years), mean (SD) | Female gender, n (%) | Consultable content | Outcomes |
| Baumgärtner et al []; Germany | 2-arm RCTa | 112 patients with prostate cancer | 64.0 (11.8) | 0 | General prostate cancer management | User health, feasibility, acceptability, and usability data |
| Tawfik et al []; Egypt | 3-arm RCT | 150 patients with breast cancer | 44.7 (7.0) | 150 (100.0) | Symptom management | User health and usability data |
| Greer et al []; United States | 2-arm RCT | 45 patients with cancer | 25.0 (2.9) | 36 (80.0) | Psychological management | User health, feasibility, acceptability, and usability data |
| Schmitz et al []; United States | 2-arm RCT | 36 patients with breast cancer | 53.4 (11.0) | 36 (100.0) | Symptom management | User health, feasibility, acceptability, and usability data |
| Sikorskii et al []; United States | 2-arm RCT | 272 patients with cancer | 61.0 (12.0) | 69 (51.0) | Symptom management and medication reminders | User health data |
| Hassoon et al []; United States | 3-arm RCT | 42 patients with cancer | 62.1 (9.8) | 38 (90.0) | Physical activity management | User health and feasibility data |
| Maguire et al []; United Kingdom | 2-arm RCT | 336 patients with cancer | 52.4 (12.2) | 275 (81.8) | Symptom management | User health and feasibility data |
| Lee et al []; Korea | 4-arm RCT | 145 patients with breast cancer | —b | 145 (100.0) | Management regarding radiotherapy | User health and feasibility data |
| Horesh et al []; United States | NRIc | 37 patients with breast and ovarian cancer | 46.8 (6.8) | 37 (100.0) | Psychological management | User health and acceptability data |
| Mlakar et al [,]; Slovenia | NRI | 160 patients with breast and colorectal cancer | 55.0 (—) | 129 (77.7) | General cancer management | User health data |
| Bibault et al []; France | 2-arm RCT | 142 patients with breast cancer | 42.0 (19.0) | 142 (100.0) | General breast cancer management | User health data |
| Al-Hilli et al []; United States | 2-arm RCT | 37 patients with breast cancer | 62.0 (13.0) | 37 (100.0) | Genetic counseling management | User health and acceptability data |
| Kamdar et al []; United States | 2-arm RCT | 112 patients with cancer | 52.7 (8.5) | 55 (49.2) | Pain management | User health, feasibility, and usability data |
| Queiroz et al []; Brazil | NRI | 30 patients with colorectal cancer | 50.3 (13.4) | 13 (43.3) | General colorectal cancer management | User health, feasibility, and usability data |
| Bickmore et al []; United States | 2-arm RCT | 89 patients with cancer | 59.2 (9.8) | 48 (46.0) | Clinical trial information | User health and acceptability data |
| Gomaa et al []; United States | NRI | 34 patients with gastrointestinal cancer | 61.0 (12.0) | 19 (56.0) | Symptom management | User health, feasibility, acceptability, and usability data |
| Caru et al []; United States | NRI | 38 patients with breast cancer | 52.4 (11.0) | 38 (100.0) | General breast cancer management | User health data |
aRCT: randomized controlled trial.
bNot available.
cNRI: nonrandomized intervention.
Characteristics of CA Interventions
The characteristics of the CA interventions are presented in Table S3 in .
Intervention Components
In terms of consultable content, most CAs delivered general management targeting specific malignancies (n=6, 35.3%), including breast cancer [,,,], colorectal cancer [], and prostate cancer []. Several other studies (n=5, 29.4%) provided symptom management [,,,,], occasionally incorporating medication reminders [] or addressing specific pain symptoms []. Two studies provided psychological management [,]. Individual studies focused on radiotherapy-related management [], physical activity [], genetic counseling [], or clinical trial information [].
Regarding intervention components, 8 (47.1%) studies used CA-only interventions [,,,,,,,], while 9 (52.9%) studies integrated additional interventions, such as mHealth apps [], medical team referrals [,,,,,], printed education materials [], mindfulness therapy [,], wearable sensors [,], and prescription refill request tools []. Intervention durations ranged from 4 weeks to 6 months, except for 3 (17.6%) studies that adopted a single-session design.
Most studies did not explicitly adopt a theoretical framework. Only 3 (17.6%) studies reported using theoretical frameworks: one used the Empowerment Education Model []; another applied stress and coping theory, together with the broaden-and-build theory of positive emotion []; and the third adopted cognitive behavioral therapy as its theoretical basis []; see Table S3 in .
CA Delivery and Interaction Logic
Across the 17 studies, 17 CAs were deployed, with 2 (11.8%) studies using the same CA (Nurse Amie) [,] and 1 (5.9%) study [] incorporating 2 CAs (MyCoach and CoachText) []. These CAs operated through various digital platforms, including custom-developed mHealth apps (n=5, 29.4%) [,,,,], Facebook Messenger (n=3, 17.6%) [,,], SMS (n=3, 17.6%) [,,], Amazon Echo Show and Alexa (n=2, 11.8%) [,], Kakao (n=1, 5.9%) [], phone calls (n=1, 5.9%) [], Microsoft Azure (n=1, 5.9%) [], and a website (n=1, 5.9%) []. Of the 17 included studies, 14 (82.4%) evaluated CAs specifically designed for patients with cancer [,,-,,,-], while the remaining 3 (17.6%) involved CAs originally developed for general health purposes, such as managing chronic conditions or delivering preventive health counseling, but were applied to patients with cancer in the study context [,,].
The sequencing of prompts to the CAs generally followed 3 distinct modes: user-initiated inputs, where users actively started the interaction (n=4, 23.5%) [,,,]; system-initiated prompts, where messages were proactively delivered by the CAs based on a predefined schedule or contextual triggers, which do not require prior user input but allow for subsequent user interaction (n=4, 23.5%) [,,,]; and hybrid approaches that combined both user- and system-initiated interactions (n=9, 52.9%) [,,,,-,,].
In terms of clinician referral, only 6 (35.3%) studies incorporated mechanisms to hand over user inquiries to health care professionals, when necessary [,,,,,], while 11 (64.7%) did not [,,,,,,,,,,].
In terms of embodiment, 5 (29.4%) CAs were virtual, using visual avatars or animated characters [,,,,], while the remaining 12 (70.6%) were disembodied [,,,-,-,], interacting without visual representation.
Additionally, CAs used 3 types of dialogue approaches: 5 (29.4%) used artificial intelligence (AI)-based methods (leveraging AI techniques such as machine learning and natural language processing to produce contextually relevant responses) [,,,,], 5 (29.4%) used a rule-based approach (generating deterministic responses through predefined rules and structured pathways) [,,,,], and 7 (41.2%) adopted a hybrid dialogue approach (integrating both AI- and rule-based logic) [,,,-]; see Table S3 in .
Input and Output Information
Input data included symptoms, adverse effects, adherence behaviors (medication and treatment), lifestyle behaviors (diet, nutrition, physical activity), emotions, treatment procedures, therapy feedback and planning, genetic counseling, clinical trial information, diagnostic information, and appointment scheduling. Users interacted with CAs through 3 distinct input methods: free input, where users entered open-ended queries (n=3, 17.6%) [,,]; button input, involving the selection of predefined topics or numerical options (n=6, 35.3%) [-,,,]; and mixed input, which combined both free and button-based interactions (n=8, 47.1%) [,,,,,-].
Output information was predominantly text based (n=9, 52.9%) [,,,,-,], followed by multimodal formats (eg, audio and video; n=5, 29.4%) [,,,,] and voice-based outputs (n=3, 17.6%) [,,]. The content delivered by CAs was generally derived from authoritative sources. These included clinical guidelines from organizations such as the American Cancer Society (ACS), National Cancer Institute (NCI), and the National Comprehensive Cancer Network (NCCN); peer-reviewed literature; expert consensus; evidence-based question-and-answer (QnA) libraries developed by study teams; and authoritative health websites [,,,,,-]. Additionally, 6 (35.3%) studies used cloud-based platforms, such as Amazon Alexa, Microsoft QnA Maker, SAP Conversational AI, and Kakao chatbot frameworks, to retrieve responses from predefined, evidence-informed knowledge bases [,,,,,]; see Table S3 in .
Feasibility, Acceptability, and Usability of CAs
The studies included in the qualitative synthesis of feasibility, acceptability, and usability are listed in Table S4 in .
Feasibility
Feasibility outcomes of CA interventions were reported in 9 (52.9%) of the 17 included studies, focusing primarily on (1) safety, (2) retention rate, (3) engagement rate, (4) interactions, and (5) duration of engagement [,,,-,,,]. Safety was assessed in 3 (33.3%) studies, all of which reported no adverse events, indicating that CAs are generally safe for participants [,,]. Retention rates—defined as the proportion of participants who completed postintervention assessments—were reported in 7 (63%) studies and ranged from 64% to 85.7%, with most exceeding 70% [,,,,,,]. Engagement rates—defined as the proportion of participants who interacted with the CA at least once during the intervention—were reported in 4 (44.4%) studies and ranged from 59% to 86% [,,,]. The interaction frequency varied considerably, with participants engaging with the CA between 3 and 15 times on average and usage frequencies ranging from 2.1 times per week to twice daily [,,,,,]. Two studies reported usage duration metrics: one study documented a mean total usage time of 73.8 (SD 52) minutes over 4 weeks [], while another reported an average of 3.4 usage days per participant within the same period []. Notably, 3 (33.3%) studies observed a declining trend in engagement over time, including reductions in the interaction frequency and the proportion of active users [,,]. Overall, CA interventions are feasible and generally safe in patients with cancer, with promising levels of initial engagement. However, maintaining sustained user involvement remains a significant challenge that warrants attention in future design and implementation efforts.
Acceptability
Of the 17 studies, 7 (41.2%) assessed acceptability, focusing on (1) satisfaction, (2) perceived helpfulness, and (3) recommendation willingness. Satisfaction with CA interventions was consistently high, with rates ranging from 70% to 97% in 4 (57.1%) studies [,,,], and 1 (14.3%) study [] reporting comparable satisfaction to in-person consultations and higher ratings than traditional digital interfaces. Perceived helpfulness, evaluated in 5 (71.4%) studies, was generally favorable, with 70%-90.7% of participants finding the interventions beneficial in 4 (80%) studies [,,,]; in 1 (20%) study, participants rated the interventions 5.76 out of 10 in terms of perceived help with symptom relief, indicating a moderate level of perceived helpfulness in addressing symptoms []. Recommendation willingness was evaluated in 4 (57.1%) studies, with scores ranging from 6.9 to 7.24 out of 10, and 95.3%-97% of participants agreed to recommend CAs for broader clinical implementation [,,,]. Collectively, these findings suggest that CA interventions are generally well accepted by patients with cancer.
Usability
Usability of CAs was evaluated in 7 (41.2%) of the 17 studies, focusing on 3 key aspects: (1) usability of content, (2) ease of use, and (3) user experience. Usability of content reported in 5 (71.4%) studies received consistently high scores ranging from 79.6 to 86.14—well above the 68-point threshold for acceptable usability [,,,]. Additionally, in 1 (14.3%) study, 94% of participants described CA responses as useful and informative []. Ease of use, assessed in 3 (42.9%) studies, was also highly rated, with 94%-100% of participants agreeing that the CAs were easy to use [,,]. User experience reported in 3 (42.9%) studies was generally positive [,,]. Participants highlighted several valued features of the CAs, including their nonjudgmental nature, intuitive navigation, effective error management, decision-making support, personalized educational content, motivational messaging, and communication skill support in clinical interactions [,,,].
Effectiveness of CA Interventions: Meta-Analysis
Of the 17 studies, 10 (58.8%) were included in a meta-analysis, comprising 6 (60%) RCTs [,,-,] and 4 (40%) NRIs [,,,]. Given the clinical and methodological similarities across the included studies, quality of life, physical activity, pain, anxiety, depression, and psychological distress were adopted as outcome indicators of the meta-analysis. The results of the meta-analysis are presented in and . Sensitivity analyses comparing fixed- and random-effects models for each of these outcomes yielded consistent results, supporting the stability of the pooled effect estimates (see Figures S1-S6 in ).
| Outcome | Patients (N=1817), n (%) | Studies (N=17), n (%) | Effect size (95% CI) | P value | Statistical heterogeneity, P value, I² | Effect model |
| Quality of life | 503 (27.7) | 4 (23.5) [,-,] | SMDb=0.35 (0.03 to 0.67) | .03 | .09, 53% | Random |
| Physical activity | 118 (6.5) | 2 (11.8), 3 comparisons [,] | MDc=1.44 (0.36 to 2.52) | <.01 | .26, 25% | Fixed |
| Pain | 120 (6.6) | 2 (11.8), 3 comparisons [,] | MD=–0.91 (–1.44 to –0.38) | <.01 | .59, 0% | Fixed |
| Anxiety | 595 (32.7) | 4 (23.5), 5 comparisons [,,,,] | SMD=–0.19 (–0.35 to –0.02) | .02 | .60, 0% | Fixed |
| Depression | 138 (7.6) | 2 (11.8) [,,] | SMD=–0.07 (–0.42 to 0.27) | .68 | .65, 0% | Fixed |
| Psychological distress | 139 (7.6) | 3 (17.6) [,,] | SMD=–0.33 (–0.66 to 0.01) | .06 | .34, 9% | Fixed |
aCA: conversational agent.
bSMD: standardized mean difference.
cMD: mean difference.

Quality of Life
Quality of life was measured in 4 (23.5%) studies involving 503 (27.7%) participants (A and ) [,,,]. Moderate heterogeneity was observed among the included studies (I²=53%, P=.09), supporting the use of a random-effects model. The pooled analysis showed statistically significant improvement in quality of life in the intervention group compared to the control group (standardized mean difference [SMD]=0.35, 95% CI 0.03-0.67, P=.03).
Physical Activity
Physical activity was measured using the step count in 2 (11.8%) studies involving 118 (6.5%) participants (B and ) [,]. One study with a 3-arm design contributed 2 independent comparisons: a text-based CA intervention and a voice-based CA intervention, each compared with the usual care group. The pooled analysis demonstrated a statistically significant improvement in physical activity in the intervention group compared to the control group (mean difference [MD]=1.44, 95% CI 0.36-2.52, P<.01). These studies showed low heterogeneity (I2=25%, P=.26).
Pain
Pain was measured in 2 (11.8%) studies involving 120 (6.6%) participants [,]. In 1 (50%) study, outcomes at 2 timepoints were included as separate entries in the meta-analysis, with assessments conducted at the midpoint (4 weeks) and the endpoint (8 weeks) of the intervention []. The results indicated no significant heterogeneity among the included studies (I2=0%, P=.59), supporting the use of a fixed-effects model. The pooled analysis demonstrated a statistically significant reduction in pain scores in the intervention group compared to the control group (MD=–0.91, 95% CI –1.44 to –0.38, P<.01). These findings suggest that the CA intervention was effective in reducing the pain of patients with cancer (C and ).
Anxiety
Anxiety was assessed in 4 (23.5%) studies involving a total of 595 (32.7%) participants [,,,]. One study with a 4-arm design contributed 2 independent comparisons: CA + paper versus paper, and CA + video versus video []. The meta-analysis showed no heterogeneity among the included trials (I²=0%, P=.60), supporting the use of a fixed-effects model. The pooled results demonstrated a statistically significant reduction in anxiety levels in the intervention group compared to the control group (SMD=–0.19, 95% CI –0.35 to –0.02, P=.02). These findings suggest that the CA intervention was effective in alleviating anxiety (D and ).
Depression
Depression was measured in 2 (11.8%) studies involving 138 (7.6%) participants (E and ) [,]. The pooled analysis demonstrated a nonsignificant SMD of –0.07 (95% CI –0.42 to 0.27, P=.68), with no heterogeneity among the included trials (I²=0%, P=.65).
Psychological Distress
Psychological distress was measured in 3 (17.6%) studies involving 139 (7.6%) participants (F and ) [,,]. The pooled analysis demonstrated a nonsignificant SMD of –0.33 (95% CI –0.66 to 0.01, P=.06), with low heterogeneity among the included trials (I²=9%, P=.34).
Effectiveness of CA Interventions: Narrative Synthesis
Health Information Acquisition
Health information acquisition was reported in 4 (57.1%) of the 17 studies, all of which used short-term designs and collectively included all 3 single-session interventions identified in this review. Due to heterogeneity in outcome measures, a meta-analysis was not feasible; therefore, a narrative synthesis was conducted. Across these studies, Bickmore et al [] found that CAs enhance access to clinical trial information among patients with cancer with limited health literacy. Bibault et al [] and Al-Hilli et al [] further demonstrated that CAs effectively enhance patients’ understanding of clinical information by providing accurate and specific content. Baumgärtner et al [] reported a significant reduction in unmet informational needs among patients receiving CA intervention. Despite their short duration, these CA interventions demonstrated the potential to effectively improve patients’ acquisition of health information.
Treatment-Related Side Effects
Of the 17 studies, 3 (17.6%) examined CA interventions designed to support patients in managing side effects associated with cancer treatments, including oral oncolytic agents, chemotherapy, and surgery. Due to heterogeneity in outcome measures and definitions, a narrative synthesis was conducted. Queiroz et al [] found that CAs facilitate effective self-monitoring of treatment-related side effects. Tawfik et al [] reported improvements in self-management capacity for chemotherapy-induced symptoms. Sikorskii et al [] further demonstrated a notable reduction in symptom severity among patients taking oral oncolytic agents. These findings indicate that CAs may serve as a valuable tool to empower patients in alleviating side effects throughout the cancer care continuum.
Summary of Quality Assessment and Risk of Bias
The results of risk-of-bias assessments of the 17 studies are reported in Figures S7 and S8 and Table S5 in . Of the 12 (70.6%) RCTs, 3 (25%) had an overall low risk of bias [-], 4 (33.3%) exhibited a high risk of bias [,,,], and another 5 (41.7%) raised some concerns [,,-]. Common methodological issues included inadequate randomization, deviations from intended interventions, and bias due to missing outcome data or outcome measurement. In addition, 5 (41.7%) quasi-experimental studies met 7-8 out of 9 quality assessment criteria, indicating relatively good methodological quality [,,,,]. Common limitations among these studies included the absence of control groups, limited follow-up reporting, and insufficient pre- and postintervention outcome measurement.
Discussion
Principal Findings
This systematic review is the first to comprehensively evaluate the effectiveness and feasibility of CA interventions in supporting care for patients with cancer. In total, 17 trials involving 1817 participants were included, comprising 12 (70.6%) RCTs and 5 (29.4%) NRIs. Of these, 10 (58.8%) studies were included in a meta-analysis. The meta-analysis findings indicated that CA interventions are effective in promoting physical activity, reducing pain and anxiety, and improving the quality of life among patients with cancer. However, no significant effects were observed for depression or psychological distress. Narrative synthesis suggested that CAs have the potential to enhance health information acquisition and help manage treatment-related side effects. Furthermore, CAs were generally found to be feasible, acceptable, and usable among patients with cancer, particularly during the initial phase of use. However, user engagement tended to decline over time, underscoring the need for strategies to sustain long-term use. These results support the potential integration of CAs into broader cancer care frameworks and inform future direction for optimizing CA design to enhance sustained engagement and clinical effectiveness.
Comparison With Prior Work
Prompt sequencing and interaction modes are central to the design and delivery of CA interventions, shaping how users engage with the CA system and receive tailored support. Across the included studies, CAs adopted user-initiated inputs [,,,], system-initiated prompts [,,,], or a combination of both approaches [,,,,-,,]. The sequencing of prompts generally aligned with the CA’s intended function and interaction style: user-initiated prompts supported on-demand health queries or symptom check-ins; system-initiated prompts provided scheduled reminders or symptom-triggered alerts; and hybrid models combined both to enhance adherence and engagement []. Upon receiving input, CAs typically conducted a needs assessment and generated personalized responses via rule-based scripts [,,,,], AI-driven dialogue generation [,,,,], or both approaches [,,,-], depending on the design and operational mechanisms of the system. The response content of CAs was largely grounded in authoritative sources, including clinical guidelines, health websites, expert consensus, peer-reviewed literature, and evidence-based QnA libraries developed by study teams. Cloud-based platforms were also integrated in some studies to deliver responses from predefined knowledge bases, improving response consistency and system scalability [,,,,,]. Handover mechanisms were incorporated in only one-third of interventions, enabling escalation to clinical staff in high-risk situations to ensure safety and responsiveness [,,,,,]. Additionally, most interventions adopted mobile- or web-based interfaces, though only a third incorporated visual elements (eg, icons, avatars) [,,,,].
This review supports the effectiveness of CAs in cancer care, particularly in promoting physical activity and alleviating symptoms, such as pain, treatment side effects, and anxiety, thereby enhancing patients’ overall quality of life. Although prior studies in chronic disease populations have also reported favorable outcomes with CAs, many of those reviews focused on relatively narrow objectives—such as physical activity or weight management—with limited attention to multidimensional outcomes [,]. In contrast, the cancer-specific CAs included in this review demonstrated broader functionality, addressing not only physical activity but also a wide range of symptoms and overall quality of life. This broader scope and effectiveness may be attributed to the interactive and tailored content of CAs, which aligns well with the complex and evolving needs of individuals undergoing cancer treatment []. Specifically, reductions in the symptom burden—such as pain, anxiety, and treatment-related side effects—highlight the potential of CAs to deliver symptom-specific education and practical self-care strategies []. Improvements in physical activity observed across studies may reflect the effectiveness of CA features, such as timely reminders, individualized feedback, and structured goal setting []. These benefits are particularly important in oncology, where preserving physical function and controlling symptoms are critical to maintaining quality of life []. Collectively, these features position CAs as a promising adjunct to conventional oncology care, with the potential to improve multidimensional health-related outcomes and enhance supportive care delivery.
However, the nonsignificant effect of CAs on depression and psychological distress requires further exploration. Both conditions represent complex, multifaceted, and long-lasting psychological challenges that require deeper emotional empathy and theoretically grounded interventions []. Although current CA systems may help alleviate transient psychological states, such as anxiety, by delivering multidimensional informational support, they remain inadequate for more complex and persistent psychological demands []. This inadequacy is largely attributable to the structural limitations of most existing CAs, which are predominantly rule based or minimally AI enhanced []. As a result, they are unable to recognize complex psychological states, respond empathetically, or support deep emotional regulation []. Notably, recent studies in other fields suggest that AI-powered CAs based on large language models (LLMs) may even outperform physicians in perceived empathy—possibly due to their advanced attention mechanisms, emotionally rich training data, and context-aware communication design []. Although these advances have not yet been applied in cancer care CAs, their integration holds promise. Future CA systems could incorporate emerging technologies, such as AI and LLMs, to enhance empathic communication and provide more comprehensive psychological support [].
Narrative synthesis revealed that studies using single-session CA interventions primarily reported immediate informational benefits, specifically enhanced health information acquisition. In contrast, sustained health improvements—such as increased physical activity, better symptom management, and enhanced quality of life—were typically observed in studies that implemented multisession or extended-duration interventions lasting from 4 weeks to 6 months. These findings suggest that although brief, single-session interventions may suffice for improving informational outcomes, more sustained exposure—of at least 4 weeks—may be necessary to achieve meaningful improvements in sustained health benefits []. However, due to the limited number of studies per outcome and inconsistencies in intervention duration reporting, the optimal intervention duration could not be formally examined. Future research should investigate the relationship between intervention duration and effectiveness to determine the minimum exposure required for achieving sustained benefits across different domains of cancer care [].
CAs were generally found to be safe, feasible, acceptable, and usable. However, suboptimal long-term engagement remains a critical challenge, underscoring the need for strategies to sustain continued use. This trend is consistent with findings from prior CA interventions in chronic disease management, where high attrition rates were commonly reported despite favorable early-phase implementation outcomes [,]. Several design features may have hindered this sustained engagement. Notably, two-thirds of included CAs were fully automated and lacked handover mechanisms to escalate complex or high-risk queries to clinical personnel, potentially weakening users’ sense of accountability and connection, thereby undermining long-term engagement []. In addition, only one-third incorporated visual elements (eg, avatars, icons), which may have limited interactivity and reduced the overall appeal of the interface []. Moreover, most CAs relied on narrow content databases, leading to repetitive information and diminishing novelty over time []. To address these issues, future CA interventions should embed health care professionals into the interaction loop, incorporate emotionally engaging visuals, and leverage advanced technologies—such as LLM-based interfaces and enriched data sources—to provide diverse, adaptive content that evolves with user needs []. These enhancements may improve long-term engagement, build trust, and strengthen the effectiveness and scalability of CA-based interventions [].
Strength and Limitations
This study presented the first systematic review and meta-analysis to comprehensively synthesize the characteristics, feasibility, and effectiveness of CA-based interventions in supporting care for patients with cancer, providing an up-to-date synthesis of the current evidence and inform future clinical practice and intervention development.
However, several limitations should be acknowledged. First, this review included only peer-reviewed studies published in English, which may have introduced publication and language bias (results are presented in Figures S9-S14 in ). This decision was made to ensure methodological rigor and consistency in reporting quality. However, relevant studies in other languages or nonindexed sources may have been overlooked. Future reviews could address this by incorporating non-English and gray literature to improve inclusiveness. Second, heterogeneity in intervention characteristics, such as intervention duration, technology algorithms, and delivery modality, may have introduced variability that warrants cautious interpretation. Given the limited number of eligible trials, formal subgroup analyses were not feasible; therefore, we interpreted outcome variability narratively based on key intervention features. Future meta-analyses with a larger evidence base may facilitate subgroup analyses to examine their potential moderating effects. Third, most of the included studies had relatively small sample sizes, and some were NRIs, which may limit the generalizability of the findings. This limitation reflects the early, exploratory stage of research on CA interventions in cancer care. To mitigate this, we conducted both meta-analysis and narrative synthesis to comprehensively integrate the current evidence on effectiveness. In addition, a separate narrative synthesis of implementation outcomes was performed to capture early insights into CA delivery and contextual applicability. Future well-designed RCTs with larger sample sizes are warranted to enhance the robustness of the evidence base and support broader clinical implementation.
Future Direction
Overall, CA interventions address persistent service gaps in both clinical and community settings. However, their successful integration into routine care requires systematic consideration of diverse aspects. One key aspect identified in our review is the limited application of theoretical frameworks to guide the design, delivery, and evaluation of CA interventions. Future interventions should be guided by established theoretical frameworks to ensure coherent design, targeted content delivery, and meaningful outcome evaluation []. In particular, stage-based models, such as the Transtheoretical Model (TTM), which links intervention strategies and outcomes to users’ readiness for change (eg, precontemplation, preparation, action, maintenance), may offer added practical value []. Applying such models could help clarify how CAs influence patient outcomes, address individual needs, and reinforce the scientific foundation of future CA interventions [].
Second, future CA development should build on existing technologies by incorporating more flexible, adaptive, and emotionally responsive capabilities. Many current systems remain limited by predefined scripts or limited AI functionality, restricting their ability to understand context or convey empathy. Integrating advanced techniques, such as LLMs, may improve natural language understanding and emotional responsiveness, leading to greater personalization and sustained user interaction []. Additionally, embedding health care professionals into the interaction loop and incorporating multimodal communication features may further enhance safety, accessibility, user trust, and long-term engagement [].
Third, the successful application of general-purpose health CAs in oncology care highlights their potential for cross-condition adaptability and broader applicability across health care contexts. Although most CAs included in this review were specifically designed for oncology, several originally developed for general health conditions have also demonstrated effectiveness when adapted for oncology care [,,]. This adaptability indicates that well-designed CAs may be transferable across clinical conditions, suggesting valuable directions for future development. Clinically, repurposing CAs may reduce development costs and expedite deployment in supportive care of different diseases []. This also supports the development of CA systems with reusable modules, where disease-specific content can be plugged into a standardized infrastructure []. This approach enables continuity of care across comorbid conditions by centralizing supportive functions in a unified CA platform. To support broader implementation, future research should examine how contextual adaptations (eg, population, disease, setting) affect CA effectiveness and user experience [].
Conclusion
This systematic review is the first to comprehensively evaluate CA interventions in supporting care for patients with cancer, with preliminary evidence supporting their feasibility and effectiveness. However, the limited psychological benefits and suboptimal long-term user engagement indicate the need for further refinement. Future research should adopt theory-driven approaches and explore the integration of emerging technologies to enhance personalization, empathy, and sustained engagement in CA interventions. Moreover, rigorous study designs—particularly large-scale RCTs—are needed to evaluate the effectiveness and implementation processes of CAs in cancer care.
Acknowledgments
The authors thank Dr Changhao Li of the School of Computer Science, Sun Yat-sen University, for his valuable suggestions on this study. No funding was received to support this research.
Data Availability
All data generated or analyzed during this study are included in this published paper and its supplementary information files.
Authors' Contributions
XHJ and JSP conceptualized and designed this study, drafted the initial paper, and coordinated and supervised the data selection and collection. XHY and JSP conducted data selection and provided feedback to this paper. XHJ performed statistical analysis and contributed to the interpretation of the results. XHJ and HZ contributed to the results’ interpretation and reviewed this paper. All authors have approved the final manuscript and agreed to be accountable for all aspects of the work.
Conflicts of Interest
None declared.
Summary tables, figures, and supplementary information.
DOCX File , 1003 KBReferences
- Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. Apr 04, 2024;74(3):229-263. [FREE Full text] [CrossRef] [Medline]
- Williams K, Jackson SE, Beeken RJ, Steptoe A, Wardle J. The impact of a cancer diagnosis on health and well-being: a prospective, population-based study. Psychooncology. Jul 2016;25(6):626-632. [FREE Full text] [CrossRef] [Medline]
- Barlow J, Wright C, Sheasby J, Turner A, Hainsworth J. Self-management approaches for people with chronic conditions: a review. Patient Educ Couns. 2002;48(2):177-187. [FREE Full text] [CrossRef] [Medline]
- Rimmer B, Brown MC, Sotire T, Beyer F, Bolnykh I, Balla M, et al. Characteristics and components of self-management interventions for improving quality of life in cancer survivors: a systematic review. Cancers (Basel). Dec 19, 2023;16(1):14. [FREE Full text] [CrossRef] [Medline]
- Lee K, Kim S, Kim SH, Yoo S, Sung JH, Oh EG, et al. Digital health interventions for adult patients with cancer evaluated in randomized controlled trials: scoping review. J Med Internet Res. Jan 06, 2023;25:e38333. [FREE Full text] [CrossRef] [Medline]
- Baumgärtner K, Byczkowski M, Schmid T, Muschko M, Woessner P, Gerlach A, et al. Effectiveness of the medical chatbot PROSCA to inform patients about prostate cancer: results of a randomized controlled trial. Eur Urol Open Sci. Dec 2024;69:80-88. [FREE Full text] [CrossRef] [Medline]
- Laranjo L, Dunn AG, Tong HL, Kocaballi AB, Chen J, Bashir R, et al. Conversational agents in healthcare: a systematic review. J Am Med Inform Assoc. Oct 01, 2018;25(9):1248-1258. [FREE Full text] [CrossRef] [Medline]
- Tudor Car L, Dhinagaran DA, Kyaw BM, Kowatsch T, Joty S, Theng Y, et al. Conversational agents in health care: scoping review and conceptual analysis. J Med Internet Res. Aug 07, 2020;22(8):e17158. [FREE Full text] [CrossRef] [Medline]
- Adamopoulou E, Moussiades L. Chatbots: history, technology, and applications. Mach Learn Appl. Dec 2020;2:100006. [FREE Full text] [CrossRef]
- Wang A, Qian Z, Briggs L, Cole AP, Reis LO, Trinh Q. The use of chatbots in oncological care: a narrative review. Int J Gen Med. May 2023;16:1591-1602. [FREE Full text] [CrossRef]
- Tawfik E, Ghallab E, Moustafa A. A nurse versus a chatbot ‒ the effect of an empowerment program on chemotherapy-related side effects and the self-care behaviors of women living with breast cancer: a randomized controlled trial. BMC Nurs. May 06, 2023;22(1):102. [FREE Full text] [CrossRef] [Medline]
- Greer S, Ramo D, Chang Y, Fu M, Moskowitz J, Haritatos J. Use of the chatbot "Vivibot" to deliver positive psychology skills and promote well-being among young people after cancer treatment: randomized controlled feasibility trial. JMIR Mhealth Uhealth. Oct 31, 2019;7(10):e15018. [FREE Full text] [CrossRef] [Medline]
- MacNeill AL, MacNeill L, Luke A, Doucet S. Health professionals' views on the use of conversational agents for health care: qualitative descriptive study. J Med Internet Res. Oct 25, 2024;26:e49387. [FREE Full text] [CrossRef] [Medline]
- Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intell Med. May 2025;5(2):151-163. [FREE Full text] [CrossRef]
- Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. Mar 29, 2021;372:n71. [FREE Full text] [CrossRef] [Medline]
- Sterne JAC, Savović J, Page MJ, Elbers RG, Blencowe NS, Boutron I, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. Aug 28, 2019;366:l4898. [FREE Full text] [CrossRef] [Medline]
- Barker TH, Habibi N, Aromataris E, Stone JC, Leonardi-Bee J, Sears K, et al. The revised JBI critical appraisal tool for the assessment of risk of bias for quasi-experimental studies. JBI Evid Synth. Mar 01, 2024;22(3):378-388. [FREE Full text] [CrossRef] [Medline]
- Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, et al. Cochrane Handbook for Systematic Reviews of Interventions. Chichester, UK. John Wiley & Sons; 2019.
- Higgins JPT, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. Oct 06, 2003;327(7414):557-560. [FREE Full text] [CrossRef] [Medline]
- Schmitz KH, Kanski B, Gordon B, Caru M, Vasakar M, Truica CI, et al. Technology-based supportive care for metastatic breast cancer patients. Support Care Cancer. Jul 20, 2023;31(7):401. [CrossRef] [Medline]
- Sikorskii A, Given CW, Given BA, Vachon E, Krauss JC, Rosenzweig M, et al. An automated intervention did not improve adherence to oral oncolytic agents while managing symptoms: results from a two-arm randomized controlled trial. J Pain Symptom Manage. Dec 2018;56(5):727-735. [FREE Full text] [CrossRef] [Medline]
- Hassoon A, Baig Y, Naiman DQ, Celentano DD, Lansey D, Stearns V, et al. Randomized trial of two artificial intelligence coaching interventions to increase physical activity in cancer survivors. NPJ Digit Med. Dec 09, 2021;4(1):168. [FREE Full text] [CrossRef] [Medline]
- Maguire R, McCann L, Kotronoulas G, Kearney N, Ream E, Armes J, et al. Real time remote symptom monitoring during chemotherapy for cancer: European multicentre randomised controlled trial (eSMART). BMJ. Jul 21, 2021;374:n1647. [FREE Full text] [CrossRef] [Medline]
- Lee J, Byun HK, Kim YT, Shin J, Kim YB. A study on breast cancer patient care using chatbot and video education for radiation therapy: a randomized controlled trial. Int J Radiat Oncol Biol Phys. May 01, 2025;122(1):84-92. [FREE Full text] [CrossRef] [Medline]
- Horesh D, Kohavi S, Shilony-Nalaboff L, Rudich N, Greenman D, Feuerstein JS, et al. Virtual reality combined with artificial intelligence (VR-AI) reduces hot flashes and improves psychological well-being in women with breast and ovarian cancer: a pilot study. Healthcare (Basel). Dec 11, 2022;10(11):2261. [FREE Full text] [CrossRef] [Medline]
- Mlakar I, Lin S, Aleksandraviča I, Arcimoviča K, Eglītis J, Leja M, et al. Patients-centered SurvivorShIp care plan after cancer treatments based on big data and artificial intelligence technologies (PERSIST): a multicenter study protocol to evaluate efficacy of digital tools supporting cancer survivors. BMC Med Inform Decis Mak. Aug 14, 2021;21(1):243. [FREE Full text] [CrossRef] [Medline]
- Bema D. Full clinical study validation results of PERSIST. Zenodo. 2023. URL: https://zenodo.org/records/8232106 [accessed 2025-07-31]
- Bibault J, Chaix B, Guillemassé A, Cousin S, Escande A, Perrin M, et al. A chatbot versus physicians to provide information for patients with breast cancer: blind, randomized controlled noninferiority trial. J Med Internet Res. Dec 27, 2019;21(11):e15787. [FREE Full text] [CrossRef] [Medline]
- Al-Hilli Z, Noss R, Dickard J, Wei W, Chichura A, Wu V, et al. A randomized trial comparing the effectiveness of pre-test genetic counseling using an artificial intelligence automated chatbot and traditional in-person genetic counseling in women newly diagnosed with breast cancer. Ann Surg Oncol. Oct 2023;30(10):5990-5996. [FREE Full text] [CrossRef] [Medline]
- Kamdar M, Jethwani K, Centi AJ, Agboola S, Fischer N, Traeger L, et al. A digital therapeutic application (ePAL) to manage pain in patients with advanced cancer: a randomized controlled trial. J Pain Symptom Manage. Oct 2024;68(3):261-271. [FREE Full text] [CrossRef] [Medline]
- Albino de Queiroz D, Silva Passarello R, Veloso de Moura Fé V, Rossini A, Folchini da Silveira E, Aparecida Isquierdo Fonseca de Queiroz E, et al. A wearable chatbot-based model for monitoring colorectal cancer patients in the active phase of treatment. Healthcare Anal. Dec 2023;4:100257. [FREE Full text] [CrossRef]
- Bickmore TW, Utami D, Matsuyama R, Paasche-Orlow MK. Improving access to online health information with conversational agents: a randomized controlled experiment. J Med Internet Res. Jan 04, 2016;18(1):e1. [FREE Full text] [CrossRef] [Medline]
- Gomaa S, Posey J, Bashir B, Basu Mallick A, Vanderklok E, Schnoll M, et al. Feasibility of a text messaging-integrated and chatbot-interfaced self-management program for symptom control in patients with gastrointestinal cancer undergoing chemotherapy: pilot mixed methods study. JMIR Form Res. Dec 10, 2023;7:e46128. [FREE Full text] [CrossRef] [Medline]
- Caru M, Abdullah S, Qiu L, Kanski B, Gordon B, Truica CI, et al. Women with metastatic breast cancer don't just follow step-count trends, they exceed them: an exploratory study. Breast Cancer Res Treat. Jul 2023;200(2):265-270. [FREE Full text] [CrossRef] [Medline]
- Noh E, Won J, Jo S, Hahm D, Lee H. Conversational agents for body weight management: systematic review. J Med Internet Res. May 26, 2023;25:e42238. [FREE Full text] [CrossRef] [Medline]
- Wu Y, Zhang J, Ge P, Duan T, Zhou J, Wu Y, et al. Application of chatbots to help patients self-manage diabetes: systematic review and meta-analysis. J Med Internet Res. Dec 03, 2024;26:e60380. [FREE Full text] [CrossRef] [Medline]
- Yang Q, Cheung K, Zhang Y, Zhang Y, Qin J, Xie YJ. Conversational agents in physical and psychological symptom management: a systematic review of randomized controlled trials. Int J Nurs Stud. Mar 2025;163:104991. [FREE Full text] [CrossRef] [Medline]
- Oh YJ, Zhang J, Fang M, Fukuoka Y. A systematic review of artificial intelligence chatbots for promoting physical activity, healthy diet, and weight loss. Int J Behav Nutr Phys Act. Dec 11, 2021;18(1):160. [FREE Full text] [CrossRef] [Medline]
- Ahmed A, Hassan A, Aziz S, Abd-Alrazaq AA, Ali N, Alzubaidi M, et al. Chatbot features for anxiety and depression: a scoping review. Health Inf J. Jan 24, 2023;29(1):14604582221146719. [FREE Full text] [CrossRef] [Medline]
- Bickmore TW, Picard RW. Establishing and maintaining long-term human-computer relationships. ACM Trans Comput-Hum Interact. Jun 01, 2005;12(2):293-327. [FREE Full text] [CrossRef]
- Chen D, Chauhan K, Parsa R, Liu ZA, Liu F, Mak E, et al. Patient perceptions of empathy in physician and artificial intelligence chatbot responses to patient questions about cancer. NPJ Digit Med. Jul 21, 2025;8(1):275. [FREE Full text] [CrossRef] [Medline]
- Wutz M, Hermes M, Winter V, Köberlein-Neu J. Factors influencing the acceptability, acceptance, and adoption of conversational agents in health care: integrative review. J Med Internet Res. Oct 26, 2023;25:e46548. [FREE Full text] [CrossRef] [Medline]
- Aggarwal A, Tam CC, Wu D, Li X, Qiao S. Artificial intelligence–based chatbots for promoting health behavioral changes: systematic review. J Med Internet Res. Mar 24, 2023;25:e40789. [FREE Full text] [CrossRef] [Medline]
- Jabir AI, Lin X, Martinengo L, Sharp G, Theng Y, Tudor Car L. Attrition in conversational agent-delivered mental health interventions: systematic review and meta-analysis. J Med Internet Res. Mar 27, 2024;26:e48168. [FREE Full text] [CrossRef] [Medline]
- Yu K, Gorbachev G, Eck U, Pankratz F, Navab N, Roth D. Avatars for teleconsultation: effects of avatar embodiment techniques on user perception in 3D asymmetric telepresence. IEEE Trans Vis Comput Graph. Dec 2021;27(11):4129-4139. [FREE Full text] [CrossRef] [Medline]
- Bickmore T, Giorgino T. Health dialog systems for patients and consumers. J Biomed Inform. Oct 2006;39(5):556-571. [FREE Full text] [CrossRef] [Medline]
- Martinengo L, Jabir AI, Goh WWT, Lo NYW, Ho MR, Kowatsch T, et al. Conversational agents in health care: scoping review of their behavior change techniques and underpinning theory. J Med Internet Res. Oct 03, 2022;24(10):e39243. [FREE Full text] [CrossRef] [Medline]
- Sun X, de WJ, Li Z, Pei J, El AA, Bosch J. Script-strategy aligned generation: aligning LLMs with expert-crafted dialogue scripts and therapeutic strategies for psychotherapy. URL: https://arxiv.org/abs/2411.06723 [accessed 2025-07-31]
- Tumuhimbise W, Theuring S, Kaggwa F, Atukunda EC, Rubaihayo J, Atwine D, et al. Enhancing the implementation and integration of mHealth interventions in resource-limited settings: a scoping review. Implement Sci. Oct 14, 2024;19(1):72. [FREE Full text] [CrossRef] [Medline]
Abbreviations
| AI: artificial intelligence |
| CA: conversational agent |
| JBI: Joanna Briggs Institute |
| LLM: large language model |
| MD: mean difference |
| mHealth: mobile health |
| PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
| QnA: question and answer |
| SMD: standardized mean difference |
| NRI: nonrandomized intervention |
| RCT: randomized controlled trial |
Edited by A Schwartz; submitted 05.05.25; peer-reviewed by D Chrimes, A Martins; comments to author 09.06.25; revised version received 12.07.25; accepted 30.07.25; published 08.08.25.
Copyright©Xiao-han Jiang, Xiu-hong Yuan, Hui Zhao, Jun-sheng Peng. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 08.08.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

