Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Journal Description

The Journal of Medical Internet Research (JMIR) is the pioneer open access eHealth journal, and is the flagship journal of JMIR Publications. It is a leading health services and digital health journal globally in terms of quality/visibility (Journal Impact Factor 6.0, Journal Citation Reports 2025 from Clarivate), ranking Q1 in both the 'Medical Informatics' and 'Health Care Sciences & Services' categories, and is also the largest journal in the field. The journal is ranked #1 on Google Scholar in the 'Medical Informatics' discipline. The journal focuses on emerging technologies, medical devices, apps, engineering, telehealth and informatics applications for patient education, prevention, population health and clinical care.

JMIR is indexed in all major literature indices including National Library of Medicine(NLM)/MEDLINE, Sherpa/Romeo, PubMed, PMCScopus, Psycinfo, Clarivate (which includes Web of Science (WoS)/ESCI/SCIE), EBSCO/EBSCO Essentials, DOAJ, GoOA and others. Journal of Medical Internet Research received a Scopus CiteScore of 11.7 (2024), placing it in the 92nd percentile (#12 of 153) as a Q1 journal in the field of Health Informatics. It is a selective journal complemented by almost 30 specialty JMIR sister journals, which have a broader scope, and which together receive over 10,000 submissions a year. 

As an open access journal, we are read by clinicians, allied health professionals, informal caregivers, and patients alike, and have (as with all JMIR journals) a focus on readable and applied science reporting the design and evaluation of health innovations and emerging technologies. We publish original research, viewpoints, and reviews (both literature reviews and medical device/technology/app reviews). Peer-review reports are portable across JMIR journals and papers can be transferred, so authors save time by not having to resubmit a paper to a different journal but can simply transfer it between journals. 

We are also a leader in participatory and open science approaches, and offer the option to publish new submissions immediately as preprints, which receive DOIs for immediate citation (eg, in grant proposals), and for open peer-review purposes. We also invite patients to participate (eg, as peer-reviewers) and have patient representatives on editorial boards.

As all JMIR journals, the journal encourages Open Science principles and strongly encourages publication of a protocol before data collection. Authors who have published a protocol in JMIR Research Protocols get a discount of 20% on the Article Processing Fee when publishing a subsequent results paper in any JMIR journal.

Be a widely cited leader in the digital health revolution and submit your paper today!

 

Recent Articles:

  • Source: Image created by the authors/Pexels; Copyright: The Authors/Pexels; URL: https://www.jmir.org/2026/1/e89428/; License: Creative Commons Attribution + Noncommercial + NoDerivatives (CC-BY-NC-ND).

    Digital Patient Decision Aid for Antiobesity Medications: Mixed Methods Study of Human-Centered Design and Usability Evaluation

    Abstract:

    Background: The global burden of obesity continues to rise, highlighting the need for patient-centered approaches to weight management. Shared decision-making is particularly important in the selection of antiobesity medications (AOMs), as treatment options differ in mechanism, effectiveness, side effects, routes of administration, and cost. Despite this preference-sensitive context, only a few patient decision aids (PDAs) have been culturally and clinically adapted for use in Asian populations. Objective: This study aims to design, develop, and evaluate a digital PDA, OptiWeight, to support shared decision-making for AOM selection, incorporating perspectives from health care professionals and patients. Methods: This mixed methods, multicenter study, conducted between August 2022 and November 2025, applied a 4-stage human-centered design process. An evidence-informed prototype was developed based on clinical guidelines, followed by 2 rounds of usability testing using think-aloud protocols to assess navigation structures, perceived usability (System Usability Scale [SUS]), and cognitive workload (NASA Task Load Index [NASA-TLX]). Semistructured interviews with health care professionals specializing in weight management, guided by the Consolidated Framework for Implementation Research, informed clinical implementation and workflow integration. Finally, patients with overweight or obesity evaluated usability, cognitive workload, and overall user experience in outpatient settings. Qualitative data were analyzed using content analysis, and 1-way analysis of variance examined changes in usability and workload across stages. Results: A total of 174 individuals were included across all study stages (usability testing among adults: n=78; health care professional interviews: n=18; and clinical evaluation among patients: n=78). Iterative usability testing comparing system- and user-controlled navigation structures revealed complementary strengths and limitations, leading to the adoption of a hybrid navigation structure supporting both sequential guidance and flexible comparison. Additional design requirements included the use of icon arrays to enhance risk comprehension and localization features such as treatment cost displays and clarification of socially impactful side effects. Perceived usability increased from initial testing to clinical evaluation (SUS: 60.53-73.65, P<.001), meeting good usability thresholds, while cognitive workload decreased (NASA-TLX: 40.35-16.69, P<.001). Conclusions: Through a systematic human-centered design process integrating health care professional and patient perspectives, OptiWeight addresses the lack of culturally adapted PDAs for AOM decision-making in Mandarin-speaking populations while capturing user needs—particularly regarding navigation flexibility and risk visualization. The final tool demonstrated good usability and feasibility, and workflow considerations suggest potential for integration into routine weight-management care. Further research is needed to evaluate its impact on decision quality and real-world implementation outcomes.

  • Source: Freepik; Copyright: Freepik; URL: https://www.freepik.com/free-photo/arabic-woman-teaching-senior-man-use-smartwatch-with-smartphone_25213026.htm; License: Licensed by JMIR.

    Assessing the Use of Wearable Mobile-Monitoring Devices Among Individuals With Serious Mental Illness: Qualitative Acceptability and Feasibility Study

    Abstract:

    Background: Serious mental illness (SMI) is difficult to treat for various reasons, such as rapid changes in symptoms, comorbid health conditions, long gaps between provider visits, and additional societal barriers experienced by this population. Wearable mobile-sensing devices can be used to passively collect valuable patient-generated health data, such as daily step count, heart rate variability, sleep information, and other health-related behaviors, which could inform and improve treatment for individuals with SMI. Wearable health devices have become more economically accessible, providing promise for the possibility of their implementation in health care. However, more information regarding how individuals with SMI perceive and interact with these devices is needed. Objective: This study aimed to assess the acceptability and feasibility of using wearable mobile-sensing devices to improve treatment outcomes for Veterans with SMI. In addition, we were also interested in learning if privacy concerns would influence acceptability of devices, specifically surrounding location tracking and health information sharing, as well as assessing other barriers to device use. Methods: Qualitative interviews were conducted with participants who had been using a wearable health and fitness tracker for at least 2 weeks to explore their thoughts and perceptions of these devices. A total of 15 Veterans diagnosed with a SMI participated in interviews. Both thematic analysis and rapid qualitative analysis approaches were used to uncover findings in key domains and emergent themes. Results: Wearable fitness trackers allowed participants to conveniently monitor various aspects of their physical and mental health, provided a greater understanding of their overall well-being, and motivated them to reach personal health goals. Individuals were open to sharing their personal health information collected from the devices with providers to improve their health care treatment and expressed no privacy concerns surrounding data tracking or the device’s global positioning system that monitors physical location. Participants experienced some technological challenges with using the fitness trackers, as well as the device’s accompanying cell phone app. Furthermore, participants expressed difficulties in understanding and interpreting the health data that was collected from the health and fitness trackers. Greater ongoing technological support, in addition to physical device adjustments to enhance comfort and usability, were suggested ways of improving overall user experience. Conclusions: Participants with SMI in this sample were accepting of wearable mobile-monitoring devices and believe it is feasible to incorporate these fitness trackers into their daily lives. Furthermore, participants in this sample expressed no privacy concerns regarding location tracking or the sharing of health information collected from these devices with providers. Patient-generated health data collected from these devices may offer valuable information that could be used to inform health care treatment for this population.

  • Source: freepik; Copyright: DC Studio via freepik; URL: https://www.freepik.com/free-photo/stomatological-assistant-typing-computer-keyboard_18509330.htm; License: Licensed by JMIR.

    Concerns of Using Large Language Models in Health Care Research and Practice: Umbrella Review

    Abstract:

    Background: Large language models (LLMs), such as ChatGPT (OpenAI), are rapidly evolving, and their applications in health care are increasing. There is a growing demand for automation of routine tasks and a drive to use LLMs or similar to support research. Objective: This umbrella review examines concerns of health care professionals and researchers related to the use of LLMs in health care research and practice. We aimed to identify common issues raised and the implications for patient care, policy, and practice. Methods: A protocol was registered on PROSPERO (CRD420250640997). Searches were conducted in 7 databases (Ovid MEDLINE, Ovid Embase, Scopus, Web of Science, JBI Database of Systematic Reviews and Implementation Reports, Cochrane Database of Systematic Reviews, and Epistemonikos) in February 2025 and updated in February 2026. Screening was conducted in 2 stages, with independent screening by 2 reviewers. Studies published in the English language after January 2017 with at least one outcome expressing concerns of LLM or generative artificial intelligence use in health care research were included. The included studies were quality appraised for risk of bias and certainty of the evidence using AMSTAR-2 (A Measurement Tool to Assess Systematic Reviews) and GRADE (Grading of Recommendations Assessment, Development, and Evaluation), respectively. Data was extracted using a piloted form and narratively synthesized following SWiM guidelines and the PRIOR (Preferred Reporting Items for Overviews of Reviews) checklist. Results: The search retrieved 448 systematic reviews, of which 42 met the inclusion criteria. Further, 12 distinct populations were identified, including researchers and clinicians in various medical specialties. The included reviews were assessed to be of very poor quality, and the level of overlap between primary studies could not be determined. Additionally, 15 reviews focused on ChatGPT, a further 15 on two or more LLMs, and 12 on generic artificial intelligence. Thus, 3 main themes emerged from the narrative synthesis. In order of most to least frequently discussed: (1) technical capability; (2) ethical, legal, and societal; and (3) costs. Conclusions: To our knowledge, this is the first umbrella review to address the concerns of LLMs in health care research and practice. Thematic analyses provided insight into the complexity of different perspectives, and by using a whole population approach, it demonstrates common narratives. However, the poor quality of the included studies and potential overlap of results are substantial limitations. Data quality is at the heart of these concerns, and combative action must ensure health care professionals and researchers have the resources required to overcome these apprehensions. Ethical, legal, and societal implications of artificial intelligence use were also commonly raised. As technology accelerates and demands on health care increase, we must adapt and embrace change with equity, diversity, inclusion, and safety at the core. Trial Registration: PROSPERO CRD420250640997; https://www.crd.york.ac.uk/PROSPERO/view/CRD420250640997

  • AI-generated image. Generator: Nano Banana Pro, 19 April 2026; requestor: Jiao Fangfang. Source: Nano Banana Pro; Copyright: N/A (AI-Generated image); URL: https://jmir.org/2026/1/e74046/; License: Public Domain (CC0).

    Effectiveness of Telemedicine vs Face-to-Face Consultation in Fighting COVID-19: Retrospective Cohort Study of Adult Patients With COVID-19 in a Primary Care...

    Abstract:

    Background: Telemedicine use expanded rapidly during the COVID-19 pandemic. The Hong Kong Hospital Authority (HA) launched both tele-designated clinics (Tele-DCs) and face-to-face physical designated clinics (PDCs) to manage mild cases. However, the comparative effectiveness of these models remains unclear. Objective: This study aimed to compare clinical outcomes, specifically hospitalization and severe complications, between patients with mild COVID-19 managed via Tele-DCs versus PDCs in Hong Kong’s public primary care setting. Methods: We conducted a retrospective cohort study involving all patients with COVID-19, aged 18 years or older, who visited a PDC (n=23,031) or a Tele-DC (n=38,628) at the Kowloon Central Cluster in Hong Kong from July 28, 2022, to January 29, 2023. Patients were matched 1:1 using propensity score matching based on age, sex, smoking status, comprehensive social security assistance (CSSA) status, and the Charlson comorbidity score, resulting in 17,199 patients per group. The primary outcome was the hospital admission rate between day 1 and day 28. Secondary outcomes included severe complications, mortality, accident and emergency department (AED) use, the antiviral prescription rate, and DC revisit. Results: The average age of patients in the Tele-DC and PDC groups was 58.55 (SD 17.53) and 58.53 (SD 17.54) years, respectively (P=.93). In both groups, 9.05% (n=1557) of patients were on CSSA, and 11% (n=1892) were smokers. Compared to the PDC group, the Tele-DC group demonstrated similar hospital admission rates (Tele-DC: n=497, 2.89%; PDC: n=471, 2.74%; between-group difference 0.15%, 95% CI –0.20% to 0.50%, P=.40), lengths of stay (Tele-DC: mean 6.92, SD 0.47 days; PDC: mean 6.61, SD 0.50 days; between-group difference 0.31 days, 95% CI –1.65 to 1.04, P=.66), severe complication rates (Tele-DC: n=46, 0.27%; PDC: n=33, 0.19%; between-group difference 0.08%, 95% CI –0.03% to 0.18%, P=.18), and mortality rates (Tele-DC: n=23, 0.13%; PDC: n=18, 0.10%; between-group difference 0.03%, 95% CI –0.04% to 0.10%, P=.39). However, the Tele-DC group exhibited a higher AED visit rate (n=641, 3.73%, vs n=542, 3.15%; between-group difference 0.58%, 95% CI 0.19%-0.96%, P.003) and DC revisit rate (n=1446, 8.41%, vs n=1287, 7.48%; between-group difference 0.93%, 95% CI 0.09%-1.50%, P.002). In addition, the Tele-DC group had a lower antiviral prescription rate (n=9872, 57.4%, vs n=10,797, 62.78%; between-group difference –5.38%, 95% CI –6.41% to –4.32%, P<.001). Conclusions: The tele-DC demonstrated clinical safety comparable to the PDC regarding hospitalization and severe complications for patients with mild COVID-19. By validating a scalable model without complex home monitoring, these findings challenge the strict necessity of physical examinations for safe triage and support a digital-first strategy for future infectious surges. However, the disparities observed in AED visits and antiviral prescription rates suggest that integrated remote monitoring tools and improved medication logistics are needed to fully replicate the efficacy of conventional care.

  • AI-generated image, in response to the request “A bright, high-key, photorealistic medical editorial image of a healthcare professional in a white lab coat typing on a laptop at a desk in a clean modern clinic. The composition is a close-up cropped from the chest down, with the face completely out of frame. A stethoscope hangs around the neck, partially visible. The laptop is positioned in the left foreground, the hands are centered over the keyboard, and the desk surface is visible in the lower area with printed clinical case-report papers. Around and above the laptop, soft transparent blue holographic medical interface panels float in midair, combined with delicate glowing node-and-line network patterns, elegant and minimal, resembling abstract clinical concept extraction and medical AI visualization. The holograms are subtle, semi-transparent, and layered, not overly futuristic, with soft blue light. No readable text, no letters, no numbers, no labels, no symbols anywhere in the interfaces or documents. Clean healthcare stock-photo aesthetic, soft daylight, realistic materials, shallow depth of field, calm professional atmosphere, premium scientific journal artwork style, no clutter, no watermark. ”(Generator: ChatGPT Images/OpenAI March 21, 2026). Source: AI-generated image; Copyright: N/A (AI-generated image); URL: https://www.jmir.org/2026/1/e78681; License: Public Domain (CC0).

    Evaluating Encoder and Decoder Models for Extended Clinical Concept Recognition in Japanese Clinical Texts: Comparative Study With Weighted Soft Matching

    Abstract:

    Background: Extracting medical knowledge for secondary purposes, such as diagnostic support, continues to pose a substantial challenge. Conventional named entity recognition has focused on short terms (eg, genes, diseases, and chemicals), whereas extraction and assessment of longer, complex expressions remain underexplored. Clinically vital concepts, such as diseases, pathologies, symptoms, and findings, often appear as long phrases, and accurate extraction is crucial for applications such as constructing causal knowledge from case reports. Consequently, a framework addressing both short terms and clinically meaningful long phrases—termed extended Clinical Concept Recognition (E-CCR)—is essential. Objective: This study, the first comprehensive investigation of E-CCR model selection, aimed to identify optimal strategies by comparing encoder versus decoder models and general-purpose versus domain-specific pretraining. We analyzed variation in effectiveness by target length and proposed a novel E-CCR evaluation metric. Methods: We evaluated 17 encoder and decoder models using J-CaseMap, a database of approximately 20,000 Japanese case reports annotated with clinical concepts. Performance was primarily assessed using the weighted soft matching score, which penalizes fragmentation of long extraction targets and weights scores by target length to account for the greater difficulty of extracting longer expressions. Results: On J-CaseMap, JMedDeBERTa(s)—an encoder model pretrained on domain-specific medical text—achieved the highest mean performance (F1-score=0.758, SD 0.002), with similarly strong results from JMedDeBERTa(c), suggesting comparable performance among the top encoder models. As the fragmentation penalty increased, performance generally declined; however, no consistently severe degradation was observed. On the Medical Report Named Entity Recognition for positive disease dataset, the general-domain DeBERTaV2-base yielded the highest mean F1 score, and differences among the medical-domain JMedDeBERTa(s) and JMedDeBERTa(c) variants were small, suggesting limited benefit of domain-specific pretraining. Overall, under our experimental settings (low-rank adaptation fine-tuning for decoders and full fine-tuning for encoders), encoder models outperformed decoder models, and token classification outperformed our instruction tuning setup. Conclusions: Under our experimental setting, encoder-based token classification achieved the highest mean performance on our internal dataset. Differences among the top encoder models were small and should be interpreted as comparable within the uncertainty implied by our annotation review, whereas decoder-based approaches did not surpass encoder-based models in this setup, suggesting that encoder models can deliver high accuracy with fewer parameters and may offer practical advantages in resource-constrained environments. Token classification outperformed instruction tuning for extracting long expressions, whereas instruction tuning was better suited to short terms. Using the weighted soft matching score, we found that performance did not substantially deteriorate as the fragmentation penalty increased, indicating that extracted spans were rarely fragmented. Similar trends in external validation datasets suggest that findings under our setup may generalize to information extraction tasks on Japanese medical text. Further investigation is needed to determine whether these findings hold across other languages and medical document types.

  • Source: iStock; Copyright: champpixs; URL: https://www.istockphoto.com/photo/asian-senior-man-falling-on-the-ground-with-walker-in-living-room-at-home-elderly-gm1433561104-475438188; License: Licensed by the authors.

    Machine Learning and Deep Learning Models for Predicting Future Falls in Community-Dwelling Older Adults: Systematic Review and Meta-Analysis of Longitudinal...

    Abstract:

    Background: Machine learning (ML) and deep learning (DL) show promise for fall risk prediction, but prior reviews focused mainly on real-time fall detection, in-hospital falls, or conventional statistical models. The performance of ML-DL–based models for predicting future falls in community-dwelling older adults remains unclear. Objective: This study aimed to review ML-DL studies for predicting future falls among community-dwelling older adults and meta-analyze discrimination where feasible. Methods: Six databases were searched from inception to September 23, 2024, with updates on August 31, 2025, and February 28, 2026. We included longitudinal studies developing or validating ML-DL models to predict future falls in community-dwelling adults aged ≥60 years and excluded real-time detection, simulated or no fall, and inpatient studies. Risk of bias was assessed using the Prediction Model Risk of Bias Assessment Tool (PROBAST). Areas under the curve (AUCs) were meta-analyzed using Hartung-Knapp-Sidik-Jonkman random-effects models with 95% CIs. Heterogeneity, 95% prediction intervals (PIs), sensitivity analyses, and subgroup analyses were conducted. Results: After screening 10,253 records, 28 (0.3%) studies were included; 18 (64.3%) focused on general older adults. Prediction horizons ranged from 3 months to 7 years, and fall incidence ranged from 1.6% to 46.6%. Twenty-three (82.1%) studies applied ML, and 5 (17.9%) studies used DL. Input modalities included text (n=18, 64.3%), sensor (n=5, 17.9%), image (n=1, 3.6%), and multimodal data (n=4, 14.3%). Common predictors included age, sex, fall history, depression, and basic daily activities. Only one model underwent external validation. Calibration reporting was sparse. All models were rated at high risk of bias. Ten models were meta-analyzed, yielding a pooled AUC of 0.79 (95% CI 0.69‐0.87) with extreme heterogeneity (=0.64; =0.80; =99.8%; =4128.99). The confidence-distribution bootstrap PI was 0.20 to 0.99, indicating substantial uncertainty in expected performance across new populations. Subgroup analyses indicated moderation by sample size and population type, with higher discrimination in specific populations than in general samples; however, the specific population subgroup included only 2 studies. Although all participants were community dwelling, some cohorts were recruited through clinically enriched pathways rather than general community sampling. Conclusions: ML-DL models show potential for identifying community-dwelling older adults at elevated future fall risk; however, wide PIs, limited external validation, and high risk of bias suggest real-world performance may be optimistic. The pooled AUC should be interpreted as a summary of reported discrimination under study-specific conditions, predominantly from internally validated, high-risk-of-bias models, rather than as a robust estimate of transportable real-world performance. This review extends prior reviews by focusing on community-dwelling settings and by integrating PROBAST, Hartung-Knapp-Sidik-Jonkman meta-analysis, PIs, and modality-specific synthesis to evaluate both discrimination and uncertainty. Findings support the use of ML-DL models for proactive fall prevention while emphasizing the need for validation and context-specific implementation. Trial Registration: PROSPERO CRD42024580902; https://www.crd.york.ac.uk/PROSPERO/view/CRD42024580902

  • Source: Freepik; Copyright: freepik; URL: https://www.freepik.com/free-photo/diabetic-man-checking-glucose-patch-sensor_65373187.htm; License: Licensed by JMIR.

    A Digital Diabetes Self-Management Education and Support Program Integrated With Continuous Glucose Monitoring for Type 2 Diabetes: Randomized Controlled Trial

    Abstract:

    Background: Previous research has demonstrated that the use of continuous glucose monitoring (CGM) can improve glycemic control in people with type 2 diabetes when used regularly alongside a digital diabetes self-management education and support (DSMES) program. However, to date, there is limited evidence showing the benefits of a digitally delivered DSMES program combined with real-time CGM for adults with type 2 diabetes. Objective: The objective of this study is to evaluate the impact of a DSMES program coupled with CGM on hemoglobin A (HbA) and CGM-derived glycemic measures compared to usual care for adults with type 2 diabetes over 6 months. Methods: Participants with type 2 diabetes and HbA of 8% or higher (64 mmol/mol) who were not using mealtime bolus insulin (aged 26‐83 y; mean HbA 9.6%, SD 1.4% [mean 81.2 mmol/mol, SD 15.8 mmol/mol]) were randomly assigned to a digital DSMES+CGM integrated solution (n=51) or usual care (n=49) for 6 months. The primary outcome was HbA. The secondary outcomes were CGM-derived glycemic measures, including glucose management indicator, percent time in range 70 to 180 mg/dL, percent time above range (>180 mg/dL), percent time below range (<70 mg/dL), and mean glucose. Linear mixed effects models were used for intention-to-treat analyses. Results: HbA was lower among the intervention group versus the usual care group at 3 months (difference=−0.7%, 95% CI −1.4% to −0.1% or difference=−8.1 mmol/mol, 95% CI −15.5 to −0.7 mmol/mol; =.03) and at 6 months (difference=−0.6%, 95% CI −1.4% to 0.2% or difference=−6.9 mmol/mol, 95% CI −15.7 to 1.9 mmol/mol; =.12) but only reached statistical significance at 3 months. CGM-derived glycemic measures, including glucose management indicator (difference=−0.9%, 95% CI −1.7% to −0.1%; =.03), time in range (difference=14.6%, 95% CI 1.0% to 28.2%; =.04), time above range (difference=−14.9%, 95% CI −29.0% to −0.9%; =.04), and mean glucose (difference=−36.4 mg/dL, 95% CI −70.0 to −2.9 mg/dL; =.03), also significantly improved for the intervention group versus the usual care group at 6 months. Conclusions: The combination of digital DSMES+CGM is effective for supporting adults with type 2 diabetes in managing their condition and has the potential to reduce the risk of long-term health complications. Trial Registration: ClinicalTrials.gov NCT05368454; https://clinicaltrials.gov/ct2/show/NCT05368454

  • AI-generated image in response to the request "A Korean woman in her early 50s using a smartphone on a subway train in Seoul to respond to a patient experience survey question displayed on her screen, using a 4-point scale (as shown in the attached file). She has selected the top-box option but appears to be considering changing her response to the next category. The image should have a 4:3 aspect ratio (1200 × 900 px)." Generator: Gemini Nano Banana 2 [Mar 24, 2026]; Requestor: Young Kyung Do. Source: Image created by Gemini Nano Banana 2; Copyright: N/A (AI - generated image); URL: https://www.jmir.org/2026/1/e79398/; License: Public Domain (CC0).

    Mode Effects Between Mobile Web and Telephone Surveys on Patient Experience Scores in South Korea: Secondary Analysis of a Randomized Controlled Trial Under...

    Abstract:

    Background: Patient experience surveys are essential tools for assessing health care quality, yet the potential influence of survey mode on patient experience scores remains understudied. This study investigates the mode effects between mobile web and telephone surveys within South Korea’s Patient Experience Assessment. Objective: This study aimed to examine the presence and extent of the mode effects of mobile web versus telephone surveys on patient experience scores. The primary outcome was defined as the total score across all 21 survey items, rescaled to 0‐100. Methods: This is a secondary analysis using experimental data from a parallel-group randomized controlled trial involving 3200 patients (adults aged ≥19 years, hospitalized >1 day, discharged 2‐56 days before the survey) from 4 general hospitals between October and November 2022, equally allocated to telephone and mobile web survey modes. An independent survey company generated the random allocation sequence using computer-generated random numbers and assigned participants to the survey modes. Due to the nature of the intervention, blinding of participants, interviewers, and outcome assessors was not feasible after assignment. We calculated unadjusted score differences among respondents and estimated adjusted differences accounting for nonresponse using inverse probability weighting (IPW) and multiple imputation (MI) under the missing-at-random assumption. Sensitivity analyses, using the delta-adjustment method based on the missing-not-at-random assumption, assessed robustness to departures from the missing-at-random assumption. Subgroup analyses by sex, age group, and field of care were also conducted. Results: Of 3200 patients randomized (1600 per mode), 878 completed the survey (520 mobile web and 358 telephone). Analyses included all randomized participants (n=3200), with nonresponse addressed through IPW and MI. No adverse events were reported in this survey-based study. The total patient experience score was significantly lower in the mobile web group (mean 81.5, SD 16.4) than in the telephone group (mean 84.9, SD 14.3; unadjusted difference –3.41 points, 95% CI –5.51 to –1.31; IPW-adjusted –4.11, 95% CI –6.17 to –2.04; MI-adjusted –4.59, 95% CI –7.45 to –1.73). Similar patterns were observed across most subdomains. Subgroup analyses revealed consistent mode effects across different demographic categories. Sensitivity analyses using the delta-adjustment method confirmed the robustness of these findings under various missing data scenarios. Conclusions: Mobile surveys may yield substantially lower patient experience scores than telephone surveys. Unlike previous studies, our study analyzes randomized experimental data under various missingness scenarios and provides evidence that this mode effect is unlikely to be attributable to analytical methods or heterogeneity in respondent characteristics between the 2 survey administration modes. Accordingly, caution is warranted when comparing patient experience scores obtained from traditional telephone surveys with those from mobile surveys. Methodologically, our sensitivity analysis approach provides a robust framework for assessing and addressing potential nonresponse bias in patient experience assessments. Trial Registration: Clinical Research Information Service KCT0011374; https://tinyurl.com/3e3u5mjs

  • Older adult interacting with the Chronic Care Platform. Source: envato; Copyright: Wavebreakmedia; URL: https://elements.envato.com/high-angle-view-of-thoughtful-senior-woman-using-l-H5PUEQL; License: Licensed by the authors.

    Understanding How a Digital Platform for Chronic Disease Management Can Enable and Limit Patient Self-Care: Qualitative Study

    Abstract:

    Background: A growing segment of the population requires ongoing care and support for managing their chronic diseases. Digital platforms for self-management are rapidly emerging to meet this need, but patients’ experiences with these platforms vary significantly. This may be due to the complexity and flexibility of digital platforms, where the wide array of available features can generate unexpected impacts. Objective: This study aims to explore how a digital platform can both enable and limit patients with a chronic disease in managing their own health. Methods: We conducted semistructured qualitative interviews with patients to better understand their experience of using a digital platform for self-managing their chronic diseases. Patients who had been using a digital platform (the Chronic Care Platform) for at least 1 month were invited to participate. Twenty-four patients were recruited and interviewed in person or by phone. The collected data were analyzed using template analysis, which is a type of thematic analysis that allows inductive identification of themes from data and deductive application of theory-informed themes. We leveraged Self-Care Theory to understand how patients’ motivation to use the platform and their subsequent use of its features generated perceived value and challenges in achieving this value. Results: The platform was shown to support patients’ development of core self-care abilities (cognitive, psychosocial, and sociocultural abilities) and self-care behaviors (maintenance, monitoring, management), but it did not provide any support to the development of physiological abilities. Moreover, results indicate important limitations in the way in which the digital platform supported all self-care abilities and behaviors—in particular, self-care management. Hence, while the platform was viewed as valuable overall, patients reported several challenges in effectively using the Chronic Care Platform for self-care. Conclusions: Digital platforms for chronic disease management can enhance patient self-care by providing valuable resources and support for reinforcing desired behaviors. However, gaps in platform features can limit patients’ ability to comprehensively care for themselves. This study shows that relating platform features to specific dimensions of self-care can help identify missing features, providing a fine-grained understanding of how a given platform is generating positive impacts and how it may be improved to fully support self-care.

  • Source: Freepik; Copyright: pressfoto; URL: https://www.freepik.com/free-photo/unrecognizable-doctor-extending-digital-tab-anonymous-patient-fill-questionnaire_5699298.htm; License: Licensed by JMIR.

    Understanding Patient-Reported Offenses in Electronic Health Records: Cross-Sectional Mixed Methods Survey

    Abstract:

    Background: Patients’ access to their electronic health record (EHR) supports their participation and satisfaction with care. Despite the benefits, some patients have been upset after reading their EHR. Additionally, health care professionals are concerned that patients, particularly those with mental health conditions, may be offended, and they have expressed a need for further guidelines on how to write EHRs. Experiences among various patient groups are essential to support the relationship between patients and professionals. However, prior studies have often focused on single patient groups or specific clinical contexts, leaving a limited understanding of differences across multiple patient groups. Objective: This study aimed to determine whether certain patient groups are more likely to feel offended while reading their EHRs and which information is perceived as offensive and to provide a comparison across multiple patient groups using a mixed methods approach. Methods: A cross-sectional survey was conducted via the Finnish national patient portal using a web-based patient survey, adopting a mixed methods approach. The survey included multiple-choice and open-ended questions. The total sample comprised 4681 respondents. The survey respondents were placed into 4 patient groups: those who had received care for mental health, cancer, or other conditions and those who had received no care. Associations between the type of care and patients who felt offended were estimated using multivariate binary logistic regression. Inductive content analysis (n=502) was conducted to identify information perceived as offensive in the EHR. Results: The patients who had received mental health care (166/654, 25.4%) or cancer and mental health care (9/39, 23.1%) were more likely to be offended by information in their EHR compared to the other groups (cancer care: 37/375, 9.9%; other conditions care: 383/3316, 11.6%; no care: 22/206, 10.7%; other conditions care: odds ratio 0.37, 95% CI 0.29‐0.46; <.001; model A). Additionally, female patients, those with bad or very bad health conditions, and patients with bachelor’s or master’s degrees were significantly more likely to feel offended. Errors, the health care professionals’ disrespectful language, and perceived unnecessary information were the most frequently mentioned reasons for being offended. Patients with mental health care reported more often that unnecessary information and professionals’ opinions and word choices were experienced as offensive compared to other patients. Conclusions: This study contributes new knowledge by identifying differences across multiple patient groups. Although a minority of patients felt offended by their EHR, health care professionals should consider that some patients, particularly those who have received mental health care or cancer and mental health care, may be offended by specific information or word choices in their EHRs. To address this, health care professionals should receive education on how to write their notes in a neutral tone and avoid potentially offensive topics. Improving the quality of EHRs could strengthen the relationship between patients and professionals.

  • Source: Freepik; Copyright: freepik; URL: https://www.freepik.com/free-photo/senior-people-school-class-with-laptop-computer_37446909.htm; License: Licensed by JMIR.

    Digital Health Literacy, Technology Acceptance, and Competence Among Older Adults Aged ≥65 Years: Cross-Sectional Study Investigating Differences Between...

    Abstract:

    Background: Digital health literacy (DHL) has the potential to improve health among older adults by enhancing access to health-related information and health care services. Objective: The aim of this study was to analyze the relationship between DHL and technology commitment in adults aged 65 years and older, while also investigating possible gender differences. Methods: The analytical sample consisted of 1824 individuals. The analysis included descriptive comparisons in terms of DHL, technology acceptance, competency, support, and internet use. Multivariate regression models (generalized linear models) were applied in order to test the association between DHL and technology commitment, controlling for internet use as well as health-related and sociodemographic characteristics. Results: Male and female participants did not differ in terms of DHL (mean score: 3.5, SD 1.2 [men] and 3.5, SD 1.3 [women]; =.70); however, male participants reported significantly higher technology acceptance (<.001) and higher technology competencies (<.001), but less support with regard to technology use (<.001). Within regression models, only higher technology acceptance (coefficient=0.023, 95% CI 0.006‐0.041; =.01) and support (coefficient=0.027, 95% CI 0.014‐0.040; <.001) were significantly linked to greater DHL. The subgroup analysis revealed that DHL was significantly associated with technology acceptance among men (coefficient=0.036, 95% CI 0.012‐0.060; =.003) but not women (coefficient=0.024, 95% CI 0.008‐0.040; =.44). Conclusions: According to the current results, DHL is highly related to technology commitment. Gender differences should be taken into account when developing and evaluating appropriate interventions to improve DHL by addressing the acceptance of technologies and optimizing support infrastructures.

  • Source: The Authors/PlaceIt; Copyright: The Authors/PlaceIt; URL: https://www.jmir.org/2026/1/e97341; License: Licensed by JMIR.

    Maturity, Safety, and Equity of AI-Enabled Systems and Triage in Integrated Primary Care

    Authors List:

    Abstract:

    Artificial intelligence (AI)–enabled systems must simultaneously improve the Quintuple Aim and digital health maturity, including equitable access to and quality and interoperability of data, tools, agents, and services. This requires a comprehensive sociotechnical and global approach to cocreation, management, and governance for individuals and organizations in the ecosystem.

Citing this Article

Right click to copy or hit: ctrl+c (cmd+c on mac)

Latest Submissions Open for Peer-Review:

View All Open Peer Review Articles
  • Multinational Assessment of Readability in Online Informed Consent Forms for Implant-Based Breast Augmentation

    Date Submitted: May 14, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: Breast augmentation is one of the most performed aesthetic surgical procedures worldwide. Given its elective character and potential complications, clear and comprehensible informed consen...

    Background: Breast augmentation is one of the most performed aesthetic surgical procedures worldwide. Given its elective character and potential complications, clear and comprehensible informed consent is essential. Objective: To evaluate and compare the readability of online informed consent forms for implant-based breast augmentation in languages used in countries with the highest procedural volumes according to the 2024 ISAPS survey. Methods: The phrase “breast augmentation consent form” was translated into eight languages and searched using Google in private mode. After applying exclusion criteria, 77 documents were analysed. Readability was assessed using the LIX index. Results: The overall mean LIX score was 53 ± 9, corresponding to very difficult texts. Significant differences in readability were observed between languages (P < .001). English-language forms demonstrated the lowest mean LIX (46 ± 7), classified as difficult, whereas Portuguese (55 ± 9), Italian (58 ± 6), and Turkish (63 ± 2) reached very difficult or highly complex levels. No document achieved an “easy” or “moderately difficult” classification. There were no statistically significant differences in readability between private and non-private practice sources, nor any correlation between the number of available forms and mean LIX values. Conclusions: Online consent forms consistently exhibit high linguistic complexity, underscoring the need for systematic simplification and standardisation.

  • Family Size and Longitudinal Outcomes of a Digital–Human Parenting Intervention in Chinese Preschool Families: Secondary Analysis of a Cluster Randomized Controlled Trial

    Date Submitted: May 14, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: Parenting interventions can improve parental and child outcomes across diverse settings. However, less is known about how family size—particularly the number of children—shapes baselin...

    Background: Parenting interventions can improve parental and child outcomes across diverse settings. However, less is known about how family size—particularly the number of children—shapes baseline conditions and how intervention effects unfold over time. Most studies focus on average treatment effects, with limited attention to heterogeneity across family contexts and trajectories of change. Objective: Objective. This study aimed to examine whether family size was associated with baseline parental and child outcomes, moderated short-term intervention effectiveness, and shaped longitudinal trajectories of change following a parenting intervention. Methods: At baseline, families with more children reported lower levels of early learning and stimulation and proactive parenting practices, alongside higher parenting stress and greater endorsement of corporal punishment, while child behavioral outcomes and caregiver-perpetrated violence were broadly comparable across groups. The number of children did not significantly moderate intervention effectiveness at immediate post-intervention. However, trajectories diverged over time. Two-child families showed the most consistent improvements, whereas families with three or more children demonstrated larger but more variable gains in several child behavioral domains. In contrast, one-child families showed more limited changes across multiple domains. Results: At baseline, families with more children reported lower levels of early learning and stimulation and proactive parenting practices, alongside higher parenting stress and greater endorsement of corporal punishment, while child behavioral outcomes and caregiver-perpetrated violence were broadly comparable across groups. The number of children did not significantly moderate intervention effectiveness at immediate post-intervention. However, trajectories diverged over time. Two-child families showed the most consistent improvements, whereas families with three or more children demonstrated larger but more variable gains in several child behavioral domains. In contrast, one-child families showed more limited changes across multiple domains. Conclusions: Family size might not always be associated with short-term intervention effectiveness but was associated with divergence in longer-term trajectories. These findings suggest that caregiving demands are relevant for the sustainability of intervention effects. By integrating baseline differences, short-term effects, and longitudinal trajectories within a single framework, this study highlights the importance of moving beyond average treatment effects to more dynamic, context-sensitive evaluations. Designing parenting interventions, particularly scalable digital–human programs, that incorporate sustained and context-responsive support may be critical for addressing variation in family structure and enhancing long-term effectiveness. Clinical Trial: The trial was prospectively registered on the Chinese Clinical Trial Registry (ChiCTR2400081911).

  • Artificial Intelligence Avatars for Emotional Regulation and Anxiety Management Among University Students: Mixed Methods Survey Study

    Date Submitted: May 14, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: Conversational artificial intelligence (AI) avatars are emerging as possible tools for scalable mental health support, but their acceptability, perceived empathy, usability, and privacy im...

    Background: Conversational artificial intelligence (AI) avatars are emerging as possible tools for scalable mental health support, but their acceptability, perceived empathy, usability, and privacy implications remain insufficiently understood. Objective: This study aimed to examine university students' attitudes toward conversational AI avatars for mental health support and to evaluate perceived usability, empathy, satisfaction, and barriers after a brief avatar interaction. Methods: We conducted a two-phase mixed methods survey study with 102 university students. Phase 1 assessed attitudes toward AI-based mental health support using an online questionnaire. In Phase 2, a volunteer subset of 16 participants completed a 10-minute interaction with a three-dimensional avatar using cognitive behavioral therapy (CBT)-informed dialogue protocols and then completed a post interaction evaluation. Quantitative responses were summarized using descriptive statistics, and open-ended responses were examined using descriptive thematic analysis. Results: Among participants with valid item-level responses, 76.2% agreed that AI could help some people, and 59.4% reported that they would use AI therapy if it were free. However, only 23.8% believed that an AI therapist could genuinely understand their emotions, and 55.0% preferred talking to a real person rather than an AI system. In the interactive subset, 11 of 16 participants (68.8%) reported being moderately satisfied, although 11 of 16 participants (68.8%) still preferred an in-person therapist when given the choice. Qualitative feedback highlighted privacy, nonjudgmental support, effective communication, and practical advice as perceived strengths, whereas emotional depth, speech naturalness, and interaction pacing were identified as areas for improvement. Conclusions: Findings suggest that AI avatars may be acceptable as preliminary support, psychoeducation, or triage tools, but they should not be framed as replacements for human clinicians. Improving emotional nuance, voice quality, response pacing, and transparent data governance will be essential before broader deployment in university mental health settings.

  • Non-Patient Stakeholder Perspectives on the use of Gamification and Financial Incentives in mHealth for Medication Adherence: Mixed Methods Consensus Study

    Date Submitted: May 13, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: Medication nonadherence remains a major global health challenge, contributing to preventable disease, hospitalizations, and healthcare costs. Mobile health (mHealth) applications incorpora...

    Background: Medication nonadherence remains a major global health challenge, contributing to preventable disease, hospitalizations, and healthcare costs. Mobile health (mHealth) applications incorporating gamification and financial incentives have shown potential to improve adherence; however, most research has focused on patient perspectives, with limited understanding of how non-patient stakeholders perceive their feasibility, risks, and implementation. Understanding non-patient stakeholder perspectives in relation to patient viewpoints is essential for informing future policy development and establishing practical, industry-supported safeguards that protect consumers while enabling innovation. Objective: This study aimed to explore non-patient stakeholder perspectives on the use of gamification and financial incentives in mHealth apps for medication adherence and to integrate these with previously reported patient perspectives to inform consensus-based design and policy considerations. Methods: A mixed-methods study was conducted using a modified virtual Nominal Group Technique (vNGT). Non-patient stakeholders across healthcare, industry, and policy sectors in Australia were recruited. Data collection involved a pre-session survey followed by online focus groups. Qualitative responses were analyzed using thematic analysis supported by AI-assisted coding. Consensus statements derived from themes were rated during the focus groups. Additional prompts were used to elicit further discussion where consensus was not immediately achieved. Results: A total of 20 participants were included in the study. Six key themes were identified: tailored gamification for adherence, financial incentives as a contested motivator, designing for diversity and inclusion, usability barriers to engagement, trust through data governance, and validated and sustainable innovation. These informed 24 consensus statements, of which 54% (13/24) achieved unanimous agreement. Stakeholders strongly endorsed personalization, simplicity, and transparent data practices, while expressing nuanced concerns regarding the ethical use, sustainability, and potential unintended consequences of financial incentives. Compared with prior patient findings, the participants demonstrated substantial alignment on core design principles but contributed additional system-level considerations related to feasibility, scalability, and regulation. Conclusions: Non-patient stakeholders largely reinforce patient priorities while extending them with critical perspectives on implementation, governance, and sustainability. Gamification and financial incentives are viewed as potentially effective but require careful, ethically grounded design to balance engagement with long-term motivation and trust. These findings support the development of stakeholder-informed guidelines for responsible mHealth innovation and highlight the importance of integrating patient and system-level perspectives in digital health design. Future research should prioritize co-designed longitudinal studies utilizing apps with gamification and a range of incentive offers with clear redemption processes to evaluate the long-term impact on medication adherence across diverse patient populations.

  • From prediction model to clinical decision support: a user-centered study of anesthesiologists’ requirements for perioperative clinical decision support systems

    Date Submitted: May 13, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: High-performing perioperative prediction models have not consistently translated into clinical benefit, in part because model outputs must be delivered through clinical decision support sy...

    Background: High-performing perioperative prediction models have not consistently translated into clinical benefit, in part because model outputs must be delivered through clinical decision support systems (CDSS) that align with anesthesia workflows and end-user needs. Objective: To identify anesthesia professionals’ requirements for perioperative CDSS and use these findings to inform the design specification of a user-centered perioperative CDSS. Methods: This user-centered study was conducted in four sequential phases: translation of a previously validated explainable machine-learning model into candidate CDSS functions; three rounds of focus group–based iterative prototyping; a nationwide cross-sectional questionnaire survey; and CDSS finalization based on iterative prototyping and survey findings. The survey assessed requirements for information display, alerting, explainability, intervention support, and workflow integration among anesthesia-related professionals in China. Results: Three rounds of focus group discussion and iterative prototyping generated a preliminary prototype comprising candidate modules for information display, alerting, explainability, intervention support, and workflow integration. A total of 2401 valid questionnaires were analyzed. Respondents generally preferred direct risk presentation, probability-based alerting, interpretable displays of modifiable risk factors, actionable intervention support, and integration within existing clinical platforms. These findings informed the final specification of an integrated CDSS within the anesthesia information system, including dynamic risk prediction, threshold-based alerting, explainable risk attribution, and evidence-informed intervention recommendations. Conclusions: In this user-centered design study, anesthesia professionals identified key requirements for perioperative CDSS, including direct information display, clinically meaningful alerts, explainable risk-factor presentation, actionable recommendations, and workflow integration. These findings may inform the translation of perioperative prediction models into decision support tools that are more usable and acceptable in routine anesthesia practice.

  • Visualizing Health in Platform Work: A Photovoice Study Comparing Freelancers, Couriers, and Taxi Drivers in Sweden

    Date Submitted: May 13, 2026

    Open Peer Review Period: May 14, 2026 - Jul 9, 2026

    Background: The platform-based economy has expanded rapidly through the integration of digital platforms into sectors such as transportation, delivery, and freelance work. Platform labor combines feat...

    Background: The platform-based economy has expanded rapidly through the integration of digital platforms into sectors such as transportation, delivery, and freelance work. Platform labor combines features of precarious employment and digitalized work organization, encompassing both location-based and web-based work. However, the occupational health implications of platform work remain insufficiently understood, particularly regarding how risks differ across platform worker groups. Objective: This study aimed to explore how platform workers experience their working conditions and how platform work affects their health, wellbeing, and safety. Methods: A participatory photovoice study was conducted with platform-based taxi drivers, delivery couriers, and freelancers living in Stockholm. Between September and November 2022, 16 participants were recruited into three groups (5–6 participants per group). Across five sessions, participants documented their working lives through photographs and discussed them collectively, generating 105 photographs in total. Data were analyzed collaboratively to identify key themes and recommendations related to working conditions, health, and wellbeing. Results: Participants identified 14 themes representing major determinants of health, wellbeing, and safety at work, as well as 23 recommendations for improving working conditions. Workers reported exposure to both platform-specific risks, including algorithmic management and digital surveillance, and traditional occupational risks such as psychosocial strain, ergonomic challenges, and traffic-related hazards. Experiences differed substantially across platform work types. Delivery and taxi drivers reported greater exposure to physical and traffic-related risks, whereas freelancers emphasized psychosocial demands and digital work intensification. Economic insecurity and costs associated with maintaining work equipment emerged as common challenges across all groups. Attitudes toward flexibility, autonomy, and algorithmic management also varied between worker categories. Conclusions: This study highlights important similarities and differences in working conditions and health risks across platform work types. The findings suggest that research and occupational health interventions targeting platform workers should differentiate between specific forms of platform labor to better capture the diversity of workers’ experiences and exposures.