Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study

doi:10.2196/58041

Journals

Zhu S, Hu W, Yang Z, Yan J, Zhang F. Qwen-2.5 Outperforms Other Large Language Models in the Chinese National Nursing Licensing Examination: Retrospective Cross-Sectional Comparative Study. JMIR Medical Informatics 2025;13:e63731 View
Liu S, McCoy A, Wright A. Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines. Journal of the American Medical Informatics Association 2025;32(4):605 View
Kelly A, Noctor E, Ryan L, van de Ven P. The Effectiveness of a Custom AI Chatbot for Type 2 Diabetes Mellitus Health Literacy: Development and Evaluation Study. Journal of Medical Internet Research 2025;27:e70131 View
Genovese A, Prabha S, Borna S, Gomez-Cabello C, Haider S, Trabilsy M, Tao C, Aziz K, Murray P, Forte A. Artificial Intelligence for Patient Support: Assessing Retrieval-Augmented Generation for Answering Postoperative Rhinoplasty Questions. Aesthetic Surgery Journal 2025;45(7):735 View
Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intelligent Medicine 2025;5(2):151 View
Sumner J, Wang Y, Tan S, Chew E, Wenjun Yip A. Perspectives and Experiences With Large Language Models in Health Care: Survey Study. Journal of Medical Internet Research 2025;27:e67383 View
Gaber F, Shaik M, Allega F, Bilecz A, Busch F, Goon K, Franke V, Akalin A. Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis. npj Digital Medicine 2025;8(1) View
Córdova-Esparza D. AI-Powered Educational Agents: Opportunities, Innovations, and Ethical Challenges. Information 2025;16(6):469 View
Gültekin O, Inoue J, Yilmaz B, Cerci M, Kilinc B, Yilmaz H, Prill R, Kayaalp M. Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT‐4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information. Knee Surgery, Sports Traumatology, Arthroscopy 2025;33(8):3025 View
Zhang C, Yang H, Liu X, Wu R, Zong H, Wu E, Zhou Y, Li J, Shen B. A Knowledge-Enhanced Platform (MetaSepsisKnowHub) for Retrieval Augmented Generation–Based Sepsis Heterogeneity and Personalized Management: Development Study. Journal of Medical Internet Research 2025;27:e67201 View
Amugongo L, Mascheroni P, Brooks S, Doering S, Seidel J, Liu X. Retrieval augmented generation for large language models in healthcare: A systematic review. PLOS Digital Health 2025;4(6):e0000877 View
Fukui Y, Kawata Y, Kobashi K, Nagatani Y, Iguchi H. Evaluation of a retrieval-augmented generation system using a Japanese Institutional Nuclear Medicine Manual and large language model-automated scoring. Radiological Physics and Technology 2025;18(3):861 View
Wu X, Cai G, Guo B, Ma L, Shao S, Yu J, Zheng Y, Wang L, Yang F. A multi-dimensional performance evaluation of large language models in dental implantology: comparison of ChatGPT, DeepSeek, Grok, Gemini and Qwen across diverse clinical scenarios. BMC Oral Health 2025;25(1) View
Zhu X, Dai W, Evans R, Geng X, Mu A, Liu Z. Current Landscape and Future Directions Regarding Generative Large Language Models in Stroke Care: Scoping Review. JMIR Medical Informatics 2025;13:e76636 View
Song J, Xu Z, He M, Feng J, Shen B. Graph retrieval augmented large language models for facial phenotype associated rare genetic disease. npj Digital Medicine 2025;8(1) View
Wang D, Ye J, Li J, Liang J, Zhang Q, Hu Q, Pan C, Wang D, Liu Z, Shi W, Guo M, Li F, Du W, Zheng Y. Enhancing Large Language Models for Improved Accuracy and Safety in Medical Question Answering: Comparative Study. JMIR Medical Education 2025;11:e70190 View
Anghelescu A, Munteanu C, Anghelescu L, Onose G. ”A Midsummer Night’s Dream” quest for truth: From ChatGPT “hallucinations” to RAG reasoning and ACURAI precision — a scoping review on detection, minimizing, and (almost) complete error elimination and enhancing Large Language Models' re-liability. Balneo and PRM Research Journal 2025;16(Vol 16 No. 3):847 View
Jeon S, Lee S, Kim E, Eun J, Lee K, Lim H, Lee J. Generative AI Chatbot for Diabetes Management: Formative 2-Part Qualitative Study Using DTalksBot Involving Patients and Clinicians. JMIR Formative Research 2025;9:e72553 View
Serugunda H, Jianquan O, Kasujja Namatovu H, Ssemaluulu P, Kimbugwe N, Garimoi Orach C, Waiswa P. Using Large Language Models for Chronic Disease Management Tasks: Scoping Review. JMIR Medical Informatics 2025;13:e66905 View
Bayram H, Arslan S, Ozturkcan A. Evaluating AI‐Generated Meal Plans for Simulated Diabetes Profiles: A Guideline‐Based Comparison of Three Language Models. Journal of Evaluation in Clinical Practice 2025;31(7) View
Abo El-Enen M, Saad S, Nazmy T. A survey on retrieval-augmentation generation (RAG) models for healthcare applications. Neural Computing and Applications 2025 View
Mansoor H. A Scoping Review of Large Language Models in Personal Sleep Wellness. Mayo Clinic Proceedings: Digital Health 2025;3(4):100301 View
Tung J, Le Q, Yao J, Huang Y, Lim D, Sng G, Lau R, Tan Y, Chen K, Tay K, Tan J, Yuen J, Cheng C, Ho H. Performance of Retrieval-Augmented Generation Large Language Models in Guideline-Concordant Prostate-Specific Antigen Testing: Comparative Study With Junior Clinicians. Journal of Medical Internet Research 2025;27:e78393 View
Gültekin O, Sezgin E, Cakır O, Sengul H, Kilinc B, Yılmaz B, Kocaoglu B, Kayaalp M. Retrieval‐augmented ChatGPT‐4o improves accuracy but reduces readability in hip arthroscopy patient education. Knee Surgery, Sports Traumatology, Arthroscopy 2026;34(2):660 View
Chen B, Wang Y, Xie X, Fan B, Zhang W, Sun G. Trends in AI-based diagnosis and intervention of metabolic diseases: a bibliometric analysis of the literature from 2000 to 2024. Frontiers in Medicine 2025;12 View
Moureau M, Davis B, Hong C. Development and Evaluation of an Augmented Artificial Intelligence Model for Urogynecology Queries. International Urogynecology Journal 2025 View
Wang Y, Luan Y, Cheng S, Hao M, Tan W, Hu R, Yao Z, Wang J, Wu J. A multi-layer retrieval-augmented large language model framework for enhancing hypertension education. Hypertension Research 2026 View
Wang L, Tang K, Yang Z, Wang Y, Zhang P, Wu B, Zhao W, Chen J, Gong J, Du S, Li S. GastroTCM: a large language model assistant for gastroenterology in traditional Chinese medicine. Chinese Medicine 2026;21(1) View
Furukawa E, Okuhara T, Suzuki-Chiba H, Liu M, Okada H, Kiuchi T. The understandability and actionability of AI-generated information for patients with chronic kidney disease. Patient Education and Counseling 2026;147:109546 View

Books/Policy Documents

Siddireddy S, Sahay A, Singh T. Emerging Trends in Artificial Intelligence and Machine Learning. View

Conference Proceedings

Zhong C, Ye F, Wang Z, Jigeer A, Zhan Z. 2025 14th International Conference on Educational and Information Technology (ICEIT). Interdisciplinary-QG: An LLM-Based Framework for Generating High-Quality Interdisciplinary Test Questions with Knowledge Graphs and Chain-of-Thought Reasoning View

This paper is in the following e-collection/theme issue:

Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study

Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study

Journals

Books/Policy Documents

Conference Proceedings