Published on in Vol 26 (2024)
Preprints (earlier versions) of this paper are
available at
https://preprints.jmir.org/preprint/58041, first published
.

Journals
- Zhu S, Hu W, Yang Z, Yan J, Zhang F. Qwen-2.5 Outperforms Other Large Language Models in the Chinese National Nursing Licensing Examination: Retrospective Cross-Sectional Comparative Study. JMIR Medical Informatics 2025;13:e63731 View
- Liu S, McCoy A, Wright A. Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines. Journal of the American Medical Informatics Association 2025;32(4):605 View
- Kelly A, Noctor E, Ryan L, van de Ven P. The Effectiveness of a Custom AI Chatbot for Type 2 Diabetes Mellitus Health Literacy: Development and Evaluation Study. Journal of Medical Internet Research 2025;27:e70131 View
- Genovese A, Prabha S, Borna S, Gomez-Cabello C, Haider S, Trabilsy M, Tao C, Aziz K, Murray P, Forte A. Artificial Intelligence for Patient Support: Assessing Retrieval-Augmented Generation for Answering Postoperative Rhinoplasty Questions. Aesthetic Surgery Journal 2025;45(7):735 View
- Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intelligent Medicine 2025;5(2):151 View
- Sumner J, Wang Y, Tan S, Chew E, Wenjun Yip A. Perspectives and Experiences With Large Language Models in Health Care: Survey Study. Journal of Medical Internet Research 2025;27:e67383 View
- Gaber F, Shaik M, Allega F, Bilecz A, Busch F, Goon K, Franke V, Akalin A. Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis. npj Digital Medicine 2025;8(1) View
- Córdova-Esparza D. AI-Powered Educational Agents: Opportunities, Innovations, and Ethical Challenges. Information 2025;16(6):469 View
- Gültekin O, Inoue J, Yilmaz B, Cerci M, Kilinc B, Yilmaz H, Prill R, Kayaalp M. Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT‐4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information. Knee Surgery, Sports Traumatology, Arthroscopy 2025;33(8):3025 View
- Zhang C, Yang H, Liu X, Wu R, Zong H, Wu E, Zhou Y, Li J, Shen B. A Knowledge-Enhanced Platform (MetaSepsisKnowHub) for Retrieval Augmented Generation–Based Sepsis Heterogeneity and Personalized Management: Development Study. Journal of Medical Internet Research 2025;27:e67201 View
- Amugongo L, Mascheroni P, Brooks S, Doering S, Seidel J, Liu X. Retrieval augmented generation for large language models in healthcare: A systematic review. PLOS Digital Health 2025;4(6):e0000877 View
- Fukui Y, Kawata Y, Kobashi K, Nagatani Y, Iguchi H. Evaluation of a retrieval-augmented generation system using a Japanese Institutional Nuclear Medicine Manual and large language model-automated scoring. Radiological Physics and Technology 2025;18(3):861 View
- Wu X, Cai G, Guo B, Ma L, Shao S, Yu J, Zheng Y, Wang L, Yang F. A multi-dimensional performance evaluation of large language models in dental implantology: comparison of ChatGPT, DeepSeek, Grok, Gemini and Qwen across diverse clinical scenarios. BMC Oral Health 2025;25(1) View
- Zhu X, Dai W, Evans R, Geng X, Mu A, Liu Z. Current Landscape and Future Directions Regarding Generative Large Language Models in Stroke Care: Scoping Review. JMIR Medical Informatics 2025;13:e76636 View
- Song J, Xu Z, He M, Feng J, Shen B. Graph retrieval augmented large language models for facial phenotype associated rare genetic disease. npj Digital Medicine 2025;8(1) View
- Wang D, Ye J, Li J, Liang J, Zhang Q, Hu Q, Pan C, Wang D, Liu Z, Shi W, Guo M, Li F, Du W, Zheng Y. Enhancing Large Language Models for Improved Accuracy and Safety in Medical Question Answering: Comparative Study. JMIR Medical Education 2025;11:e70190 View
- Anghelescu A, Munteanu C, Anghelescu L, Onose G. ”A Midsummer Night’s Dream” quest for truth: From ChatGPT “hallucinations” to RAG reasoning and ACURAI precision — a scoping review on detection, minimizing, and (almost) complete error elimination and enhancing Large Language Models' re-liability. Balneo and PRM Research Journal 2025;16(Vol 16 No. 3):847 View
- Jeon S, Lee S, Kim E, Eun J, Lee K, Lim H, Lee J. Generative AI Chatbot for Diabetes Management: Formative 2-Part Qualitative Study Using DTalksBot Involving Patients and Clinicians. JMIR Formative Research 2025;9:e72553 View
- Serugunda H, Jianquan O, Kasujja Namatovu H, Ssemaluulu P, Kimbugwe N, Garimoi Orach C, Waiswa P. Using Large Language Models for Chronic Disease Management Tasks: Scoping Review. JMIR Medical Informatics 2025;13:e66905 View
- Bayram H, Arslan S, Ozturkcan A. Evaluating AI‐Generated Meal Plans for Simulated Diabetes Profiles: A Guideline‐Based Comparison of Three Language Models. Journal of Evaluation in Clinical Practice 2025;31(7) View
- Abo El-Enen M, Saad S, Nazmy T. A survey on retrieval-augmentation generation (RAG) models for healthcare applications. Neural Computing and Applications 2025 View
- Mansoor H. A Scoping Review of Large Language Models in Personal Sleep Wellness. Mayo Clinic Proceedings: Digital Health 2025;3(4):100301 View
- Tung J, Le Q, Yao J, Huang Y, Lim D, Sng G, Lau R, Tan Y, Chen K, Tay K, Tan J, Yuen J, Cheng C, Ho H. Performance of Retrieval-Augmented Generation Large Language Models in Guideline-Concordant Prostate-Specific Antigen Testing: Comparative Study With Junior Clinicians. Journal of Medical Internet Research 2025;27:e78393 View
- Gültekin O, Sezgin E, Cakır O, Sengul H, Kilinc B, Yılmaz B, Kocaoglu B, Kayaalp M. Retrieval‐augmented ChatGPT‐4o improves accuracy but reduces readability in hip arthroscopy patient education. Knee Surgery, Sports Traumatology, Arthroscopy 2025 View
Books/Policy Documents
- Siddireddy S, Sahay A, Singh T. Emerging Trends in Artificial Intelligence and Machine Learning. View
Conference Proceedings
- Zhong C, Ye F, Wang Z, Jigeer A, Zhan Z. 2025 14th International Conference on Educational and Information Technology (ICEIT). Interdisciplinary-QG: An LLM-Based Framework for Generating High-Quality Interdisciplinary Test Questions with Knowledge Graphs and Chain-of-Thought Reasoning View
