Published on in Vol 25 (2023)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/49324, first published .
Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Journals

  1. Sallam M, Barakat M, Sallam M. A Preliminary Checklist (METRICS) to Standardize the Design and Reporting of Studies on Generative Artificial Intelligence–Based Models in Health Care Education and Practice: Development Study Involving a Literature Review. Interactive Journal of Medical Research 2024;13:e54704 View
  2. Rudroff T. Revealing the Complexity of Fatigue: A Review of the Persistent Challenges and Promises of Artificial Intelligence. Brain Sciences 2024;14(2):186 View
  3. Marchi F, Bellini E, Iandelli A, Sampieri C, Peretti G. Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses. European Archives of Oto-Rhino-Laryngology 2024;281(4):2123 View
  4. Berrezueta-Guzman S, Kandil M, Martín-Ruiz M, Pau de la Cruz I, Krusche S. Future of ADHD Care: Evaluating the Efficacy of ChatGPT in Therapy Enhancement. Healthcare 2024;12(6):683 View
  5. Litvin A, Stoma I, Sharshakova T, Rumovskaya S, Kyovalev A. New possibilities of artificial intelligence in medicine: a narrative review. Health and Ecology Issues 2024;21(1):7 View
  6. Omar M, Brin D, Glicksberg B, Klang E. Utilizing natural language processing and large language models in the diagnosis and prediction of infectious diseases: A systematic review. American Journal of Infection Control 2024;52(9):992 View
  7. Bonnechère B. Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation. American Journal of Physical Medicine & Rehabilitation 2024;103(6):532 View
  8. Naqvi W, Shaikh S, Mishra G. Large language models in physical therapy: time to adapt and adept. Frontiers in Public Health 2024;12 View
  9. Leypold T, Lingens L, Beier J, Boos A. Integrating AI in Lipedema Management: Assessing the Efficacy of GPT-4 as a Consultation Assistant. Life 2024;14(5):646 View
  10. Tailor P, D'Souza H, Li H, Starr M. Vision of the future: large language models in ophthalmology. Current Opinion in Ophthalmology 2024;35(5):391 View
  11. Tan S, Xin X, Wu D. ChatGPT in medicine: prospects and challenges: a review article. International Journal of Surgery 2024;110(6):3701 View
  12. Goktas P, Gulseren D, Tobin A. Large Language and Vision Assistant in dermatology: a game changer or just hype?. Clinical and Experimental Dermatology 2024;49(8):783 View
  13. Letterie G. Moonshot. Long shot. Or sure shot. What needs to happen to realize the full potential of AI in the fertility sector?. Human Reproduction 2024;39(9):1863 View
  14. Cong Y, LaCroix A, Lee J. Clinical efficacy of pre-trained large language models through the lens of aphasia. Scientific Reports 2024;14(1) View
  15. Luo M, Pang J, Bi S, Lai Y, Zhao J, Shang Y, Cui T, Yang Y, Lin Z, Zhao L, Wu X, Lin D, Chen J, Lin H. Development and Evaluation of a Retrieval-Augmented Large Language Model Framework for Ophthalmology. JAMA Ophthalmology 2024;142(9):798 View
  16. Leypold T, Schäfer B, Boos A, Beier J. Artificial Intelligence-Powered Hand Surgery Consultation: GPT-4 as an Assistant in a Hand Surgery Outpatient Clinic. The Journal of Hand Surgery 2024;49(11):1078 View
  17. Yang Z, Wang D, Zhou F, Song D, Zhang Y, Jiang J, Kong K, Liu X, Qiao Y, Chang R, Han Y, Li F, Tham C, Zhang X. Understanding natural language: Potential application of large language models to ophthalmology. Asia-Pacific Journal of Ophthalmology 2024;13(4):100085 View
  18. Labinsky H, Nagler L, Krusche M, Griewing S, Aries P, Kroiß A, Strunz P, Kuhn S, Schmalzing M, Gernert M, Knitza J. Vignette-based comparative analysis of ChatGPT and specialist treatment decisions for rheumatic patients: results of the Rheum2Guide study. Rheumatology International 2024;44(10):2043 View
  19. Wang Y, Liang L, Li R, Wang Y, Hao C. Comparison of the Performance of ChatGPT, Claude and Bard in Support of Myopia Prevention and Control. Journal of Multidisciplinary Healthcare 2024;Volume 17:3917 View
  20. Shapiro J, Lyakhovitsky A. Revolutionizing teledermatology: Exploring the integration of artificial intelligence, including Generative Pre-trained Transformer chatbots for artificial intelligence-driven anamnesis, diagnosis, and treatment plans. Clinics in Dermatology 2024;42(5):492 View
  21. Zheng Y, Gan W, Chen Z, Qi Z, Liang Q, Yu P. Large language models for medicine: a survey. International Journal of Machine Learning and Cybernetics 2025;16(2):1015 View
  22. Giacobbe D, Marelli C, Guastavino S, Signori A, Mora S, Rosso N, Campi C, Piana M, Murgia Y, Giacomini M, Bassetti M. Artificial intelligence and prescription of antibiotic therapy: present and future. Expert Review of Anti-infective Therapy 2024;22(10):819 View
  23. Wang J, Shi R, Le Q, Shan K, Chen Z, Zhou X, He Y, Hong J. Evaluating the effectiveness of large language models in patient education for conjunctivitis. British Journal of Ophthalmology 2025;109(2):185 View
  24. Merlino D, Brufau S, Saieed G, Van Abel K, Price D, Archibald D, Ator G, Carlson M. Comparative Assessment of Otolaryngology Knowledge Among Large Language Models. The Laryngoscope 2025;135(2):629 View
  25. Tam T, Sivarajkumar S, Kapoor S, Stolyar A, Polanska K, McCarthy K, Osterhoudt H, Wu X, Visweswaran S, Fu S, Mathur P, Cacciamani G, Sun C, Peng Y, Wang Y. A framework for human evaluation of large language models in healthcare derived from literature review. npj Digital Medicine 2024;7(1) View
  26. Goktas P, Grzybowski A. Assessing the Impact of ChatGPT in Dermatology: A Comprehensive Rapid Review. Journal of Clinical Medicine 2024;13(19):5909 View
  27. Bedi S, Liu Y, Orr-Ewing L, Dash D, Koyejo S, Callahan A, Fries J, Wornow M, Swaminathan A, Lehmann L, Hong H, Kashyap M, Chaurasia A, Shah N, Singh K, Tazbaz T, Milstein A, Pfeffer M, Shah N. Testing and Evaluation of Health Care Applications of Large Language Models. JAMA 2025;333(4):319 View
  28. Wu A. Chatting together: Using AI chatbots to improve diagnostic excellence. Journal of Patient Safety and Risk Management 2024;29(5):222 View
  29. Zhou S, Luo X, Chen C, Jiang H, Yang C, Ran G, Yu J, Yin C. The performance of large language model-powered chatbots compared to oncology physicians on colorectal cancer queries. International Journal of Surgery 2024;110(10):6509 View
  30. Al Khatib H, Neupane S, Kumar Manchukonda H, Golilarz N, Mittal S, Amirlatifi A, Rahimi S. Patient-centric knowledge graphs: a survey of current methods, challenges, and applications. Frontiers in Artificial Intelligence 2024;7 View
  31. Reyhan A, Mutaf Ç, Uzun İ, Yüksekyayla F. A Performance Evaluation of Large Language Models in Keratoconus: A Comparative Study of ChatGPT-3.5, ChatGPT-4.0, Gemini, Copilot, Chatsonic, and Perplexity. Journal of Clinical Medicine 2024;13(21):6512 View
  32. Zhang C, Liu S, Zhou X, Zhou S, Tian Y, Wang S, Xu N, Li W. Examining the Role of Large Language Models in Orthopedics: Systematic Review. Journal of Medical Internet Research 2024;26:e59607 View
  33. Coskun Benlidayi I, Gupta L. Translation and Cross-Cultural Adaptation: A Critical Step in Multi-National Survey Studies. Journal of Korean Medical Science 2024;39(49) View
  34. Slawaska-Eng D, Bourgeault-Gagnon Y, Cohen D, Pauyo T, Belzile E, Ayeni O. ChatGPT-3.5 and -4 provide mostly accurate information when answering patients’ questions relating to femoroacetabular impingement syndrome and arthroscopic hip surgery. Journal of ISAKOS 2025;10:100376 View
  35. Ding Z, Wei R, Xia J, Mu Y, Wang J, Lin Y. Exploring the potential of large language model–based chatbots in challenges of ribosome profiling data analysis: a review. Briefings in Bioinformatics 2024;26(1) View
  36. Bach T, Kaarstad M, Solberg E, Babic A. Insights into suggested Responsible AI (RAI) practices in real-world settings: a systematic literature review. AI and Ethics 2025;5(3):3185 View
  37. Zhan Y, Chen X, Ye F, Wu Z, Usman M, Yuan Z, Wu H, Huang J, Yu H. Evaluating AI Chatbot Responses to Postkidney Transplant Inquiries. Transplantation Proceedings 2025;57(2):394 View
  38. Ammo T, Guillaume V, Hofmann U, Ulmer N, Buenting N, Laenger F, Beier J, Leypold T. Evaluating ChatGPT-4o as a decision support tool in multidisciplinary sarcoma tumor boards: heterogeneous performance across various specialties. Frontiers in Oncology 2025;14 View
  39. Beheshti M, Toubal I, Alaboud K, Almalaysha M, Ogundele O, Turabieh H, Abdalnabi N, Boren S, Scott G, Dahu B. Evaluating the Reliability of ChatGPT for Health-Related Questions: A Systematic Review. Informatics 2025;12(1):9 View
  40. Flory J, Ancker J, Kim S, Kuperman G, Petrov A, Vickers A. Large Language Model GPT-4 Compared to Endocrinologist Responses on Initial Choice of Glucose-Lowering Medication Under Conditions of Clinical Uncertainty. Diabetes Care 2025;48(2):185 View
  41. Waldock W, Lam G, Baptista A, Walls R, Sam A. Which curriculum components do medical students find most helpful for evaluating AI outputs?. BMC Medical Education 2025;25(1) View
  42. Dillion D, Mondal D, Tandon N, Gray K. AI language model rivals expert ethicist in perceived moral expertise. Scientific Reports 2025;15(1) View
  43. Wang X, Ye H, Zhang S, Yang M, Wang X. Evaluation of the Performance of Three Large Language Models in Clinical Decision Support: A Comparative Study Based on Actual Cases. Journal of Medical Systems 2025;49(1) View
  44. Kleebayoon A, Wiwanitkit V. ChatGPT for responding to patient inquiries about otosclerosis: correspondence. European Archives of Oto-Rhino-Laryngology 2025;282(5):2785 View
  45. Huang Y, Shi R, Chen C, Zhou X, Zhou X, Hong J, Chen Z. Evaluation of large language models for providing educational information in orthokeratology care. Contact Lens and Anterior Eye 2025;48(3):102384 View
  46. Koss M, McLaughlin M, Switalla K, Falade I, Kim E. Exploring the role of ChatGPT in decision making for gender-affirming surgery. Artificial Intelligence Surgery 2025;5(1):116 View
  47. Choo S, Yoo S, Endo K, Truong B, Son M. Advancing Clinical Chatbot Validation Using AI-Powered Evaluation With a New 3-Bot Evaluation System: Instrument Validation Study. JMIR Nursing 2025;8:e63058 View
  48. Şişman A, Acar A. Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study. BMC Oral Health 2025;25(1) View
  49. Shool S, Adimi S, Saboori Amleshi R, Bitaraf E, Golpira R, Tara M. A systematic review of large language model (LLM) evaluations in clinical medicine. BMC Medical Informatics and Decision Making 2025;25(1) View
  50. Rider N, Li Y, Chin A, DiGiacomo D, Dutmer C, Farmer J, Roberts K, Savova G, Ong M. Evaluating large language model performance to support the diagnosis and management of patients with primary immune disorders. Journal of Allergy and Clinical Immunology 2025;156(1):81 View
  51. Leypold T, Bahm J, Beier J, Guillaume V, Ammo T, Lauer H, Kolbenschlag J, Schäfer B. Evaluating ChatGPT o1’s Capabilities in Peripheral Nerve Surgery: Advancing Artificial Intelligence in Clinical Practice. World Neurosurgery 2025;196:123753 View
  52. Bhasuran B, Jin Q, Xie Y, Yang C, Hanna K, Costa J, Shavor C, Han W, Lu Z, He Z. Preliminary analysis of the impact of lab results on large language model generated differential diagnoses. npj Digital Medicine 2025;8(1) View
  53. Li J, Chang C, Li Y, Cui S, Yuan F, Li Z, Wang X, Li K, Feng Y, Wang Z, Wei Z, Jian F. Large Language Models’ Responses to Spinal Cord Injury: A Comparative Study of Performance. Journal of Medical Systems 2025;49(1) View
  54. Ao G, Chen M, Li J, Nie H, Zhang L, Chen Z. Comparative analysis of large language models on rare disease identification. Orphanet Journal of Rare Diseases 2025;20(1) View
  55. Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intelligent Medicine 2025;5(2):151 View
  56. Kunze K, Gerhold C, Dave U, Abunnur N, Mamonov A, Nwachukwu B, Verma N, Chahla J. Large Language Model Use Cases in Health Care Research Are Redundant and Often Lack Appropriate Methodological Conduct: A Scoping Review and Call for Improved Practices. Arthroscopy: The Journal of Arthroscopic & Related Surgery 2025;41(11):4928 View
  57. Saxena A, Rishi B. AI and human collaboration in tourism: a framework for scalable, authentic, and engaging content. Asia Pacific Journal of Tourism Research 2025;30(9):1226 View
  58. Okenyi M, Ataguba G, Henry K, Anukem S, Orji R. Going vegan with ChatGPT: Towards designing LLMs for personalized lifestyle changes. Machine Learning with Applications 2025;20:100659 View
  59. Giuffrè M, You K, Pang Z, Kresevic S, Chung S, Chen R, Ko Y, Chan C, Saarinen T, Ajcevic M, Crocè L, Garcia-Tsao G, Gralnek I, Sung J, Barkun A, Laine L, Sekhon J, Stadie B, Shung D. Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology. npj Digital Medicine 2025;8(1) View
  60. Li Y, Li Z, Li J, Liu L, Liu Y, Zhu B, shi K, Lu Y, Li Y, Zeng X, Feng Y, Wang X. The actual performance of large language models in providing liver cirrhosis-related information: A comparative study. International Journal of Medical Informatics 2025;201:105961 View
  61. Othman A, Flaharty K, Ledgister Hanchard S, Hu P, Duong D, Waikel R, Solomon B. Assessing large language model performance related to aging in genetic conditions. npj Aging 2025;11(1) View
  62. Qiang S, Zhang H, Liao Y, Zhang Y, Gu Y, Wang Y, Xu Z, Shi H, Han N, Yu H. Application of Large Language Models in Stroke Rehabilitation Health Education: 2-Phase Study. Journal of Medical Internet Research 2025;27:e73226 View
  63. Zhou J, Cheng Y, He S, Chen Y, Chen H. Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1. MedComm – Future Medicine 2025;4(2) View
  64. Borgonovo F, Matsuo T, Petri F, Amin Alavi S, Mazudie Ndjonko L, Gori A, Berbari E. Battle of the Bots: Solving Clinical Cases in Osteoarticular Infections With Large Language Models. Mayo Clinic Proceedings: Digital Health 2025;3(3):100230 View
  65. Ejas F, Khan S, Mujahid A, AlJoker F, Mautong H, Alvarado-Villa G, Kashyap A, Yasir M, Nigatu K, Jain N, Iyer N, Sandhu A, Sharafat S, Yahya S, Ghaly M, Ibrar I, Singh A, Grewal H, Huespe I, Mehta P, Arshad Z, Kashyap R, Nawaz F. Medical Students’ Perceptions of Large Language Models in Healthcare: A Multinational Cross-Sectional Study. Journal of Medical Education and Curricular Development 2025;12 View
  66. Su H, Sun Y, Li R, Zhang A, Yang Y, Xiao F, Duan Z, Chen J, Hu Q, Yang T, Xu B, Zhang Q, Zhao J, Li Y, Li H. Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis. Journal of Medical Internet Research 2025;27:e72062 View
  67. Alkalbani A, Alrawahi A, Salah A, Haghighi V, Zhang Y, Alkindi S, Sheng Q. A Systematic Review of Large Language Models in Medical Specialties: Applications, Challenges and Future Directions. Information 2025;16(6):489 View
  68. See Y, Lim K, Au W, Chia S, Fan X, Li Z. The Use of Large Language Models in Ophthalmology: A Scoping Review on Current Use-Cases and Considerations for Future Works in This Field. Big Data and Cognitive Computing 2025;9(6):151 View
  69. Angyal V, Bertalan Á, Domján P, Dinya E. Exploring the possibilities and limitations of customized large language model to support and improve cervical cancer screening. BMC Medical Informatics and Decision Making 2025;25(1) View
  70. Shataer D, Cao S, Liu X, Aierken K, Bhattacharya P, Sinha A, Liu H. Application of Large Language Models in Traditional Chinese Medicine: A State-of-the-Art Review. The American Journal of Chinese Medicine 2025;53(04):973 View
  71. Urda-Cîmpean A, Leucuța D, Drugan C, Duțu A, Călinici T, Drugan T. Assessing the Accuracy of Diagnostic Capabilities of Large Language Models. Diagnostics 2025;15(13):1657 View
  72. Zhan L, Dang X, Xie Z, Zeng C, Wu W, Zhang X, Zhang L, Cai X. Evaluating GPT-4o in infectious disease diagnostics and management: A comparative study with residents and specialists on accuracy, completeness, and clinical support potential. DIGITAL HEALTH 2025;11 View
  73. Yang H, Li M, Zhou H, Xiao Y, Fang Q, Zhou S, Zhang R. Large Language Model Synergy for Ensemble Learning in Medical Question Answering: Design and Evaluation Study. Journal of Medical Internet Research 2025;27:e70080 View
  74. Báez J, Ahn E, Tamietti A, Victor B, Goldkind L. Clinical Social Workers’ Perceptions of Large Language Models in Practice: Resistance to Automation and Prospects for Integration. Journal of Evidence-Based Social Work 2025:1 View
  75. Artiaga J, Guevarra M, Sosuan G, Agnihotri A, Nagel I, Kalaw F. Large language models in ophthalmology: a scoping review on their utility for clinicians, researchers, patients, and educators. Eye 2025;39(15):2752 View
  76. Duan L, Li T, Li B, Li X, Fu D, Yang X, Cao K, Cai H. Application of large language models to natural language processing and image analysis tasks in dermatology: a systematic review. Intelligent Medicine 2025 View
  77. Elangovan K, Ong J, Jin L, Seng B, Kwan Y, Ng L, Zhong R, Ma J, Ke Y, Liu N, Giacomini K, Ting D, Bui T. Development and evaluation of a lightweight large language model chatbot for medication enquiry. PLOS Digital Health 2025;4(9):e0000961 View
  78. Salehin I, Tomal Ahmed Sajib M, Huda Badhon N, Sakibul Hassan Rifat M, Amin N, Nessa Moon N. Systematic Literature Review of LLM‐Large Language Model in Medical: Digital Health, Technology and Applications. Engineering Reports 2025;7(9) View
  79. Wu S, Miao Y, Mei J, Xiong S. The Rise of Artificial Intelligence in Orthopedics: A Bibliometric and Visualization Analysis. Journal of Multidisciplinary Healthcare 2025;Volume 18:6037 View
  80. Jaleel A, Aziz U, Farid G, Zahid Bashir M, Mirza T, Khizar Abbas S, Aslam S, Sikander R. Evaluating the Potential and Accuracy of ChatGPT-3.5 and 4.0 in Medical Licensing and In-Training Examinations: Systematic Review and Meta-Analysis. JMIR Medical Education 2025;11:e68070 View
  81. Tian M, Li S, Du W, Yang S, Zhao X, Xiong H, Li H, Lu M, Ying Y, Zhang J, Liao Q, Yang D, Guo F. Novel Insights into the Application of Large Language Models in the Diagnosis and Treatment of Complex Cardiovascular Diseases: A Comparative Study. Journal of Medical Systems 2025;49(1) View
  82. XUE T, BAI Y, ZHANG T. Intelligent technology-driven orthopedic rehabilitation: Progress and applications. SCIENTIA SINICA Technologica 2025;55(10):1659 View
  83. Goh S, Mariappan R, Soo Woon Tan G, Yao J, Hew F, Yeo Y, Guan Wei Ow S, Koh W, Kumarakulasingh N, Tan T, Tai B, Hartman M, Ngiam K. Augmenting Large Language Models With National Comprehensive Cancer Network Guidelines for Improved and Standardized Adjuvant Therapy Recommendations in Postoperative Breast Cancer Cases. JCO Clinical Cancer Informatics 2025;(9) View
  84. Büyükceran E, Seyfettin A, Babatürk A, Eskalen Z, Özkan M, Kaymaz E, Mersin H, Dönmez F. Text-based prediction of ımmunohistochemical biomarkers in breast cancer using a generative large language model: a retrospective study. Health Information Science and Systems 2025;14(1) View
  85. Zhou H, Zhu Z, Oh K, Hong S. Empowering Informal Caregivers of Persons with Early-Stage Dementia by Large Language Models: Mixed Methods Evaluation (Preprint). JMIR Formative Research 2025 View
  86. Pohlmann P, Glienke M, Sandkamp R, Gratzke C, Schmal H, Schoeb D, Fuchs A. Assessing the Efficacy of Ortho GPT: A Comparative Study with Medical Students and General LLMs on Orthopedic Examination Questions. Bioengineering 2025;12(12):1290 View
  87. Sieciński K, Oliński M. A Multidisciplinary Bibliometric Analysis of Differences and Commonalities Between GenAI in Science. Publications 2025;13(4):67 View
  88. Akkus Yildirim B, Tutun B, Durak G, Yildirim E, Uysal E, Erturk S, Bagci U. Large language models standardize the interpretation of complex oncology guidelines for brain metastases. Communications Medicine 2025 View
  89. Kaczmarczyk R, Pieroh P, Koob S, Fröschen F, Scheidt S, Welle K, Martin R, Roos J. Application of Vision-Language Models in the Automatic Recognition of Bone Tumors on Radiographs: A Retrospective Study. AI 2025;6(12):327 View

Books/Policy Documents

  1. Berlincioni L, Cultrera L, Becattini F, Bertini M, Del Bimbo A. Computer Vision – ECCV 2024 Workshops. View

Conference Proceedings

  1. Mohammed H, Kiss G, Serrano J, Lindseth F. 2025 IEEE Symposium on Computational Intelligence in Health and Medicine (CIHM). Comparative Analysis and Evaluation of Well-Being Activity-Infused Fine-Tuned Language Models with Benchmark Models View
  2. Zhao S, Wang J. Proceedings of the 34th ACM SIGSOFT International Symposium on Software Testing and Analysis. Best practice for supply chain in LLM-assisted medical applications View
  3. Liu Z, Hu L, Zhou T, Tang Y, Cai Z. 2025 IEEE Symposium on Security and Privacy (SP). Prevalence Overshadows Concerns? Understanding Chinese Users' Privacy Awareness and Expectations Towards LLM-Based Healthcare Consultation View
  4. Zhang T, Chung T, Dey A, Bae S. 2025 International Conference on Activity and Behavior Computing (ABC). AXAI-CDSS: An Affective Explainable AI-Driven Clinical Decision Support System for Cannabis Use View
  5. Preiß N, Westner M. Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS). From Agents to Copilots: A Systematic Review of Digital Assistant Technology Adoption in Proprietary Productivity Software View