Published on in Vol 26 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/53164, first published .
Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis

Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis

Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis

Journals

  1. Ren X, Wei Q. Sligpt: A Large Language Model-Based Approach for Data Dependency Analysis on Solidity Smart Contracts. Software 2024;3(3):345 View
  2. Adam G, DeYoung J, Paul A, Saldanha I, Balk E, Trikalinos T, Wallace B. Literature search sandbox: a large language model that generates search queries for systematic reviews. JAMIA Open 2024;7(3) View
  3. Zhang D, Ma Z, Gong R, Lian L, Li Y, He Z, Han Y, Hui J, Huang J, Jiang J, Weng W, Feng J. Using Natural Language Processing (GPT-4) for Computed Tomography Image Analysis of Cerebral Hemorrhages in Radiology: Retrospective Analysis. Journal of Medical Internet Research 2024;26:e58741 View
  4. Mija D, Kehlet H, Rosero E, Joshi G. Evaluating the role of ChatGPT in perioperative pain management versus procedure-specific postoperative pain management (PROSPECT) recommendations. British Journal of Anaesthesia 2024;133(6):1318 View
  5. Sanduleanu S, Ersahin K, Bremm J, Talibova N, Damer T, Erdogan M, Kottlors J, Goertz L, Bruns C, Maintz D, Abdullayev N. Feasibility of GPT-3.5 versus Machine Learning for Automated Surgical Decision-Making Determination: A Multicenter Study on Suspected Appendicitis. AI 2024;5(4):1942 View
  6. Thimm H, Rasmussen K. ChatGPT discovery of green image damaging information for large production companies. Journal of Cleaner Production 2024;478:143978 View
  7. Yan L, Greiff S, Teuber Z, Gašević D. Promises and challenges of generative artificial intelligence for human learning. Nature Human Behaviour 2024;8(10):1839 View
  8. Ivanisenko T, Demenkov P, Ivanisenko V. An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models. International Journal of Molecular Sciences 2024;25(21):11811 View
  9. Iglesias S, Earp B, Voinea C, Mann S, Zahiu A, Jecker N, Savulescu J. Digital Doppelgängers and Lifespan Extension: What Matters?. The American Journal of Bioethics 2025;25(2):95 View
  10. Tomczyk P, Brüggemann P, Mergner N, Petrescu M. Are AI tools better than traditional tools in literature searching? Evidence from E-commerce research. Journal of Librarianship and Information Science 2024 View
  11. Goto H, Shiraishi Y, Okada S. Performance Evaluation of GPT-4o and o1-Preview Using the Certification Examination for the Japanese 'Operations Chief of Radiography With X-rays'. Cureus 2024 View
  12. Gunesli I, Aksun S, Fathelbab J, Yildiz B. Comparative evaluation of ChatGPT-4, ChatGPT-3.5 and Google Gemini on PCOS assessment and management based on recommendations from the 2023 guideline. Endocrine 2024;88(1):315 View
  13. Atkinson C. AI-pocalypse now: Automating the systematic literature review with SPARK (Systematic processing and automated review Kit) – gathering, organising, filtering, and scaffolding.. MethodsX 2025;14:103129 View
  14. Cohen J, Moher D. Generative artificial intelligence and academic writing: friend or foe?. Journal of Clinical Epidemiology 2025;179:111646 View
  15. Gao C, Hu X, Gao S, Xia X, Jin Z. The Current Challenges of Software Engineering in the Era of Large Language Models. ACM Transactions on Software Engineering and Methodology 2025;34(5):1 View
  16. Zhang C, Zhao T, Saraoglu H, Louton D. A Benchmark Comparison of a Domain-Focused Pipeline with ChatGPT. Journal of Computer Information Systems 2025:1 View
  17. Jones N. AI hallucinations can’t be stopped — but these techniques can limit their damage. Nature 2025;637(8047):778 View
  18. Ratuszniak A, Gos E, Lorens A, Skarzynski P, Skarzynski H, Jedrzejczak W. Performance of ChatGPT in Pediatric Audiology as Rated by Students and Experts. Journal of Clinical Medicine 2025;14(3):875 View
  19. Cinquin O. Steering veridical large language model analyses by correcting and enriching generated database queries: first steps toward ChatGPT bioinformatics. Briefings in Bioinformatics 2024;26(1) View
  20. Hidalgo-Betanzos J, Prol-Godoy I, Terés-Zubiaga J, Briones-Llorente R, Martín-Garín A. Can ChatGPT AI Replace or Contribute to Experts’ Diagnosis for Renovation Measures Identification?. Buildings 2025;15(3):421 View
  21. Erdem O, Hassett K, Egriboyun F. Hallucination in AI-generated financial literature reviews: evaluating bibliographic accuracy. International Journal of Data Science and Analytics 2025;20(5):4501 View
  22. Bracken A, Reilly C, Feeley A, Sheehan E, Merghani K, Feeley I. Artificial Intelligence (AI) – Powered Documentation Systems in Healthcare: A Systematic Review. Journal of Medical Systems 2025;49(1) View
  23. Kiyomiya K, Aomori T, Ohtani H. Medication counseling for OTC drugs using customized ChatGPT-4: Comparison with ChatGPT-3.5 and ChatGPT-4o. DIGITAL HEALTH 2025;11 View
  24. Gupta A, Basha A, Sontam T, Hlavinka W, Croen B, Abdou C, Abdullah M, Hamilton R. Evolution of patient education materials from large-language artificial intelligence models on complex regional pain syndrome: are patients learning?. Baylor University Medical Center Proceedings 2025;38(3):221 View
  25. Lorenc-Kukula K. Cutting-edge AI tools revolutionizing scientific research in life sciences. BioTechnologia 2025 View
  26. Scheinkman R, Kraft G, Kasheri E, Nouri K. Determining if ChatGPT‐4o Simplification of Mohs Postoperative Instructions Affects Instruction Quality. International Journal of Dermatology 2025;64(8):1524 View
  27. Jacob C, Kerrigan P, Bastos M. The chat-chamber effect: Trusting the AI hallucination. Big Data & Society 2025;12(1) View
  28. Šuto Pavičić J, Marušić A, Buljan I. Using ChatGPT to Improve the Presentation of Plain Language Summaries of Cochrane Systematic Reviews About Oncology Interventions: Cross-Sectional Study. JMIR Cancer 2025;11:e63347 View
  29. Kula B, Kula A, Bagcier F, Alyanak B. Artificial intelligence solutions for temporomandibular joint disorders: Contributions and future potential of ChatGPT. Korean Journal of Orthodontics 2025;55(2):131 View
  30. Velasco A, Coelho E, Paes V, Castro R, Oliveira R. A Inteligência Artificial na saúde: os Chatbots e suas aplicações na educação e pesquisa científica médicas. Caderno Pedagógico 2025;22(5):e15048 View
  31. Mun I, Hwang K. Exploring the Influence of Prompt Self-Efficacy: Accurate and Customized Information, Perceived Ease of Use, Satisfaction, and Continuance Intention to Use ChatGPT. International Journal of Human–Computer Interaction 2025;41(22):13952 View
  32. Jongkind R, Elings E, Joukes E, Broens T, Leopold H, Wiesman F, Meinema J. Is your curriculum GenAI-proof? A method for GenAI impact assessment and a case study. MedEdPublish 2025;15:11 View
  33. Katz G, Zloto O, Hostovsky A, Huna-Baron R, Ben-Bassat Mizrachi I, Burgansky Z, Skaat A, Vishnevskia-Dai V, Fabian I, Sagiv O, Priel A, Glicksberg B, Klang E. Chat GPT vs an experienced ophthalmologist: evaluating chatbot writing performance in ophthalmology. Eye 2025;39(10):1948 View
  34. Ostrovsky A. Evaluating a large language model's accuracy in chest X-ray interpretation for acute thoracic conditions. The American Journal of Emergency Medicine 2025;93:99 View
  35. Urban M, Brom C, Lukavský J, Děchtěrenko F, Hein V, Svacha F, Kmoníčková P, Urban K. “ChatGPT can make mistakes. Check important info.” Epistemic beliefs and metacognitive accuracy in students' integration of ChatGPT content into academic writing. British Journal of Educational Technology 2025;56(5):1897 View
  36. Matsutomo N, Fukami M, Yamamoto T. Can interactive artificial intelligence be used for patient explanations of nuclear medicine examinations in Japanese?. Annals of Nuclear Medicine 2025;39(8):774 View
  37. Li H, Huang J, Liu K, Liu J, Liu Q, Zhou Z, Zong Z, Mao S. ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer. European Journal of Surgical Oncology 2025;51(8):110096 View
  38. Lee Y, Oh J, Lee D, Kang M, Lee S. Prompt engineering in ChatGPT for literature review: practical guide exemplified with studies on white phosphors. Scientific Reports 2025;15(1) View
  39. Scherbakov D, Hubig N, Jansari V, Bakumenko A, Lenert L. The emergence of large language models as tools in literature reviews: a large language model-assisted systematic review. Journal of the American Medical Informatics Association 2025;32(6):1071 View
  40. Zhou J, Cheng Y, He S, Chen Y, Chen H. Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1. MedComm – Future Medicine 2025;4(2) View
  41. Norbu P, Wangdi S. Assessing information literacy among health professional students: a case study from Khesar Gyalpo University in Bhutan. Information Discovery and Delivery 2025 View
  42. Urban M, Lukavský J, Brom C, Hein V, Svacha F, Děchtěrenko F, Urban K. Prompting for creative problem-solving: A process-mining study. Learning and Instruction 2025;99:102156 View
  43. Daraio C, Di Leo S, Ferazzoli F. Pitfalls, benefits, and comparative analysis of artificial intelligence ChatBots in the systematic review process. International Transactions in Operational Research 2026;33(2):719 View
  44. Chen J, Mokmin N, Qi S. Generative AI-powered arts-based learning in middle school history: Impact on achievement, motivation, and cognitive load. The Journal of Educational Research 2025;118(6):688 View
  45. Adisa I, Adefisayo A. Middle school students’ perspectives on adopting generative AI in K-12 education. The Journal of Educational Research 2025;118(6):724 View
  46. Guo A, Canagasingham A, Rasiah K, Chalasani V, Mundy J, Chung A. The Growing Role of Artificial Intelligence in Surgical Education: ChatGPT Undertakes the Australian Generic Surgical Sciences Examination. ANZ Journal of Surgery 2025;95(7-8):1350 View
  47. Li C, Jia W, Chu Y, Menge F, Speer T, Reißfelder C, Hohenberger P, Jakob J, Yang C. Improving Accuracy and Source Transparency in Responses to Soft Tissue Sarcoma Queries Using GPT-4o Enhanced with German Evidence-Based Guidelines. Oncology Research and Treatment 2025;48(6):351 View
  48. Liu Y, Li H, Ouyang J, Xue Z, Wang M, He H, Song B, Zheng X, Gan W. Evaluating Large Language Models for Preoperative Patient Education in Superior Capsular Reconstruction: Comparative Study of Claude, GPT, and Gemini. JMIR Perioperative Medicine 2025;8:e70047 View
  49. Low Y, Jackson M, Hyde R, Brown R, Sanghavi N, Baldwin J, Pike C, Muralidharan J, Hui G, Alexander N, Hassan H, Nene R, Pike M, Pokrzywa C, Vedak S, Yan A, Yao D, Zipursky A, Dinh C, Ballentine P, Derieg D, Polony V, Chawdry R, Davies J, Hyde B, Shah N, Gombar S. Answering real-world clinical questions using large language model, retrieval-augmented generation, and agentic systems. DIGITAL HEALTH 2025;11 View
  50. Linardon J, Messer M, Anderson C, Liu C, McClure Z, Jarman H, Goldberg S, Torous J. Role of large language models in mental health research: an international survey of researchers’ practices and perspectives. BMJ Mental Health 2025;28(1):e301787 View
  51. Trewren T, Fitzgerald N, Jaensch S, Nguyen O, Tsymbal A, Gao C, Stretton B, Anderson S, Lin D, Winterton D, Gheihman G, Ludbrook G, Bratkovic K, Bacchi S. Artificial intelligence in perioperative medicine education: A feasibility test of case-based learning. Journal of Perioperative Practice 2025 View
  52. Boltaboyeva A, Baigarayeva Z, Imanbek B, Ozhikenov K, Getahun A, Aidarova T, Karymsakova N. A Review of Innovative Medical Rehabilitation Systems with Scalable AI-Assisted Platforms for Sensor-Based Recovery Monitoring. Applied Sciences 2025;15(12):6840 View
  53. Queiroz A, Sartori L, Lima G, Moraes R. Editorial policies for use and acknowledgment of artificial intelligence in dental journals. Journal of Dentistry 2025;161:105923 View
  54. Chen J, Hsu C, Tsai Y. Intelligent Decentralized Governance: A Case Study of KlimaDAO Decision-Making. Electronics 2025;14(12):2462 View
  55. Joranger P, Rivenes Lafontan S, Brevik A. Evaluating a Large Language Model’s Ability to Synthesize a Health Science Master’s Thesis: Case Study. JMIR Formative Research 2025;9:e73248 View
  56. Wada A, Tanaka Y, Nishizawa M, Yamamoto A, Akashi T, Hagiwara A, Hayakawa Y, Kikuta J, Shimoji K, Sano K, Kamagata K, Nakanishi A, Aoki S. Retrieval-augmented generation elevates local LLM quality in radiology contrast media consultation. npj Digital Medicine 2025;8(1) View
  57. Madsen D, Toston D. ChatGPT and Digital Transformation: A Narrative Review of Its Role in Health, Education, and the Economy. Digital 2025;5(3):24 View
  58. Gilvaz V, Sudheer A, Reginato A. Emerging Artificial Intelligence Innovations in Rheumatoid Arthritis and Challenges to Clinical Adoption. Current Rheumatology Reports 2025;27(1) View
  59. Zhang Z, Scroggins J, Harkins S, Hulchafo I, Moen H, Tadiello M, Barcelona V, Topaz M. Toward equitable documentation: Evaluating ChatGPT’s role in identifying and rephrasing stigmatizing language in electronic health records. Nursing Outlook 2025;73(4):102472 View
  60. Tsai C, Lin Y, Hou J, Tsai S, Yeh P, Kao C. Optimizing patient education for radioactive iodine therapy and the role of ChatGPT incorporating chain-of-thought technique: ChatGPT questionnaire. DIGITAL HEALTH 2025;11 View
  61. Sun R, Tang M, Zhou J, Loan N, Wang C. The dark tetrad as associated factors in generative AI academic misconduct: insights beyond personal attribute variables. Frontiers in Education 2025;10 View
  62. Lee J, Yoon J. Current Perspectives on the Artificial Intelligence in Critical Care Medicine. Anesthesiology Clinics 2025;43(3):507 View
  63. Gao Y, Xu Q, Zhang O, Wang H, Wang Y, Wang J, Chen X. Large language models: unlocking new potential in patient education for thyroid eye disease. Endocrine 2025;90(2):689 View
  64. Triposkiadis F, Brutsaert D. Evidence-Based Medicine: Past, Present, Future. Journal of Clinical Medicine 2025;14(14):5094 View
  65. Camlet A, Kusiak A, Ossowska A, Świetlik D. Advances in Periodontal Diagnostics: Application of MultiModal Language Models in Visual Interpretation of Panoramic Radiographs. Diagnostics 2025;15(15):1851 View
  66. Peykani P, Ramezanlou F, Tanasescu C, Ghanidel S. Large Language Models: A Structured Taxonomy and Review of Challenges, Limitations, Solutions, and Future Directions. Applied Sciences 2025;15(14):8103 View
  67. Bayani A, Epoh Ewane L, Oliveira dos Anjos D, Mac-Seing M, Nikiema J. Leveraging open-source large language models (LLMs) in scoping reviews: a case study on disability and AI applications. International Journal of Medical Informatics 2025;204:106048 View
  68. Omar M, Sorin V, Collins J, Reich D, Freeman R, Gavin N, Charney A, Stump L, Bragazzi N, Nadkarni G, Klang E. Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinical decision support. Communications Medicine 2025;5(1) View
  69. Leung T, Coristine A, Benis A. AI Scribes in Health Care: Balancing Transformative Potential With Responsible Integration. JMIR Medical Informatics 2025;13:e80898 View
  70. King C, Lopour B. Fostering Critical Thinking During Use of Generative AI: A Novel Learning Module for Ideation in Biomedical Engineering Design. Biomedical Engineering Education 2025;5(2):397 View
  71. Wang Z, Cao L, Danek B, Jin Q, Lu Z, Sun J. Accelerating clinical evidence synthesis with large language models. npj Digital Medicine 2025;8(1) View
  72. Asiri S. Assessing the Reliability of ChatGPT and Gemini in Identifying Relevant Orthodontic Literature. European Journal of General Dentistry 2025 View
  73. Tensen D, Grainger P, Graham W. Using AI to generate formative feedback in doctoral education. Assessment & Evaluation in Higher Education 2025:1 View
  74. Xu S, Zhao Z, Liu X, Meng X. A comparative study of screening performance between abstrackr and GPT models: Systematic review and contextual analysis. BMC Medical Informatics and Decision Making 2025;25(1) View
  75. Ng J. Prompt engineering for generative artificial intelligence chatbots in health research: A practical guide for traditional, complementary, and integrative medicine researchers. Integrative Medicine Research 2025;14(4):101222 View
  76. . Artificial Intelligence in Scholarly Publishing: New Tools, Same Standards. Clinical Journal of Oncology Nursing 2025 View
  77. Penny-Dimri J, Bachmann M, Cooke W, Mathewlynn S, Dockree S, Tolladay J, Kossen J, Li L, Gal Y, Davis Jones G. Measuring large language model uncertainty in women's health using semantic entropy and perplexity: a comparative study. The Lancet Obstetrics, Gynaecology, & Women's Health 2025;1(1):e47 View
  78. Campos V, Prudente T, Leão L, da Costa M, Oliva H, Monteiro-Junior R. Analyses of different prescriptions for health using artificial intelligence: a critical approach based on the international guidelines of health institutions. Health Information Science and Systems 2025;13(1) View
  79. Bezrukova K, Griffith T. Post-Public AI: Research in Groups and Teams. Small Group Research 2025;56(5):799 View
  80. Russinovich M, Salem A, Zanella-Béguelin S, Zunger Y. The Price of Intelligence. Communications of the ACM 2025;68(9):46 View
  81. Jain A, Nimonkar P, Jadhav P. Citation integrity in the age of AI: evaluating the risks of reference hallucination in maxillofacial literature. Journal of Cranio-Maxillofacial Surgery 2025;53(10):1871 View
  82. Hasnain M, Aurangzeb K, Alhussein M, Ghani I, Mahmood M. AI in conjunctivitis research: assessing ChatGPT and DeepSeek for etiology, intervention, and citation integrity via hallucination rate analysis. Frontiers in Artificial Intelligence 2025;8 View
  83. Rashid M, Yi C, Sathapanasiri T, Udayachalerm S, Boonpattharatthiti K, Insuk S, Veettil S, Lai N, Chaiyakunapruk N, Dhippayom T, Rashid M, Cheng S, Ming Lai N, Lawin S, Limhensin P, Wechkunanukul K, Mayang N, Rattanachaisit N, Ye X. Role of Generative Artificial Intelligence in Assisting Systematic Review Process in Health Research: A Systematic Review. Value in Health 2025;28(11):1665 View
  84. Seth I, Marcaccini G, Lim B, Novo J, Bacchi S, Cuomo R, Ross R, Rozen W. The Temporal Evolution of Large Language Model Performance: A Comparative Analysis of Past and Current Outputs in Scientific and Medical Research. Informatics 2025;12(3):86 View
  85. Shen Z, Yu C. How Technology Advances Research and Practice in Autism Spectrum Disorder: A Narrative Review on Early Detection, Subtype Stratification, and Intervention. Brain Sciences 2025;15(8):890 View
  86. Tekin S, Oguz S, Dagdelen S. ChatGPT-4o as a digital health tool for diabetes technology education: insights on reliability, quality, and readability. Endocrine 2025;90(2):652 View
  87. Shor R, Greene E, Sumberg L, Weingrad A. AI Tools in Academia: Evaluating NotebookLM as a Tool for Conducting Literature Reviews. Psychiatry 2025:1 View
  88. Li K, Peng Y, Li L, Liu B, Huang Z. Evaluating ChatGPT’s Utility in Biologic Therapy for Systemic Lupus Erythematosus: Comparative Study of ChatGPT and Google Web Search. JMIR Formative Research 2025;9:e76458 View
  89. Dong Y, Zhang Z, Zhi Y, Li X, Guo T, He L, Zhao S, Yang X, Tang J, Zhong W, Niu Q, Ma M, Huang Z, Mao Y. Evaluating large language models' performance in answering common questions on drug-induced liver injury. JHEP Reports 2025;7(12):101579 View
  90. Insuk S, Boonpattharatthiti K, Booncharoen C, Chaipitak P, Rashid M, Veettil S, Lai N, Chaiyakunapruk N, Dhippayom T. How Well Do ChatGPT and Claude Perform in Study Selection for Systematic Review in Obstetrics. Journal of Medical Systems 2025;49(1) View
  91. Antisdel J, Miller W, Groves D. Data Mining Trauma: AI-Assisted Qualitative Study of Cyber Victimization on Reddit. JMIR Infodemiology 2025;5:e75493 View
  92. Tanas Y, Gasper G, Rashidi K, Swed S. Evaluating large language models in patient education on facial plastic surgery: a standardized protocol. International Journal of Surgery Protocols 2025;29(3):108 View
  93. Kurulkar G, Ingale S, Shinde A, Jangale Y, Itkar S. Information Retrieval System for Automating Quiz Generation and Evaluation Using Large Language Models. Cureus Journal of Computer Science 2025 View
  94. Balcells D. Co-intelligent Design of Catalysis Research with Large Language Models: Hype or Reality?. ACS Catalysis 2025;15(18):16412 View
  95. Boonrit N, Thaweechai A, Kessarin B, Ruanglertboon W. Capabilities of Large Language Models in Detecting and Managing Drug Interactions During Medication Reviews: Potential Implications as A Digital Assistant for Pragmatic Pharmacy Practice in Thailand. JACCP: JOURNAL OF THE AMERICAN COLLEGE OF CLINICAL PHARMACY 2025;8(11):1117 View
  96. Soujah C, Bejjani C, Adra N, Blackburn L. Artificial Intelligence as a Drug Information Resource: Limitations and Strategies to Optimize in Pharmacy Practice. Hospital Pharmacy 2025 View
  97. Tosca E, Aiello L, De Carlo A, Magni P. Pharmacometrics in the Age of Large Language Models: A Vision of the Future. Pharmaceutics 2025;17(10):1274 View
  98. Dogru-Huzmeli E, Moore-Vasram S, Phadke C, Shafiee E, Amanullah S. Evaluating ChatGPT’s ability to simplify scientific abstracts for clinicians and the public. Scientific Reports 2025;15(1) View
  99. Hohagen F. Artifizielle Intelligenz in der Psychotherapie – werden Psychotherapeut*innen bald überflüssig?. PSYCH up2date 2025;19(05):351 View
  100. He Z, Zhao L, Li G, Wang J, Cai S, Tu P, Chen J, Wu J, Zhang J, Chen R, Huang Y, Pan X, Chen W. Comparative performance evaluation of large language models in answering esophageal cancer-related questions: a multi-model assessment study. Frontiers in Digital Health 2025;7 View
  101. Huang Y, Yang G, Shen Y, Chen H, Wu W, Li X, Wu Y, Zhang K, Xu J, Zhang J. Application of Large Language Models in Complex Clinical Cases: Cross-Sectional Evaluation Study. JMIR Medical Informatics 2025;13:e73941 View
  102. Drummond D, Girault A, Gonsard A. ChatGPT and other large language models for childhood asthma. Paediatric Respiratory Reviews 2025 View
  103. Leiva-Araos A, Kalasapudi V, Jiang A, Kaushal H. Evaluating Smart Building Features for Fire, Electrical, and Life Safety: A Rapid Human-LLM Framework for Literature Review and Research Mapping. IEEE Access 2025;13:173312 View
  104. Wu Y, Hu P, Wang D. The AI Annotator: Large Language Models’ Potential in Scoring Sustainability Reports. Systems 2025;13(10):899 View
  105. Celik S. Integrating artificial intelligence into scientific writing: a narrative review for clinical and surgical researchers. The American Journal of Surgery 2025;250:116657 View
  106. Rai M, Ngaw M, Nannas N. Artificial Intelligence Performance in Introductory Biology: Passing Grades but Poor Performance at High Cognitive Complexity. Education Sciences 2025;15(10):1400 View
  107. Küçükuncular A. Learning with, rather than through, AI: co-designing science education for critical AI literacy. Frontiers in Education 2025;10 View
  108. Xu P, Gong X, Chen X, Zhang W, Yang J, Yan B, Yuan M, Zheng Y, He M, Shi D. Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat. Advances in Ophthalmology Practice and Research 2025 View
  109. Lee J. ChatGPT: how to use it and the pitfalls/cautions in academia. Annals of Pediatric Endocrinology & Metabolism 2025;30(5):229 View
  110. Thomas L, Romasanta A, Pujol Priego L. Jagged competencies: Measuring the reliability of generative AI in academic research. Journal of Business Research 2026;203:115804 View
  111. Aziz M, Brookhart M. Can Contemporary Large Language Models Provide the Domain Knowledge Needed for Causal Inference? Evaluating Automated Causal Graph Discovery Through an ASCVD Case Study. Clinical Epidemiology 2025;Volume 17:863 View
  112. Sökmen D, Albayrak A, Sertkaya Z, Başağa Y, Serefoglu E. Artificial intelligence meets medical rarity: evaluating ChatGPT’s responses on post-orgasmic illness syndrome. International Journal of Impotence Research 2025 View
  113. Civelekler M, Citirik M. Benchmarking Artificial Intelligence Models for Citation Accuracy in Neuro-Ophthalmological Disorders Research: A Comparative Analysis of Four Models. Journal of Hospital Librarianship 2025:1 View
  114. Harman A. Preparing Students for the Generative Artificial Intelligence Future: Integrating Artificial Intelligence Literacy in Sport Management Education. Sport Management Education Journal 2025:1 View
  115. C Calderon1,2,3 J, Robles-Velasco2,3 K, C Ferreira1,4 J. Artificial intelligence transforming healthcare research: opportunities, risks, and responsible use. Jornal Brasileiro de Pneumologia 2025:e20250433 View
  116. Inooka T, Ota H, Taki Y, Yasuda S, Sajiki A, Suzumura A, Shimizu H, Takeuchi J, Tomita R, Kominami T, Ushida H, Yuki K, Nishiguchi K. Evolving Consultation: Enhancing Ophthalmologic Diagnostic Performance Using Large Language Model. Ophthalmology Science 2025:101004 View
  117. Campuzano E, Chakraborty S. SustainTima: a task-oriented text-based conversational AI agent for sustainable fast fashion brands. International Journal of Fashion Design, Technology and Education 2025:1 View
  118. Voiculescu F, Darvasi P, Osmanlliu E, Krishnamoorthy P, Makri A. Natural Language AI Models and Pediatric Type 1 Diabetes: Can Chatbots Help With Diabetes Self-Management and Patient Education?. JMIR Diabetes 2025;10:e76986 View
  119. Liao Z, Huang J, Liu Y, Li F, Tang L, Cong L, Wang H, Luo S. Utility of Large Language Models for Congenital Microtia Reconstruction Education: Comparison of the Performance of Claude, GPT, and Gemini. Aesthetic Plastic Surgery 2025 View
  120. Niraula D, Shotande M, El Naqa I. Human-machine Interaction in the Age of Generative AI. The Cancer Journal 2025;31(6) View
  121. García-Peñalvo F. Tres escenarios para la IA en educación: del apoyo responsable a la cocreación. Education in the Knowledge Society (EKS) 2025;26:e32932 View
  122. Giuliani C, Benadi G, Engel F, Werner J, Watter M, Schwarzer G, Groß O, Zeiser R, Binder H, Kaier K. Identifying Biomedical Entities for Datasets in Scientific Articles: 4-Step Cache-Augmented Generation Approach Using GPT-4o and PubTator 3.0. JMIR Formative Research 2025;9:e73822 View
  123. Hooshiar M. Artificial intelligence reliability in implant dentistry: A comparative analysis of clinical accuracy and hallucination patterns across multiple language models. The Journal of Prosthetic Dentistry 2025 View
  124. Chen Y. Evaluating the potential of ChatGPT-reformulated essays as written feedback in L2 writing. Computers and Education: Artificial Intelligence 2025;9:100500 View
  125. Lupon E. Artificial intelligence and plastic surgery: Between innovation and responsibility. Annales de Chirurgie Plastique Esthétique 2025 View
  126. Blanc-Durand F, Koopman M, Patel S, Aldea M, Kather J. Will AI Write the Next "Chapter" in Literature Reviews?. Annals of Oncology 2025 View
  127. Ros T, Samuel A. Artificial Intelligence in Academic Writing: Tools and Techniques for Scholars to Succeed in the Publishing World. New Directions for Adult and Continuing Education 2025 View
  128. Cherrez‐Ojeda I, Zuberbier T, Rodas‐Valero G, Sanchez J, Rudenko M, Dramburg S, Demoly P, Caimmi D, Gómez R, Ramon G, Fouda G, Quimby K, Chong‐Neto H, Llosa O, Larco J, Monge Ortega O, Faytong‐Haro M, Pfaar O, Bousquet J, Robles‐Velasco K. Evaluation of the Quality and Reliability of ChatGPT‐4's Responses on Allergen Immunotherapy Using Validated Instruments for Health Information Quality Assessment. Clinical and Translational Allergy 2025;15(12) View
  129. Zade F, Ebrahimkhanlou A. Vision-Language Artificial Intelligence for Robotic-Based Monitoring: Concrete Defect Detection, Classification, and Localization in Two-Dimensional Maps. Journal of Computing in Civil Engineering 2026;40(2) View
  130. Mansoor I, Abdullah M, Rizwan M, Fraz M. Reasoning with large language models in medicine: a systematic review of techniques, challenges and clinical integration. Health Information Science and Systems 2025;14(1) View

Books/Policy Documents

  1. Caslini G, Gianotti M, Garzotto F. End-User Development. View
  2. Montenegro L, Gomes L, Machado J. Progress in Artificial Intelligence. View
  3. Acar S, Ellis A. Generative Artificial Intelligence and Creativity. View

Conference Proceedings

  1. Steinbach M, Bhandari S, Meyer J, Pardos Z. Proceedings of the Twelfth ACM Conference on Learning @ Scale. When LLMs Hallucinate: Examining the Effects of Erroneous Feedback in Math Tutoring Systems View
  2. Shi Q, Han Q, Soares C. 2025 IEEE International Conference on Digital Health (ICDH). C-PATH: Conversational Patient Assistance and Triage in Healthcare System View
  3. Cline D, Zgonc D, Vass W, Butkus M, Baideme M, Krueger B. 2025 ASEE Annual Conference & Exposition Proceedings. A New “Age of Generative AI” Paradigm for the Development and Management of Curricula in Undergraduate Environmental Engineering Programs View
  4. Hristov H, Bekirski K, Somova E, Ignatov A, Stavrev S, Poptolev Z. EEPES 2025. Approach and Tool for Creating Sustainable Learning Video Resources Through Integration of AI Subtitle Translator View
  5. Thang H, Vi T. 2025 International Conference on Circuit, Systems and Communication (ICCSC). AI in the Age of Scientific Saturation: A Survey of How LLMs Rediscover the Obvious View
  6. Hermann J, Jansen N, Dogangün A, van Ledden S. Proceedings of the 37th Australian Conference on Human-Computer Interaction. Implicit SDG Reasoning in LLMs: A Classroom-Oriented Benchmark View