Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis

Ren X, Wei Q. Sligpt: A Large Language Model-Based Approach for Data Dependency Analysis on Solidity Smart Contracts. Software 2024;3(3):345
Adam G, DeYoung J, Paul A, Saldanha I, Balk E, Trikalinos T, Wallace B. Literature search sandbox: a large language model that generates search queries for systematic reviews. JAMIA Open 2024;7(3)
Zhang D, Ma Z, Gong R, Lian L, Li Y, He Z, Han Y, Hui J, Huang J, Jiang J, Weng W, Feng J. Using Natural Language Processing (GPT-4) for Computed Tomography Image Analysis of Cerebral Hemorrhages in Radiology: Retrospective Analysis. Journal of Medical Internet Research 2024;26:e58741
Mija D, Kehlet H, Rosero E, Joshi G. Evaluating the role of ChatGPT in perioperative pain management versus procedure-specific postoperative pain management (PROSPECT) recommendations. British Journal of Anaesthesia 2024;133(6):1318
Sanduleanu S, Ersahin K, Bremm J, Talibova N, Damer T, Erdogan M, Kottlors J, Goertz L, Bruns C, Maintz D, Abdullayev N. Feasibility of GPT-3.5 versus Machine Learning for Automated Surgical Decision-Making Determination: A Multicenter Study on Suspected Appendicitis. AI 2024;5(4):1942
Thimm H, Rasmussen K. ChatGPT discovery of green image damaging information for large production companies. Journal of Cleaner Production 2024;478:143978
Yan L, Greiff S, Teuber Z, Gašević D. Promises and challenges of generative artificial intelligence for human learning. Nature Human Behaviour 2024;8(10):1839
Ivanisenko T, Demenkov P, Ivanisenko V. An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models. International Journal of Molecular Sciences 2024;25(21):11811
Iglesias S, Earp B, Voinea C, Mann S, Zahiu A, Jecker N, Savulescu J. Digital Doppelgängers and Lifespan Extension: What Matters? The American Journal of Bioethics 2025;25(2):95
Tomczyk P, Brüggemann P, Mergner N, Petrescu M. Are AI tools better than traditional tools in literature searching? Evidence from E-commerce research. Journal of Librarianship and Information Science 2026;58(1):135
Goto H, Shiraishi Y, Okada S. Performance Evaluation of GPT-4o and o1-Preview Using the Certification Examination for the Japanese 'Operations Chief of Radiography With X-rays'. Cureus 2024
Gunesli I, Aksun S, Fathelbab J, Yildiz B. Comparative evaluation of ChatGPT-4, ChatGPT-3.5 and Google Gemini on PCOS assessment and management based on recommendations from the 2023 guideline. Endocrine 2024;88(1):315
Atkinson C. AI-pocalypse now: Automating the systematic literature review with SPARK (Systematic processing and automated review Kit) – gathering, organising, filtering, and scaffolding. MethodsX 2025;14:103129
Cohen J, Moher D. Generative artificial intelligence and academic writing: friend or foe? Journal of Clinical Epidemiology 2025;179:111646
Gao C, Hu X, Gao S, Xia X, Jin Z. The Current Challenges of Software Engineering in the Era of Large Language Models. ACM Transactions on Software Engineering and Methodology 2025;34(5):1
Zhang C, Zhao T, Saraoglu H, Louton D. A Benchmark Comparison of a Domain-Focused Pipeline with ChatGPT. Journal of Computer Information Systems 2025:1
Jones N. AI hallucinations can’t be stopped — but these techniques can limit their damage. Nature 2025;637(8047):778
Ratuszniak A, Gos E, Lorens A, Skarzynski P, Skarzynski H, Jedrzejczak W. Performance of ChatGPT in Pediatric Audiology as Rated by Students and Experts. Journal of Clinical Medicine 2025;14(3):875
Cinquin O. Steering veridical large language model analyses by correcting and enriching generated database queries: first steps toward ChatGPT bioinformatics. Briefings in Bioinformatics 2024;26(1)
Hidalgo-Betanzos J, Prol-Godoy I, Terés-Zubiaga J, Briones-Llorente R, Martín-Garín A. Can ChatGPT AI Replace or Contribute to Experts’ Diagnosis for Renovation Measures Identification? Buildings 2025;15(3):421
Erdem O, Hassett K, Egriboyun F. Hallucination in AI-generated financial literature reviews: evaluating bibliographic accuracy. International Journal of Data Science and Analytics 2025;20(5):4501
Bracken A, Reilly C, Feeley A, Sheehan E, Merghani K, Feeley I. Artificial Intelligence (AI) – Powered Documentation Systems in Healthcare: A Systematic Review. Journal of Medical Systems 2025;49(1)
Kiyomiya K, Aomori T, Ohtani H. Medication counseling for OTC drugs using customized ChatGPT-4: Comparison with ChatGPT-3.5 and ChatGPT-4o. Digital Health 2025;11
Gupta A, Basha A, Sontam T, Hlavinka W, Croen B, Abdou C, Abdullah M, Hamilton R. Evolution of patient education materials from large-language artificial intelligence models on complex regional pain syndrome: are patients learning? Baylor University Medical Center Proceedings 2025;38(3):221
Lorenc-Kukula K. Cutting-edge AI tools revolutionizing scientific research in life sciences. BioTechnologia 2025
Scheinkman R, Kraft G, Kasheri E, Nouri K. Determining if ChatGPT‐4o Simplification of Mohs Postoperative Instructions Affects Instruction Quality. International Journal of Dermatology 2025;64(8):1524
Jacob C, Kerrigan P, Bastos M. The chat-chamber effect: Trusting the AI hallucination. Big Data & Society 2025;12(1)
Šuto Pavičić J, Marušić A, Buljan I. Using ChatGPT to Improve the Presentation of Plain Language Summaries of Cochrane Systematic Reviews About Oncology Interventions: Cross-Sectional Study. JMIR Cancer 2025;11:e63347
Kula B, Kula A, Bagcier F, Alyanak B. Artificial intelligence solutions for temporomandibular joint disorders: Contributions and future potential of ChatGPT. Korean Journal of Orthodontics 2025;55(2):131
Velasco A, Coelho E, Paes V, Castro R, Oliveira R. A Inteligência Artificial na saúde: os Chatbots e suas aplicações na educação e pesquisa científica médicas. Caderno Pedagógico 2025;22(5):e15048
Mun I, Hwang K. Exploring the Influence of Prompt Self-Efficacy: Accurate and Customized Information, Perceived Ease of Use, Satisfaction, and Continuance Intention to Use ChatGPT. International Journal of Human–Computer Interaction 2025;41(22):13952
Jongkind R, Elings E, Joukes E, Broens T, Leopold H, Wiesman F, Meinema J. Is your curriculum GenAI-proof? A method for GenAI impact assessment and a case study. MedEdPublish 2025;15:11
Katz G, Zloto O, Hostovsky A, Huna-Baron R, Ben-Bassat Mizrachi I, Burgansky Z, Skaat A, Vishnevskia-Dai V, Fabian I, Sagiv O, Priel A, Glicksberg B, Klang E. Chat GPT vs an experienced ophthalmologist: evaluating chatbot writing performance in ophthalmology. Eye 2025;39(10):1948
Ostrovsky A. Evaluating a large language model's accuracy in chest X-ray interpretation for acute thoracic conditions. The American Journal of Emergency Medicine 2025;93:99
Urban M, Brom C, Lukavský J, Děchtěrenko F, Hein V, Svacha F, Kmoníčková P, Urban K. “ChatGPT can make mistakes. Check important info.” Epistemic beliefs and metacognitive accuracy in students' integration of ChatGPT content into academic writing. British Journal of Educational Technology 2025;56(5):1897
Matsutomo N, Fukami M, Yamamoto T. Can interactive artificial intelligence be used for patient explanations of nuclear medicine examinations in Japanese? Annals of Nuclear Medicine 2025;39(8):774
Li H, Huang J, Liu K, Liu J, Liu Q, Zhou Z, Zong Z, Mao S. ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer. European Journal of Surgical Oncology 2025;51(8):110096
Lee Y, Oh J, Lee D, Kang M, Lee S. Prompt engineering in ChatGPT for literature review: practical guide exemplified with studies on white phosphors. Scientific Reports 2025;15(1)
Scherbakov D, Hubig N, Jansari V, Bakumenko A, Lenert L. The emergence of large language models as tools in literature reviews: a large language model-assisted systematic review. Journal of the American Medical Informatics Association 2025;32(6):1071
Zhou J, Cheng Y, He S, Chen Y, Chen H. Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1. MedComm – Future Medicine 2025;4(2)
Norbu P, Wangdi S. Assessing information literacy among health professional students: a case study from Khesar Gyalpo University in Bhutan. Information Discovery and Delivery 2025
Urban M, Lukavský J, Brom C, Hein V, Svacha F, Děchtěrenko F, Urban K. Prompting for creative problem-solving: A process-mining study. Learning and Instruction 2025;99:102156
Daraio C, Di Leo S, Ferazzoli F. Pitfalls, benefits, and comparative analysis of artificial intelligence ChatBots in the systematic review process. International Transactions in Operational Research 2026;33(2):719
Chen J, Mokmin N, Qi S. Generative AI-powered arts-based learning in middle school history: Impact on achievement, motivation, and cognitive load. The Journal of Educational Research 2025;118(6):688
Adisa I, Adefisayo A. Middle school students’ perspectives on adopting generative AI in K-12 education. The Journal of Educational Research 2025;118(6):724
Guo A, Canagasingham A, Rasiah K, Chalasani V, Mundy J, Chung A. The Growing Role of Artificial Intelligence in Surgical Education: ChatGPT Undertakes the Australian Generic Surgical Sciences Examination. ANZ Journal of Surgery 2025;95(7-8):1350
Li C, Jia W, Chu Y, Menge F, Speer T, Reißfelder C, Hohenberger P, Jakob J, Yang C. Improving Accuracy and Source Transparency in Responses to Soft Tissue Sarcoma Queries Using GPT-4o Enhanced with German Evidence-Based Guidelines. Oncology Research and Treatment 2025;48(6):351
Liu Y, Li H, Ouyang J, Xue Z, Wang M, He H, Song B, Zheng X, Gan W. Evaluating Large Language Models for Preoperative Patient Education in Superior Capsular Reconstruction: Comparative Study of Claude, GPT, and Gemini. JMIR Perioperative Medicine 2025;8:e70047
Low Y, Jackson M, Hyde R, Brown R, Sanghavi N, Baldwin J, Pike C, Muralidharan J, Hui G, Alexander N, Hassan H, Nene R, Pike M, Pokrzywa C, Vedak S, Yan A, Yao D, Zipursky A, Dinh C, Ballentine P, Derieg D, Polony V, Chawdry R, Davies J, Hyde B, Shah N, Gombar S. Answering real-world clinical questions using large language model, retrieval-augmented generation, and agentic systems. Digital Health 2025;11
Linardon J, Messer M, Anderson C, Liu C, McClure Z, Jarman H, Goldberg S, Torous J. Role of large language models in mental health research: an international survey of researchers’ practices and perspectives. BMJ Mental Health 2025;28(1):e301787
Trewren T, Fitzgerald N, Jaensch S, Nguyen O, Tsymbal A, Gao C, Stretton B, Anderson S, Lin D, Winterton D, Gheihman G, Ludbrook G, Bratkovic K, Bacchi S. Artificial intelligence in perioperative medicine education: A feasibility test of case-based learning. Journal of Perioperative Practice 2026;36(1-2):50
Boltaboyeva A, Baigarayeva Z, Imanbek B, Ozhikenov K, Getahun A, Aidarova T, Karymsakova N. A Review of Innovative Medical Rehabilitation Systems with Scalable AI-Assisted Platforms for Sensor-Based Recovery Monitoring. Applied Sciences 2025;15(12):6840
Queiroz A, Sartori L, Lima G, Moraes R. Editorial policies for use and acknowledgment of artificial intelligence in dental journals. Journal of Dentistry 2025;161:105923
Chen J, Hsu C, Tsai Y. Intelligent Decentralized Governance: A Case Study of KlimaDAO Decision-Making. Electronics 2025;14(12):2462
Joranger P, Rivenes Lafontan S, Brevik A. Evaluating a Large Language Model’s Ability to Synthesize a Health Science Master’s Thesis: Case Study. JMIR Formative Research 2025;9:e73248
Wada A, Tanaka Y, Nishizawa M, Yamamoto A, Akashi T, Hagiwara A, Hayakawa Y, Kikuta J, Shimoji K, Sano K, Kamagata K, Nakanishi A, Aoki S. Retrieval-augmented generation elevates local LLM quality in radiology contrast media consultation. npj Digital Medicine 2025;8(1)
Madsen D, Toston D. ChatGPT and Digital Transformation: A Narrative Review of Its Role in Health, Education, and the Economy. Digital 2025;5(3):24
Gilvaz V, Sudheer A, Reginato A. Emerging Artificial Intelligence Innovations in Rheumatoid Arthritis and Challenges to Clinical Adoption. Current Rheumatology Reports 2025;27(1)
Zhang Z, Scroggins J, Harkins S, Hulchafo I, Moen H, Tadiello M, Barcelona V, Topaz M. Toward equitable documentation: Evaluating ChatGPT’s role in identifying and rephrasing stigmatizing language in electronic health records. Nursing Outlook 2025;73(4):102472
Tsai C, Lin Y, Hou J, Tsai S, Yeh P, Kao C. Optimizing patient education for radioactive iodine therapy and the role of ChatGPT incorporating chain-of-thought technique: ChatGPT questionnaire. Digital Health 2025;11
Sun R, Tang M, Zhou J, Loan N, Wang C. The dark tetrad as associated factors in generative AI academic misconduct: insights beyond personal attribute variables. Frontiers in Education 2025;10
Lee J, Yoon J. Current Perspectives on the Artificial Intelligence in Critical Care Medicine. Anesthesiology Clinics 2025;43(3):507
Gao Y, Xu Q, Zhang O, Wang H, Wang Y, Wang J, Chen X. Large language models: unlocking new potential in patient education for thyroid eye disease. Endocrine 2025;90(2):689
Triposkiadis F, Brutsaert D. Evidence-Based Medicine: Past, Present, Future. Journal of Clinical Medicine 2025;14(14):5094
Camlet A, Kusiak A, Ossowska A, Świetlik D. Advances in Periodontal Diagnostics: Application of MultiModal Language Models in Visual Interpretation of Panoramic Radiographs. Diagnostics 2025;15(15):1851
Peykani P, Ramezanlou F, Tanasescu C, Ghanidel S. Large Language Models: A Structured Taxonomy and Review of Challenges, Limitations, Solutions, and Future Directions. Applied Sciences 2025;15(14):8103
Bayani A, Epoh Ewane L, Oliveira dos Anjos D, Mac-Seing M, Nikiema J. Leveraging open-source large language models (LLMs) in scoping reviews: a case study on disability and AI applications. International Journal of Medical Informatics 2025;204:106048
Omar M, Sorin V, Collins J, Reich D, Freeman R, Gavin N, Charney A, Stump L, Bragazzi N, Nadkarni G, Klang E. Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinical decision support. Communications Medicine 2025;5(1)
Leung T, Coristine A, Benis A. AI Scribes in Health Care: Balancing Transformative Potential With Responsible Integration. JMIR Medical Informatics 2025;13:e80898
King C, Lopour B. Fostering Critical Thinking During Use of Generative AI: A Novel Learning Module for Ideation in Biomedical Engineering Design. Biomedical Engineering Education 2025;5(2):397
Wang Z, Cao L, Danek B, Jin Q, Lu Z, Sun J. Accelerating clinical evidence synthesis with large language models. npj Digital Medicine 2025;8(1)
Asiri S. Assessing the Reliability of ChatGPT and Gemini in Identifying Relevant Orthodontic Literature. European Journal of General Dentistry 2026;15(02):217
Tensen D, Grainger P, Graham W. Using AI to generate formative feedback in doctoral education. Assessment & Evaluation in Higher Education 2026;51(3):476
Xu S, Zhao Z, Liu X, Meng X. A comparative study of screening performance between abstrackr and GPT models: Systematic review and contextual analysis. BMC Medical Informatics and Decision Making 2025;25(1)
Ng J. Prompt engineering for generative artificial intelligence chatbots in health research: A practical guide for traditional, complementary, and integrative medicine researchers. Integrative Medicine Research 2025;14(4):101222
Artificial Intelligence in Scholarly Publishing: New Tools, Same Standards. Clinical Journal of Oncology Nursing 2025
Penny-Dimri J, Bachmann M, Cooke W, Mathewlynn S, Dockree S, Tolladay J, Kossen J, Li L, Gal Y, Davis Jones G. Measuring large language model uncertainty in women's health using semantic entropy and perplexity: a comparative study. The Lancet Obstetrics, Gynaecology, & Women's Health 2025;1(1):e47
Campos V, Prudente T, Leão L, da Costa M, Oliva H, Monteiro-Junior R. Analyses of different prescriptions for health using artificial intelligence: a critical approach based on the international guidelines of health institutions. Health Information Science and Systems 2025;13(1)
Bezrukova K, Griffith T. Post-Public AI: Research in Groups and Teams. Small Group Research 2025;56(5):799
Russinovich M, Salem A, Zanella-Béguelin S, Zunger Y. The Price of Intelligence. Communications of the ACM 2025;68(9):46
Jain A, Nimonkar P, Jadhav P. Citation integrity in the age of AI: evaluating the risks of reference hallucination in maxillofacial literature. Journal of Cranio-Maxillofacial Surgery 2025;53(10):1871
Hasnain M, Aurangzeb K, Alhussein M, Ghani I, Mahmood M. AI in conjunctivitis research: assessing ChatGPT and DeepSeek for etiology, intervention, and citation integrity via hallucination rate analysis. Frontiers in Artificial Intelligence 2025;8
Rashid M, Yi C, Sathapanasiri T, Udayachalerm S, Boonpattharatthiti K, Insuk S, Veettil S, Lai N, Chaiyakunapruk N, Dhippayom T, Rashid M, Cheng S, Ming Lai N, Lawin S, Limhensin P, Wechkunanukul K, Mayang N, Rattanachaisit N, Ye X. Role of Generative Artificial Intelligence in Assisting Systematic Review Process in Health Research: A Systematic Review. Value in Health 2025;28(11):1665
Seth I, Marcaccini G, Lim B, Novo J, Bacchi S, Cuomo R, Ross R, Rozen W. The Temporal Evolution of Large Language Model Performance: A Comparative Analysis of Past and Current Outputs in Scientific and Medical Research. Informatics 2025;12(3):86
Shen Z, Yu C. How Technology Advances Research and Practice in Autism Spectrum Disorder: A Narrative Review on Early Detection, Subtype Stratification, and Intervention. Brain Sciences 2025;15(8):890
Tekin S, Oguz S, Dagdelen S. ChatGPT-4o as a digital health tool for diabetes technology education: insights on reliability, quality, and readability. Endocrine 2025;90(2):652
Shor R, Greene E, Sumberg L, Weingrad A. AI Tools in Academia: Evaluating NotebookLM as a Tool for Conducting Literature Reviews. Psychiatry 2026;89(1):82
Li K, Peng Y, Li L, Liu B, Huang Z. Evaluating ChatGPT’s Utility in Biologic Therapy for Systemic Lupus Erythematosus: Comparative Study of ChatGPT and Google Web Search. JMIR Formative Research 2025;9:e76458
Dong Y, Zhang Z, Zhi Y, Li X, Guo T, He L, Zhao S, Yang X, Tang J, Zhong W, Niu Q, Ma M, Huang Z, Mao Y. Evaluating large language models' performance in answering common questions on drug-induced liver injury. JHEP Reports 2025;7(12):101579
Insuk S, Boonpattharatthiti K, Booncharoen C, Chaipitak P, Rashid M, Veettil S, Lai N, Chaiyakunapruk N, Dhippayom T. How Well Do ChatGPT and Claude Perform in Study Selection for Systematic Review in Obstetrics. Journal of Medical Systems 2025;49(1)
Antisdel J, Miller W, Groves D. Data Mining Trauma: AI-Assisted Qualitative Study of Cyber Victimization on Reddit. JMIR Infodemiology 2025;5:e75493
Tanas Y, Gasper G, Rashidi K, Swed S. Evaluating large language models in patient education on facial plastic surgery: a standardized protocol. International Journal of Surgery Protocols 2025;29(3):108
Kurulkar G, Ingale S, Shinde A, Jangale Y, Itkar S. Information Retrieval System for Automating Quiz Generation and Evaluation Using Large Language Models. Cureus Journal of Computer Science 2025
Balcells D. Co-intelligent Design of Catalysis Research with Large Language Models: Hype or Reality? ACS Catalysis 2025;15(18):16412
Boonrit N, Thaweechai A, Kessarin B, Ruanglertboon W. Capabilities of Large Language Models in Detecting and Managing Drug Interactions During Medication Reviews: Potential Implications as A Digital Assistant for Pragmatic Pharmacy Practice in Thailand. JACCP: Journal of the American College of Clinical Pharmacy 2025;8(11):1117
Soujah C, Bejjani C, Adra N, Blackburn L. Artificial Intelligence as a Drug Information Resource: Limitations and Strategies to Optimize in Pharmacy Practice. Hospital Pharmacy 2026;61(2):117
Tosca E, Aiello L, De Carlo A, Magni P. Pharmacometrics in the Age of Large Language Models: A Vision of the Future. Pharmaceutics 2025;17(10):1274
Dogru-Huzmeli E, Moore-Vasram S, Phadke C, Shafiee E, Amanullah S. Evaluating ChatGPT’s ability to simplify scientific abstracts for clinicians and the public. Scientific Reports 2025;15(1)
Hohagen F. Artifizielle Intelligenz in der Psychotherapie – werden Psychotherapeut*innen bald überflüssig? PSYCH up2date 2025;19(05):351
He Z, Zhao L, Li G, Wang J, Cai S, Tu P, Chen J, Wu J, Zhang J, Chen R, Huang Y, Pan X, Chen W. Comparative performance evaluation of large language models in answering esophageal cancer-related questions: a multi-model assessment study. Frontiers in Digital Health 2025;7
Huang Y, Yang G, Shen Y, Chen H, Wu W, Li X, Wu Y, Zhang K, Xu J, Zhang J. Application of Large Language Models in Complex Clinical Cases: Cross-Sectional Evaluation Study. JMIR Medical Informatics 2025;13:e73941
Drummond D, Girault A, Gonsard A. ChatGPT and other large language models for childhood asthma. Paediatric Respiratory Reviews 2025
Leiva-Araos A, Kalasapudi V, Jiang A, Kaushal H. Evaluating Smart Building Features for Fire, Electrical, and Life Safety: A Rapid Human-LLM Framework for Literature Review and Research Mapping. IEEE Access 2025;13:173312
Wu Y, Hu P, Wang D. The AI Annotator: Large Language Models’ Potential in Scoring Sustainability Reports. Systems 2025;13(10):899
Celik S. Integrating artificial intelligence into scientific writing: a narrative review for clinical and surgical researchers. The American Journal of Surgery 2025;250:116657
Rai M, Ngaw M, Nannas N. Artificial Intelligence Performance in Introductory Biology: Passing Grades but Poor Performance at High Cognitive Complexity. Education Sciences 2025;15(10):1400
Küçükuncular A. Learning with, rather than through, AI: co-designing science education for critical AI literacy. Frontiers in Education 2025;10
Xu P, Gong X, Chen X, Zhang W, Yang J, Yan B, Yuan M, Zheng Y, He M, Shi D. Benchmarking large multimodal models for ophthalmic visual question answering with OphthalWeChat. Advances in Ophthalmology Practice and Research 2026;6(1):33
Lee J. ChatGPT: how to use it and the pitfalls/cautions in academia. Annals of Pediatric Endocrinology & Metabolism 2025;30(5):229
Thomas L, Romasanta A, Pujol Priego L. Jagged competencies: Measuring the reliability of generative AI in academic research. Journal of Business Research 2026;203:115804
Aziz M, Brookhart M. Can Contemporary Large Language Models Provide the Domain Knowledge Needed for Causal Inference? Evaluating Automated Causal Graph Discovery Through an ASCVD Case Study. Clinical Epidemiology 2025;Volume 17:863
Sökmen D, Albayrak A, Sertkaya Z, Başağa Y, Serefoglu E. Artificial intelligence meets medical rarity: evaluating ChatGPT’s responses on post-orgasmic illness syndrome. International Journal of Impotence Research 2025
Civelekler M, Citirik M. Benchmarking Artificial Intelligence Models for Citation Accuracy in Neuro-Ophthalmological Disorders Research: A Comparative Analysis of Four Models. Journal of Hospital Librarianship 2025;25(3-4):140
Harman A. Preparing Students for the Generative Artificial Intelligence Future: Integrating Artificial Intelligence Literacy in Sport Management Education. Sport Management Education Journal 2025:1
C Calderon J, Robles-Velasco K, C Ferreira J. Artificial intelligence transforming healthcare research: opportunities, risks, and responsible use. Jornal Brasileiro de Pneumologia 2025:e20250433
Inooka T, Ota H, Taki Y, Yasuda S, Sajiki A, Suzumura A, Shimizu H, Takeuchi J, Tomita R, Kominami T, Ushida H, Yuki K, Nishiguchi K. Evolving Consultation: Enhancing Ophthalmic Diagnostic Performance Using Large Language Model. Ophthalmology Science 2026;6(2):101004
Campuzano E, Chakraborty S. SustainTima: a task-oriented text-based conversational AI agent for sustainable fast fashion brands. International Journal of Fashion Design, Technology and Education 2025:1
Voiculescu F, Darvasi P, Osmanlliu E, Krishnamoorthy P, Makri A. Natural Language AI Models and Pediatric Type 1 Diabetes: Can Chatbots Help With Diabetes Self-Management and Patient Education? JMIR Diabetes 2025;10:e76986
Liao Z, Huang J, Liu Y, Li F, Tang L, Cong L, Wang H, Luo S. Utility of Large Language Models for Congenital Microtia Reconstruction Education: Comparison of the Performance of Claude, GPT, and Gemini. Aesthetic Plastic Surgery 2025
Niraula D, Shotande M, El Naqa I. Human-machine Interaction in the Age of Generative AI. The Cancer Journal 2025;31(6)
García-Peñalvo F. Tres escenarios para la IA en educación: del apoyo responsable a la cocreación. Education in the Knowledge Society (EKS) 2025;26:e32932
Giuliani C, Benadi G, Engel F, Werner J, Watter M, Schwarzer G, Groß O, Zeiser R, Binder H, Kaier K. Identifying Biomedical Entities for Datasets in Scientific Articles: 4-Step Cache-Augmented Generation Approach Using GPT-4o and PubTator 3.0. JMIR Formative Research 2025;9:e73822
Hooshiar M. Artificial intelligence reliability in implant dentistry: A comparative analysis of clinical accuracy and hallucination patterns across multiple language models. The Journal of Prosthetic Dentistry 2026;135(5):e155
Chen Y. Evaluating the potential of ChatGPT-reformulated essays as written feedback in L2 writing. Computers and Education: Artificial Intelligence 2025;9:100500
Lupon E. Artificial intelligence and plastic surgery: Between innovation and responsibility. Annales de Chirurgie Plastique Esthétique 2026;71(1):1
Blanc-Durand F, Koopman M, Patel S, Aldea M, Kather J. Will AI write the next ‘Chapter’ in literature reviews? Annals of Oncology 2026;37(4):448
Ros T, Samuel A. Artificial Intelligence in Academic Writing: Tools and Techniques for Scholars to Succeed in the Publishing World. New Directions for Adult and Continuing Education 2025;2025(188):57
Cherrez‐Ojeda I, Zuberbier T, Rodas‐Valero G, Sanchez J, Rudenko M, Dramburg S, Demoly P, Caimmi D, Gómez R, Ramon G, Fouda G, Quimby K, Chong‐Neto H, Llosa O, Larco J, Monge Ortega O, Faytong‐Haro M, Pfaar O, Bousquet J, Robles‐Velasco K. Evaluation of the Quality and Reliability of ChatGPT‐4's Responses on Allergen Immunotherapy Using Validated Instruments for Health Information Quality Assessment. Clinical and Translational Allergy 2025;15(12)
Zade F, Ebrahimkhanlou A. Vision-Language Artificial Intelligence for Robotic-Based Monitoring: Concrete Defect Detection, Classification, and Localization in Two-Dimensional Maps. Journal of Computing in Civil Engineering 2026;40(2)
Mansoor I, Abdullah M, Rizwan M, Fraz M. Reasoning with large language models in medicine: a systematic review of techniques, challenges and clinical integration. Health Information Science and Systems 2025;14(1)
Äyräväinen L, Hinds J, Davidson B, Wong W. Disambiguating sentiment annotation: A mixed methods investigation of annotator experience and impact of instructions on annotator agreement. PLOS One 2025;20(12):e0336269
Tang W, Hang Y, Zhang J, Dang Y. Recent Advances and Future Directions of Artificial Intelligence in Glaucoma Management. The Open Ophthalmology Journal 2026;20(1)
Guillen-Aguinaga M, Aguinaga-Ontoso E, Guillen-Aguinaga L, Guillen-Grima F, Aguinaga-Ontoso I. Data Quality in the Age of AI: A Review of Governance, Ethics, and the FAIR Principles. Data 2025;10(12):201
Mourtzinis S, Silva T, Lo J, Smith D, Conley S. A human in the loop approach to applying large language models for farm management insight. Scientific Reports 2025;16(1)
Colder Carras M, Qureshi R, Naaman K, Aldayel F, Date M, AlJuboori D, Thrul J. Using Large Language Models to Summarize Evidence in Biomedical Articles: Exploratory Comparison Between AI- and Human-Annotated Bibliographies. JMIR Formative Research 2026;10:e69707
Srikandabala K, Prabagar K, Jayatilleke S, Zhang P, Ellerbrock R, Rinas S, de Silva D, Alahakoon D. Optimization and Management of Data Center Networks: A Scoping Review on Key Themes, Challenges, and Artificial Intelligence and Machine Learning Approaches. IEEE Access 2025;13:134699
Ding S, Ahmed M, Malik T, Somagani R, Vohra F. Readability Comparison of AI-Generated Versus UpToDate Educational Content on Stroke Management: A Cross-Sectional Study. Cureus 2025
Maggio L, Konopasky A. AI, authorship teams and agency: Enacting fission and fusion. Medical Education 2026;60(2):101
Mavrych V, Yousef E, Yaqinuddin A, Shaikh A, Bolgova O. Evaluating the Reliability of GPT‐4o in Histological Image Interpretation. Clinical Anatomy 2026;39(4):517
Bianchi F, Queen O, Thakkar N, Sun E, Zou J. Exploring the use of AI authors and reviewers at Agents4Science. Nature Biotechnology 2026;44(1):11
Lamba N, Tiwari S, Gaur M. Hallucinations in Scholarly LLMs. Open Conference Proceedings 2025;8
Venne D, Hartley D, Dorris C, Torres L, Malchione M, Mitgang E, Goodman J. Strategies for identifying and using diverse global health data: perspectives from data searching and meta-analyses on antimicrobial resistance. Discover Public Health 2025;22(1)
Jagasia P, Bagdady K, Franco M, DeYoung J, Fracol M. Novel Breast Reconstruction Generative Pretrained Transformer Promotes Decisional Confidence and Clinical Efficiency. Plastic and Reconstructive Surgery - Global Open 2025;13(12):e7286
Leong A, Ormsby K. Generative AI for patient education in cancer care: A scoping review of evaluation practices and emerging trends. Technical Innovations & Patient Support in Radiation Oncology 2026;37:100373
Gu C, Zhang J. How does “Dr. AI” trigger cyberchondria? An empirical study based on the CMIS framework using a hybrid SEM–ANN approach. Current Psychology 2026;45(1)
Yadav V, Trushna T, Mandal U, Singh M, Goel A, Bairwa M, Dabar D, Sabde Y. AI-assisted search strategy construction with step-by-step instructions to execute and manage searches across major databases. Medical Reference Services Quarterly 2026;45(1):1
Anand E, Ghersin I, Lingam G, Devlin K, Pelly T, Singer D, Tomlinson C, Munro R, Capstick R, Antoniou A, Hart A, Tozer P, Sahnan K, Lung P. Enhancing Patient Understanding of Perianal Fistula MRI Findings Using ChatGPT: A Randomized, Single Centre Study. Diagnostics 2025;16(1):72
Birsin Z, Jeral S, Cebeci S, Çerme E, Aliyev V, Günaltılı M, Abbasov H, Çiçek E, Demirci N, Alan Ö. Stage II colon cancer: does ChatGPT recommend more intensive adjuvant therapy? A comparison with MDT decisions. Future Oncology 2026;22(4):427
Shamsi A, Wang T, Amraei M, Raju N. Evaluating AI Text Detection Tools for Distinguishing Human-Written from AI-Generated Abstracts in Persian-Language Journals of Library and Information Science. Acta Informatica Pragensia 2026;15(1):126
Cleland J, Driessen E, Masters K, Lingard L, Maggio L. When and how to disclose AI use in academic publishing: AMEE Guide No.192. Medical Teacher 2026;48(4):542
Umpar N, Umpar S, Magsino T, Nolasco D, Abuton D. Exploring the Experiences of College Students in Using ChatGPT as an Academic Support Tool: A Qualitative Study. International Journal on Culture, History, and Religion 2025;7(2):1
Yu S, Hwang H. The ethics of using artificial intelligence in writing medical research papers. Kosin Medical Journal 2025;40(4):270
Kanbakan A, Berikol G, İlhan B, Altıntaş E, Doğanay F. Artificial intelligence based resuscitation simulation: a pilot study of a novel approach to team leadership training. BMC Medical Education 2026;26(1)
de Beer J, Mohammed K, Barrios J, Elnajjar S, Aldabahy M, Moodley V, Almuntashiri A, Basha M, Eissa A, Barayan W, Young S. Ethical Integration of Artificial Intelligence in Nursing Research: An Evidence-based Practice Project from Saudi Arabia. Journal of Nursing Science and Professional Practice 2025;2(4):183
Singh S, Chandrasekhar P. Accuracy Is Not Enough: Reasoning and Reference Reliability in Orthopaedic Large Language Model (LLM) Applications. Cureus 2026
Kobierski M. AI the Teacher? A Study on the Use of Artificial Intelligence Tools in Learning. Linguistics Beyond and Within (LingBaW) 2025;11:89
Rios-Garcia W, Silva-Jiménez S, Gálvez-Rodríguez E, Alberca-Naira Y, Via-y-Rada-Torres A, Rios-Garcia A. Assessment of ChatGPT-5 as an Artificial Intelligence Tool for Exploring Emerging Dimensions of Clinical Simulation: A Proof-of-concept Study. Journal of Medical Systems 2026;50(1)
Evenstein Sigalov S, Tsybulsky D. Critical ignoring reimagined: insights from STEM digital curation on Wikimedia platforms. Smart Learning Environments 2026;13(1)
Jiang G, Yang S, Wang Y, Hui P. When trust collides: Exploring human-LLM cooperation intention through the prisoner’s dilemma. International Journal of Human-Computer Studies 2026;209:103740
Civelekler M, Citirik M. Exploring Artificial Intelligence’s Role in Citation Generation for Ocular Inflammation and Uveal Diseases Research: A Comparative Evaluation Across Four Models. Ocular Immunology and Inflammation 2026;34(2):397
Basil M, Ahmed W, Hajeomar R, Strawbridge J, Lynch M, Mukhalalati B. A scoping review of the use of generative artificial intelligence tools in health profession education. BMC Medical Education 2026;26(1)
Guo Y, Yang C. Large Language Models for High-Entropy Alloys: Literature Mining, Design Orchestration, and Evaluation Standards. Metals 2026;16(2):162
Bati Sutcu Y, Ozcan S, Dincer M, Atli Z, Trabulus S, Seyahi N. Comparative Analysis of ChatGPT and Gemini in Addressing Questions from Chronic Kidney Disease Patients. Kidney and Dialysis 2026;6(1):9
Hack S, Karni R, Maniaci A, Fundakowski C, Castellani L, Incandela F, Accorona R, Mayo-Yanez M, Violati M, Giannini L, Mevio N, Saibene A. Evaluation of large language models as decision support tools for head and neck cancer management: A blinded multidisciplinary simulation study. Oral Oncology 2026;174:107877 View
McCavitt D, Shabani S, Mulakaluri A, Dhandi S, Duong A, Patterson J. ChatGPT-4o Mini Fabricates and Miscites Evidence for American Academy of Orthopaedic Surgeons Hip Fracture Clinical Practice Guidelines. JBJS Open Access 2026;11(1) View
Chisale P. Protecting creativity in the age of generative AI: productive uncertainty, and visible thinking in scholarship and assessment. Frontiers in Education 2026;10 View
Georgiou G. Key Features to Distinguish Between Human- and AI-Generated Texts: Perspectives from University Professors. AI in Education 2026;2(1):2 View
Tensen D, Carey M, Grainger P. Comparing ChatGPT-5 and other GenAI tools in doctoral confirmation: variability, persona effects, and alignment with human feedback. Assessment & Evaluation in Higher Education 2026:1 View
Güngör V, Yaslikaya S. Informing patients with otologic balance disorders: A performance review of ChatGPT-4. Medicine 2026;105(6):e47127 View
Şengül H, Akın B, Kayaalp M, Sezgin E. Deep research capabilities in GPT‐5 thinking and Gemini 2.5 Pro improve citation integrity and concordance with American Academy of Orthopaedic Surgeons anterior cruciate ligament and rotator cuff guidelines. Knee Surgery, Sports Traumatology, Arthroscopy 2026;34(4):1504 View
Rao A, Cholankeril G, Flores A, Sood G, White T, Kanwal F, El-Serag H. Development of retrieval-augmented generation–based large language model for drug-induced liver injury using Livertox data. Hepatology Communications 2026;10(3) View
Kudaş İ. Can we trust chatbots for tacrolimus? A STROBE-aligned multimodel benchmark of large language models for drug information in kidney transplantation. Journal of Surgery and Medicine 2026;10(2):48 View
Rosas-Jiménez C, Lokker C, Ibhawoh B, Schwartz L. Trust and overreliance on ChatGPT and its implications on critical thinking: an exploration in health policy. Discover Public Health 2026;23(1) View
Wagg A. Large Language Models in Urology, a Cautionary Promise. Comment on Eskandar, K. Assessing ChatGPT Accuracy Across Versions for Patient and Guideline Queries in Sacral Neuromodulation. Soc. Int. Urol. J. 2026, 7, 11. Société Internationale d’Urologie Journal 2026;7(1):12 View
Eskandar K. Assessing ChatGPT Accuracy Across Versions for Patient and Guideline Queries in Sacral Neuromodulation. Société Internationale d’Urologie Journal 2026;7(1):11 View
Park Y, Zhang H, Bai S. Large language models in systematic review and meta-analysis of surgical treatments for vaginal vault prolapse. npj Digital Medicine 2026;9(1) View
Sivri I, Ozden F, Gul G, Kaygin E, Colak T. Comment on: “Assessing AI-generated patient leaflets on descemet membrane endothelial keratoplasty”. European Journal of Ophthalmology 2026;36(4) View
Jongkind R, Elings E, Joukes E, Broens T, Leopold H, Wiesman F, Meinema J. Is your curriculum GenAI-proof? A method for GenAI impact assessment and a case study. MedEdPublish 2026;15:11 View
Brun Y, Chakraborty S, Le Goues C, Păsăreanu C, Singla A. Automatically Engineering Trusted Software: A Research Roadmap. ACM Transactions on Software Engineering and Methodology 2026 View
Marjanović M, Latinović L. Artificial intelligence in medical diagnostics: A critical narrative review of risks, responsibility, and the epistemological limits of large language models. Serbian Journal of Engineering Management 2026;11(sp.iss.):76 View
Alvin S. Human PR is Irreplaceable and AI as a Supporting Tool: Exploring Indonesian PR Practitioners’ Attitudes, Applications, and Adoption of AI. International Journal of Strategic Communication 2026:1 View
Jeon K. Limitations and mitigation strategies for using generative artificial intelligence in medical writing: a narrative review. Journal of the Korean Medical Association 2026;69(2):118 View
Salzmann M, Ramadanov N, Prill R, Hable R, Becker R. Large language models are comparable with commonly used statistical software: A validation of GPT 5.1 for frequentist meta‐analysis in orthopaedics. Knee Surgery, Sports Traumatology, Arthroscopy 2026 View
Cao L, Lin A. Dialogic triad. Journal of English for Research Publication Purposes 2025;6(2):340 View
Ademolu E. Unreliable minds, unreliable machines: dyslexic memory, ChatGPT, and the epistemic disobedience of generative AI. AI & SOCIETY 2026;41(6):5841 View
Aydin M, Vatansever A, Erer Kafa S. From Hallucination to Precision: A Longitudinal Analysis of Reference Accuracy and Plagiarism in AI-Generated Medical Literature (2024–2026). Uludağ Üniversitesi Tıp Fakültesi Dergisi 2026;52:1870116 View
Jay M, Morgan M, Straus S, Wilson E, Sutradhar R, Yu C, Gotlib Conn L, Dharma C, Lipscombe L, Eskander A. Evaluating the Methodological Quality of Artificial Intelligence–Assisted Systematic Reviews: Protocol for a Mixed Methods Meta-Research Study. JMIR Research Protocols 2026;15:e90588 View
Seymour C, Tolley K, Zengeya T, Spear D, Cloete J, Dayaram A, da Silva J, Alexander G, Handley K, Joseph G, Simba L, Snaddon K, von Maltitz G, Carrick P. A 2026 horizon scan for biodiversity conservation in South Africa. Ambio 2026 View
Psarellis Y, Pillai N, Dhakal S, Mavroudis P. Machine-learning-enabled modeling of pharmacokinetics and pharmacodynamics. Drug Discovery Today 2026;31(3):104645 View
Wang X, Li X, Wong W. Designing a flipped AI-chatbot learning module to support students’ environmental literacy development: A Fuzzy Delphi Method. PLOS One 2026;21(3):e0345027 View
Rajaratnam V, Omar U, Kee K, Kaliya-Perumal A. Citation Inaccuracies and the Need for Multi-Level Oversight in AI-Assisted Medical Writing. Standards 2026;6(1):10 View
Omar M, Awan K, Ramaiya N, Mohamed I. Radiology oral boards coach: A custom large language model prompt for structured oral board preparation in resident readouts. Current Problems in Diagnostic Radiology 2026;55(4):509 View
Jain A, Merchant Y. Evaluating the accuracy of ChatGPT-4 generated references in oral and maxillofacial surgery: a preliminary observational study. Oral and Maxillofacial Surgery 2026;30(1) View
Sallai T, Di Nitto M, Catania G, Zanini M, Sasso L, Bagnasco A. When AI writes in minutes and supervisors verify for hours: Rethinking thesis supervision in nursing education. Nurse Education in Practice 2026;94:104801 View
Cihodaru-Ștefanache Ș, Popovici M, Dumitru M, Podina I. When ChatGPT becomes a shortcut: psychological predictors of student plagiarism threshold. Interactive Learning Environments 2026:1 View
Şahin A, Yorulmaz E. Guideline Concordance and Safety of AI Chatbots for Circumcision Anesthesia: A Comparative Study. Journal of Contemporary Medicine 2026;16(2):109 View
Singh R, Venugopal V, Daga R, Aggarwal A, Kaur M, Kanaujiya R, Gupta A, Puri S. Clinical readiness of LLM-generated oncologic CT/MRI impressions: A blinded comparison with subspecialty oncoradiologists at a tertiary cancer centre. European Journal of Radiology Artificial Intelligence 2026;6:100088 View
Pei B, Sun X, Guo R, Zhang Z. LitPilot: A Human-Centered Platform for Transparent and Interactive Systematic Literature Reviews. International Journal of Human–Computer Interaction 2026:1 View
Makarov V, Stroganov O, Furlong L, Evarts B, van den Biggelaar L, Goulas A, Stolte E, Marren D, Greiffenberg L. Natural language querying of biological databases with large language models. Drug Discovery Today 2026;31(3):104654 View
Karaarslan E, Aydın Ö. Governing Artificial Intelligence in the Evolving Academic Publishing Ecosystem. Journal of Metaverse 2026;(6):134 View
Wang X, Yin C, He H, Guo J, Fu X, Bai F. Benchmarking public large language model responses to patient-facing inflammatory bowel disease questions: informational quality, transparency proxies, and readability. Frontiers in Public Health 2026;14 View
Naffi N, El Bahlouli Y, Fortier N, Doumbouya M, Shakeraneh S, Al Faraj D, Gregoire J, Beaulieu N, Whelan K. Intelligence artificielle et formation des professionnels de la santé : une revue intégrative des apports, défis et enjeux de médiation. Médiations et médiatisations 2026;(25) View
Fabi A, Egli C, Wendelspiess S, Griewing S, Haas Y, De Pellegrin L, Schaefer D, Qiu S, Harder Y, Kappos E. Exploring the Role of AI in Managing Treatment Recommendations for Lymphedema: International, Multidisciplinary, Multiprofessional Survey Study of Trust, Reliability, and Impact on Decision-Making. JMIR Medical Informatics 2026;14:e80553 View
Yotov K, Hadzhikoleva S, Hadzhikolev E, Milev M, Rachovski T. A Conceptual Framework for Simulated Self-Assessment and Meta-Evaluation of Generative AI Models. AI 2026;7(4):134 View
Zhu L, Zamore Z, Azad T, Amlani L, Chen J, Harris C, Khalifeh J, Murali S, Sylvester S, Mundy L, Azad C. Enhancing Readability of Surgical Patient Education Resources Using ChatGPT. Journal of Surgical Research 2026;322:87 View
Anggraini C, Rachmadania N, Nugroho C, Wulandari A, Rina N. How students make sense of ChatGPT in digital literacy education: reflections on ethics, dependency, emotion, and evaluation. Frontiers in Political Science 2026;8 View
Fiş Erümit S. ChatGPT Practices of High School Students in Programming Education: Experiences, Perceptions and Challenges. European Journal of Education 2026;61(2) View
Qadri S, Kewalramani D, Aboutanos M, Narayan M, Quintana M. Artificial Intelligence in Surgical Care: A Point-Counterpoint Analysis. Trauma Surgery & Acute Care Open 2026;11(2):e002093 View
Ren K, Weng Q, Chen Q, Li H, Xie D, Zeng C, Wei J, Lei G, Wang Y. The application of large language models in orthopedic postgraduate education: potentials, challenges, and future prospects. Journal of Orthopaedic Surgery and Research 2026;21(1) View
Thoeni A, Fryer L. AI chatbots in higher education: Comparing expectations to evidence. Computers in Human Behavior Reports 2026;22:101061 View
Altunisik N, Altunisik Toplu S, Turkmen D. Scenario-based evaluation of large language models for reference accuracy in dermatology: literature retrieval on latent tuberculosis in psoriasis patients on anti-IL-17/23 therapy. Cutaneous and Ocular Toxicology 2026;45(2):145 View
Ozkara Menekseoglu P, Weibezahl M, Ellingsen M, Sterkenburg J, Kharko A, Hochwarter S, Schwarz J. Errors in AI-Transformed Patient-Centered Mental Health Documentation Written by Psychiatrists: Qualitative Pre-Post Study. JMIR Mental Health 2026;13:e78351 View
Hack S, Hayu D, Karni J, Glikson E, Remer E, Alon E, Ahmed O, Takashima M. Impact of Generative AI on Operative Planning in Otolaryngology Residents. Surgical Innovation 2026 View
Cabezas-Clavijo Á, Sidorenko-Bautista P. Assessing the Performance of 8 AI Chatbots in Bibliographic Reference Retrieval: Grok and DeepSeek Outperform ChatGPT, but None are Entirely Accurate. Journal of Data and Information Science 2026 View
Thaker N, Liu W, Waddle M, Showalter T, Mastroleo F, Luh J, Levitt C, Ning M, Loaiza-Bonilla A, Hong J. Retrieval-Augmented Generation in Oncology: Promises, Pitfalls, and Early Applications. AI in Precision Oncology 2026;3(2-3):34 View
Csigó K, Cserey G. Human Shadows in Machine Minds: Quantitative Study Interpreting AI Responses to the Rorschach Test. JMIR Mental Health 2026;13:e88186 View
Lee Y. Bridging theory and practice in generative artificial intelligence for medical education: insights from clinical teaching experience. Ewha Medical Journal 2026;49(2):e16 View
Santhosh V, Vas R, Roychowdhury B, Sakthi K, Rahaman M. Development and content validation of the CAREFUL-AI framework for evaluating AI-generated scientific manuscripts: an exploratory cross-platform study. Research Evaluation 2026;35 View
Funa A, Ricafort J, Gabay R. Reflect-Compare-Revise (RCR): A Practitioner Inquiry on Integrating Generative AI in Nursing Care Planning Education. Medical Science Educator 2026 View
Karni J, Simon C, Hack S. Assistive, not autonomous: Generative artificial intelligence in head and neck cancer care - A scoping review. DIGITAL HEALTH 2026;12 View
Hall A, Patel A, Shariati K, Perrotta A, Argame A, Tseng C, Tseng C, Hidalgo M, Lee J. Generative AI in Surgical Care: Evaluating Large Language Model Performance in Patient Education. Journal of Artificial Intelligence for Medical Sciences 2025;6(1-4):54 View
Shiukashvili N, Rochikashvili M, Kupradze V, Gonjilashvili N, Gvajaia N, Kutchava L, Janikashvili N, Tevzadze N, Undilashvili A, Ekaladze E. A Performance-Based Rubric for Generative AI use in Medical Students’ Research Tasks: Development and Initial Psychometric Evaluation. Journal of Medical Education and Curricular Development 2026;13 View
Рзянкин И. Минимизация ошибок нейросетевых моделей в библиотечно-издательской сфере: подходы и их эффективность. Международный научный журнал "Современные информационные технологии и ИТ-образование" 2025;21(2):272 View
Worrell T, Roberts Z. Bringing Uncle Chatty Gee (ChatGPT) into the classroom: a lens from Critical Indigenous Studies. Curriculum Perspectives 2026 View
Nowacki L, Wrochna A. Framing without intent: AI super-frames and the hidden agency of large language models - a case study of ChatGPT and DeepSeek. Communication Research and Practice 2026:1 View
Sharpe B, Rod M, Horne G. Artificial Intelligence in Applied Cognitive Psychology: A Commentary. International Journal of Cognitive Research in Science, Engineering and Education (IJCRSEE) 2026;14(1):115 View
Chigwada J, Ngulube P. Use of artificial intelligence tools in the publishing process: expectations from publishers through author guidelines. Frontiers in Research Metrics and Analytics 2026;11 View
Wang K, Ward D, Ord M, Liu B, Jardine A. AI-designed and AI-implemented control systems for bespoke scientific instrumentation: application to scanning microscopy. Digital Discovery 2026;5(6):2448 View
Madsen D, Silva E. Teaching citation in the age of generative AI: Rethinking research literacy, academic integrity, and epistemic responsibility. The Journal of Academic Librarianship 2026;52(4):103267 View
Istrate O. Generative AI and multimodal pedagogy: implications for teaching, learning, and assessment. Frontiers in Education 2026;11 View
McLaughlin N, Srinivas A, Lowe Z, Botterbush K, Patel M, Avila M. Large Language Model Hallucinations in Spine Surgery: A Comparative Analysis of Clinician vs Patient-Level Prompts. Neurosurgery Practice 2026;7(3) View
Çepni S, Çetinkaya Ş. Pre-service english teachers’ tendency to integrate ChatGPT into exam routines: a metaphor analysis. SN Social Sciences 2026;6(6) View
Ok F, Sukur I, Gül M, van Renterghem K. Evaluation of ChatGPT-5–generated surgical literature: the accuracy of a review on penile prosthesis implantation. International Journal of Impotence Research 2026 View
Dereli F, Yildirim J. The Validity and Reliability of Artificial Intelligence Chatbots in the Self-management of Diabetes. CIN: Computers, Informatics, Nursing 2026 View
Civelekler M, Citirik M. Comparing the Accuracy of 4 Artificial Intelligence Models in PubMed Citation Generation for Glaucoma Research. Journal of Glaucoma 2026;35(6):e68 View
Jafar A, Motair M, Hajeri S. Evaluating ChatGPT's Reliability in Analyzing Facial Asymmetry: A Comparative Study With Human Evaluators Using Cross Sectional Static Images. World Journal of Otorhinolaryngology - Head and Neck Surgery 2026 View
El Kayali M, Neumann E, Fahy S, Bartek B, Oehme S, Getgood A, Jung T, Milinkovic D. Assessment of OpenEvidence responses to American Academy of Orthopaedic Surgeons clinical practice guidelines for anterior cruciate ligament reconstruction. Journal of ISAKOS 2026;19:101160 View
Rezaeian M. Limitations of Using Generative Artificial Intelligence in Writing Scientific Articles: Use of Fake Citations and Tendency Towards Western Texts. Journal of Rafsanjan University of Medical Sciences 2025;24(9):773 View
Albaloul O, Alajmi A, Oster N, Killian C. Systematic review of qualitative studies exploring K-12 teachers’ perceptions and experiences of using ChatGPT. Discover Artificial Intelligence 2026;6(1) View
Al Hamid A, Alshahrani R, Alghareeb M, Hawsawi H, Alghareeb R, Alsultan M. Assessing the Accuracy and Precision of Artificial Intelligence for Diabetes Mellitus and Hypertension Management. Journal of Clinical Medicine 2026;15(12):4419 View
Rezaeian M. Limitations of Using Generative Artificial Intelligence in Writing Scientific Articles: Complex Considerations in Authorship. Journal of Rafsanjan University of Medical Sciences 2026;24(10):887 View
Chan W, Gao Z, Chuxi A, Rainer T, Tang W, Wing Y, Lederman Z. Loneliness and social isolation at the emergency department: a scoping review protocol. BMJ Open 2026;16(6):e108972 View
Hilkenmeier F, Stoltenberg M, Stierle C. Using full agreement across multiple large language models for title-and-abstract screening in systematic reviews: a proof-of-concept. Systematic Reviews 2026;15(1) View
Khairil L, Benny K, Jerry J, Khatib F, Che Ramli M, Kumar S. AI in Drug Discovery: Clinical Failures, Regulatory Reality, and the Validation Crisis Behind the Hype. Pharmaceuticals 2026;19(6):916 View
Zhong W, Zheng G, Li Q, Huang Y, Zhao J. Benchmarking public large language model responses to patient-facing varicose veins questions: informational quality, verifiability indicators, and readability. Frontiers in Public Health 2026;14 View
Ang M, Chen L, Song L, Lipovich L, Choo S. Responsible Use of Large Language Models in Microbial Genomics and Bioinformatics: A Life-Science Framework for Reliability, Reproducibility, and Risk-Aware Interpretation. Life 2026;16(6):1032 View
Correia L, Feng Y. Can Consumers Rely on AI Chatbots for Food Safety Advice? A Comparative Analysis of Chatbot Responses and Food Safety Specialists’ Guidance. Journal of Food Protection 2026;89(9):100852 View
Civelekler M, Çıtırık M. The role of artificial intelligence in academic citation: A study on lens, cataract, and anterior segment research. Indian Journal of Ophthalmology 2026;74(7):1073 View
Zhang X, Dai Y, Zhao X, Wu L, Shao B, Shan X, Ji F, Deng R, Zhao B. Large language model chatbots as sources of pediatric anesthesia health advice: An evaluation of reliability and readability. DIGITAL HEALTH 2026;12 View
Freudenberg J, Knitza J, Gremke N, Amann N, Deutsch T, Tauber N, Muras K, Oftring Z, Engler T, Englisch A, Lukac S, Fabi A, Alves A, Reinhardt K, Wallwiener M, Kuhlmann M, Bamberger J, Wagner U, Kuhn S, Griewing S. AI-enabled clinical decision support in breast cancer care: a blinded multicenter benchmarking study comparing medically specialized with a general-purpose system. Journal of Medical Systems 2026;50(1) View
Spennemann D. How Unique Are Hallucinated Citations Offered by Generative Artificial Intelligence Models?. Publications 2026;14(3):38 View
Shimazaki T, Tachikawa M. Bridging interpretable machine learning and large language models through direct representative selection and prediction: a two-layer framework for quantitative-linguistic insight. Physical Chemistry Chemical Physics 2026;28(28):17246 View
Knauer M, Greß J, Kather J, May P. Augmenting Oncology Guideline Maintenance with Large Language Models: A Prospective Case Study (Preprint). JMIR AI 2026 View
Hunt D, Di Miceli M. Evaluating the performance of 3 large language models in higher education essay-like assessments in 2024 and 2026. American Journal of STEM Education 2026;25:279 View
Scallan R, Atwood B, Moser C, Eberhardt G, Stucky C, Kessinger S. A Comparative Study of Generative Artificial Intelligence Versus Clinical Experts for Evidence-Based Decision-Making in Austere Environments. Military Medicine 2026 View
Stephenson C, Edakkanambeth Varayil J, Aakre C, Croghan I, Hurt R. Prompting Strategies for Large Language Models in Primary Care: A Primer for Clinician-Artificial Intelligence Interaction. Journal of Primary Care & Community Health 2026;17 View
Ülkir M, Paslı B. Reference Hallucination, Citation Reliability, and Readability of Large Language Models in Anatomy‐Related Question Answering. Clinical Anatomy 2026 View
Semiletova A. Semantic degradation of scientific text in AI editing. Language and Text 2026;13(2):213 View
Boyd A, Srinivasan M, Wijesinha R, Coates M, Affandi S. An assessment of the quality and readability of patient information resources produced by OpenAI's ChatGPT and Microsoft's Copilot for invasive cardiovascular procedures. International Journal of Cardiology Innovations 2026;3:100013 View
Suksatan W. Who Is Responsible When AI Gets Cancer Information Wrong? Implications for Patient Education. Journal of Cancer Education 2026 View
Somer S, Vardy M, Eisenberger J, Shapira M, Mehler S, Safrai M. Evaluation of six large language models for study identification in an obstetric systematic review. Minerva Obstetrics and Gynecology 2026;78(4) View
Zakhari R. Teaching Critical Appraisal in the Age of Generative Artificial Intelligence. CIN: Computers, Informatics, Nursing 2026 View
Han C, Manoharan S, Ye X, Speidel U. Phantom citations: An empirical study of non‐existent and unverifiable references in scholarly literature. Journal of the Association for Information Science and Technology 2026 View
Domingo J. The Role of Artificial Intelligence in the Lifecycle of Scientific Manuscripts: Authoring, Reviewing, and Editorial Selection. Qeios 2026;8(7) View
Llopis-Agelán J, Estrada-Lorenzo J, Martín-Martín Ó, Del-Gallego-Lastra R. Evaluación de ChatGPT para la corrección de referencias bibliográficas en estilo Vancouver. BiD: textos universitaris de biblioteconomia i documentació 2026;(56) View
Kalem Ozgen A, Celik Karbancioglu E, Tezel N. Artificial Intelligence Large Language Models for Pelvic Floor Rehabilitation: Readabilty, Usability and Consistency. Neurourology and Urodynamics 2026 View
Lai K, Mustaffa N, Preece C, Wong F. Hallucinations in large language models within academic contexts: a systematic review. Journal of Science and Technology Policy Management 2026:1 View
Chen C, Liang S, Wang T. Beyond existence checks: a cross-database benchmark of bibliographic metadata consistency across academic domains and LLM-generated citations. Scientometrics 2026 View
Descamps J, Bouguennec N, Porcher R, Resche-Rigon M, Bouché P. Artificial intelligence in systematic reviews and meta-analyses: Task-specific performance, residual error quantification, and human oversight. Orthopaedics & Traumatology: Surgery & Research 2026:104793 View

Books/Policy Documents

Caslini G, Gianotti M, Garzotto F. End-User Development. View
Montenegro L, Gomes L, Machado J. Progress in Artificial Intelligence. View
Acar S, Ellis A. Generative Artificial Intelligence and Creativity. View
Zaboli A, Turcato G. Triage Systems: Essential Knowledge for Emergency Nurses and Physicians. View
Madani R, Richter D, Gomes T. Innovation in Medical Education and Clinical Practice. View
Hems N. Überinformiert und fehlgeleitet: Digitale Fehlinformation im Gesundheitsbereich. View
Egbueri J, Agbasi J, Usman A, Uwajingba H, Abba S. Application of Artificial Intelligence in Hydrogeological Research. View
Rahimi A, Wimmer M, Gusenbauer M. Web Engineering. View
Sense F, Dye I, Collins M, Swan G, Krusmark M, Smith S, Jastrzembski Myers T. Adaptive Instructional Systems. View

Conference Proceedings

Steinbach M, Bhandari S, Meyer J, Pardos Z. Proceedings of the Twelfth ACM Conference on Learning @ Scale. When LLMs Hallucinate: Examining the Effects of Erroneous Feedback in Math Tutoring Systems View
Shi Q, Han Q, Soares C. 2025 IEEE International Conference on Digital Health (ICDH). C-PATH: Conversational Patient Assistance and Triage in Healthcare System View
Cline D, Zgonc D, Vass W, Butkus M, Baideme M, Krueger B. 2025 ASEE Annual Conference & Exposition Proceedings. A New “Age of Generative AI” Paradigm for the Development and Management of Curricula in Undergraduate Environmental Engineering Programs View
Hristov H, Bekirski K, Somova E, Ignatov A, Stavrev S, Poptolev Z. EEPES 2025. Approach and Tool for Creating Sustainable Learning Video Resources Through Integration of AI Subtitle Translator View
Thang H, Vi T. 2025 International Conference on Circuit, Systems and Communication (ICCSC). AI in the Age of Scientific Saturation: A Survey of How LLMs Rediscover the Obvious View
Hermann J, Jansen N, Dogangün A, van Ledden S. Proceedings of the 37th Australian Conference on Human-Computer Interaction. Implicit SDG Reasoning in LLMs: A Classroom-Oriented Benchmark View
Zhang Z, Chen P, Du F, Ye R, Huang O, Liut M, Aspuru-Guzik A. 2025 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC). TreeReader: A Hierarchical Academic Paper Reader Powered by Language Models View
Lorenzoni G, Scaramuzzino M, Iacono S, Martini L, Zolezzi D, Vercelli G. 2025 IEEE International Conference on Cyber Humanities (IEEE-CH). Real-Time Multilingual Museum Guidance with ConvAI and Unreal Engine 5 View
Vanmala, Dubey A. 2025 Seventh International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN). A Comprehensive Evaluation of Large Language Models: Capabilities, Challenges, and Cross-Domain Applications View
Jiang X, Sun Q, Lv X. Proceedings of the 2025 11th International Conference on Communication and Information Processing. BasketVision: Benchmarking MLLMs' Grasp of Complex Dynamic Systems View
Ryser A, Allwein F, Schlippe T. 2025 3rd International Conference on Foundation and Large Language Models (FLLM). Calibrated Trust in Dealing with LLM Hallucinations: A Qualitative Study View
Farhad M, Masud M, Abdul Rasheed A, Musthfa N. Proceedings of the 2026 Australasian Information Security Conference. How Linguistic Variation Shapes Hallucination Risk in Large Language Models: A Psychometric Perspective View
Park S, Lee T, Lee W. Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems. LLM-box vs. Thinking-box: Designing for Deliberate User Engagement with Distorted Information in Conversational Search View
Hannig L, Bush A, Aksoy M, Trappen T, Becker S, Ontrup G. Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems. Campus AI vs. Commercial AI: Comparing How Students and Employees Perceive their University’s LLM Chatbot vs. ChatGPT View
Winecoff A, Klyman K. Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems. From Symptoms to Systems: An Expert-Guided Approach to Understanding Risks of Generative AI for Eating Disorders View
Salem AlMahri K, Ghanem Mohamed Alzahmi S, Ezzaza M, Abdullah Alyammahi J, Ertek G. 2025 International Conference on Business Intelligence for Technology Innovation (ICBITI). Automated Data Analytics with GPTs under ChatGPT: Lessons Learned from Two Case Studies View
Spiță M, Schipor O. 2026 International Conference on Development and Application Systems (DAS). Scholar Augment: An End-to-End Multi-LLM Platform for Accelerating Systematic Literature Reviews and Semantic Corpus Analysis View
Hacker P, Edwards L, Kasirzadeh A. Proceedings of the 2026 ACM Conference on Fairness, Accountability, and Transparency. AI, Digital Platforms, and the New Systemic Risk View
Drašković N, Rešetar M, Vejzagić V. 2026 49th MIPRO ICT and Electronics Convention (MIPRO). Managerial Acceptance of Generative Artificial Intelligence for Strategic Decision Making: A Review of Evidence and Frameworks View
Mai H, Li W, Chen J, Huang B, Li B, Long S. 2026 3rd International Conference on Computer and Multimedia Technology (ICCMT). A Multi-Source Knowledge Verification-Based Hallucination Mitigation Algorithm for Generative Writing Models View

Citation

Please cite as:

Chelli M, Descamps J, Lavoué V, Trojani C, Azar M, Deckert M, Raynier JL, Clowez G, Boileau P, Ruetsch-Chelli C
Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis
J Med Internet Res 2024;26:e53164
doi: 10.2196/53164 PMID: 38776130 PMCID: 11153973

This paper is in the following e-collection/theme issue:

Artificial Intelligence (4591) Epublishing and Open Access (154) Methods and Instruments in Medical Informatics (399) Reviews in Medical Education (282) New Methods and Approaches in Medical Education (617) Chatbots and Conversational Agents (1145) Generative Language Models Including ChatGPT (1443)
