Published on in Vol 26 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/52113, first published .
Assessing ChatGPT’s Mastery of Bloom’s Taxonomy Using Psychosomatic Medicine Exam Questions: Mixed-Methods Study

Assessing ChatGPT’s Mastery of Bloom’s Taxonomy Using Psychosomatic Medicine Exam Questions: Mixed-Methods Study

Assessing ChatGPT’s Mastery of Bloom’s Taxonomy Using Psychosomatic Medicine Exam Questions: Mixed-Methods Study

Journals

  1. Masters K, Benjamin J, Agrawal A, MacNeill H, Pillow M, Mehta N. Twelve tips on creating and using custom GPTs to enhance health professions education. Medical Teacher 2024;46(6):752 View
  2. Noda M, Ueno T, Koshu R, Takaso Y, Shimada M, Saito C, Sugimoto H, Fushiki H, Ito M, Nomura A, Yoshizaki T. Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study. JMIR Medical Education 2024;10:e57054 View
  3. Huang K. Evaluating GPT-4’s Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications. Journal of Medical Internet Research 2024;26:e56997 View
  4. Herrmann-Werner A, Festl-Wietek T, Holderried F, Herschbach L, Griewatz J, Masters K, Zipfel S, Mahling M. Authors’ Reply: “Evaluating GPT-4’s Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications”. Journal of Medical Internet Research 2024;26:e57778 View
  5. Bharatha A, Ojeh N, Fazle Rabbi A, Campbell M, Krishnamurthy K, Layne-Yarde R, Kumar A, Springer D, Connell K, Majumder M. Comparing the Performance of ChatGPT-4 and Medical Students on MCQs at Varied Levels of Bloom’s Taxonomy. Advances in Medical Education and Practice 2024;Volume 15:393 View
  6. Buldur M, Sezer B. Evaluating the accuracy of Chat Generative Pre-trained Transformer version 4 (ChatGPT-4) responses to United States Food and Drug Administration (FDA) frequently asked questions about dental amalgam. BMC Oral Health 2024;24(1) View
  7. Holderried F, Stegemann-Philipps C, Herrmann-Werner A, Festl-Wietek T, Holderried M, Eickhoff C, Mahling M. A Language Model–Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study. JMIR Medical Education 2024;10:e59213 View
  8. Vidhani D, Mariappan M. Optimizing Human–AI Collaboration in Chemistry: A Case Study on Enhancing Generative AI Responses through Prompt Engineering. Chemistry 2024;6(4):723 View
  9. Kipp M. From GPT-3.5 to GPT-4.o: A Leap in AI’s Medical Exam Performance. Information 2024;15(9):543 View
  10. Zhang Q, Hou H, Ju Y, Yuan J, Zhang K, Wang H, Chen J. Category Mapping of Emergency Supplies Classification Standard Based on BERT-TextCNN. Systems 2024;12(9):358 View
  11. Boyer J, Boyer W. A Strategic Roadmap for Mitigating Generative Artificial Intelligence Hallucinations. Cureus Journals 2024 View
  12. Sallam M, Al-Salahat K, Eid H, Egger J, Puladi B. Human versus Artificial Intelligence: ChatGPT-4 Outperforming Bing, Bard, ChatGPT-3.5 and Humans in Clinical Chemistry Multiple-Choice Questions. Advances in Medical Education and Practice 2024;Volume 15:857 View
  13. Kolding S, Lundin R, Hansen L, Østergaard S. Use of generative artificial intelligence (AI) in psychiatry and mental health care: a systematic review. Acta Neuropsychiatrica 2025;37 View
  14. Boyer J, Boyer W. A Strategic Roadmap for Mitigating Generative Artificial Intelligence Hallucinations. Cureus Journal of Computer Science 2024 View
  15. Eisinger F, Holderried F, Mahling M, Stegemann–Philipps C, Herrmann–Werner A, Nazarenus E, Sonanini A, Guthoff M, Eickhoff C, Holderried M. What’s Going On With Me and How Can I Better Manage My Health? The Potential of GPT-4 to Transform Discharge Letters Into Patient-Centered Letters to Enhance Patient Safety: Prospective, Exploratory Study. Journal of Medical Internet Research 2025;27:e67143 View
  16. Ng O, Phua D, Chu J, Wilding L, Mogali S, Cleland J. Answering Patterns in SBA Items: Students, GPT3.5, and Gemini. Medical Science Educator 2024;35(2):629 View
  17. Luo Y, Miao Y, Zhao Y, Li J, Chen Y, Yue Y, Wu Y. Comparing the Accuracy of Two Generated Large Language Models in Identifying Health-Related Rumors or Misconceptions and the Applicability in Health Science Popularization: Proof-of-Concept Study. JMIR Formative Research 2024;8:e63188 View
  18. Chang Y, Yin J, Li J, Liu C, Cao L, Lin S. Applications and Future Prospects of Medical LLMs: A Survey Based on the M-KAT Conceptual Framework. Journal of Medical Systems 2024;48(1) View
  19. Qiu Y, Liu C. Capable exam-taker and question-generator: the dual role of generative AI in medical education assessment. Global Medical Education 2025 View
  20. Wei Y, Peng Z. Application of the flipped classroom model based on Bloom’s Taxonomy of Educational Objectives in endodontics education for undergraduate dental students. PeerJ 2025;13:e18843 View
  21. Lubbe A, Marais E, Kruger D. Cultivating independent thinkers: The triad of artificial intelligence, Bloom’s taxonomy and critical thinking in assessment pedagogy. Education and Information Technologies 2025;30(12):17589 View
  22. Zhou J, Zhang J, Wan R, Cui X, Liu Q, Guo H, Shi X, Fu B, Meng J, Yue B, Zhang Y, Zhang Z. Integrating AI into clinical education: evaluating general practice trainees’ proficiency in distinguishing AI-generated hallucinations and impacting factors. BMC Medical Education 2025;25(1) View
  23. Jongkind R, Elings E, Joukes E, Broens T, Leopold H, Wiesman F, Meinema J. Is your curriculum GenAI-proof? A method for GenAI impact assessment and a case study. MedEdPublish 2025;15:11 View
  24. Guo S, Li G, Du W, Situ F, Li Z, Lei J. The performance of ChatGPT and ERNIE Bot in surgical resident examinations. International Journal of Medical Informatics 2025;200:105906 View
  25. Altermatt F, Neyem A, Sumonte N, Villagrán I, Mendoza M, Lacassie H. Evaluating the Performance of Large Language Models on the CONACEM Anesthesiology Certification Exam: A Comparison with Human Participants. Applied Sciences 2025;15(11):6245 View
  26. Abouzeid E, Wassef R, Jawwad A, Harris P. Chatbots’ Role in Generating Single Best Answer Questions for Undergraduate Medical Student Assessment: Comparative Analysis. JMIR Medical Education 2025;11:e69521 View
  27. Azzi A, Erdős F, Németh R, Varadarajan V, Afrifa S. Comparative analysis of NLP-driven MCQ generators from text sources. Computers and Education: Artificial Intelligence 2025;9:100440 View
  28. Fenton A. Reconsidering the Use of Oral Exams and Assessments: An Old Way to Move Into a New Future. Educational Researcher 2025;54(7):430 View
  29. Hachoumi N, Eddabbah M, El adib A. Enhancing teaching and learning in health sciences education through the integration of Bloom's taxonomy and artificial intelligence. Informatics and Health 2025;2(2):130 View
  30. Tai H, Kovarik C. ChatGPT-4’s Level of Dermatological Knowledge Based on Board Exam Review Questions (Preprint). JMIR Dermatology 2025 View
  31. Mavrych V, Yousef E, Yaqinuddin A, Bolgova O. Large language models in medical education: a comparative cross-platform evaluation in answering histological questions. Medical Education Online 2025;30(1) View
  32. Atre M, Karandikar S, Merchant K, Suryawanshi A, Patil H. Revolutionizing Educational Assessment Using Bloom’s Taxonomy Bot. WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS 2025;22:593 View
  33. García-Ramírez Y. AI Chatbots as Tools for Designing Evaluations in Road Geometric Design According to Bloom’s Taxonomy. Applied Sciences 2025;15(16):8906 View
  34. Wu R, Zong H, Wu E, Li J, Zhou Y, Zhang C, Zhang Y, Wang J, Tang T, Shen B. Improving large language models for miRNA information extraction via prompt engineering. Computer Methods and Programs in Biomedicine 2025;271:109033 View
  35. Yang X, Chen W. The performance of ChatGPT on medical image-based assessments and implications for medical education. BMC Medical Education 2025;25(1) View
  36. Goorts L, Hollevoet R, Xia V, Cammaerts F, Güngör A. How do LLMs perform in the context of MCQs across different levels of thinking skills in a business education course at higher education? A comparison of ChatGPT, Gemini, and Copilot. Computers and Education: Artificial Intelligence 2025;9:100475 View
  37. Juanda J, Azis A, Ramadhan Z. Pemetaan Kompetensi Membaca dan Menulis Akademik Mahasiswa dan Artificial Intelligence dalam Praktik Literasi Ilmiah. Jurnal Onoma: Pendidikan, Bahasa, dan Sastra 2025;11(4):4004 View
  38. Al‐Haj Ali S. Reliability of Multimodal AI for Assessing Preclinical Stainless Steel Crown Preparations: A Comparative Study With Human Experts. International Journal of Paediatric Dentistry 2025 View
  39. Ramachandran R, Wey S. Mastering Ophthalmology in the Digital Age. JAMA Ophthalmology 2025 View
  40. Adnans A, Serang Y, Eunike I, Silalahi A. Does ChatGPT-enhanced collaborative learning foster critical thinking in education? A Bloom’s Taxonomy perspective. Computers and Education Open 2025;9:100316 View
  41. Emirtekin E, Özarslan Y. Automatic Short‐Answer Grading in Sustainability Education: AI–Human Agreement. Journal of Computer Assisted Learning 2026;42(1) View

Books/Policy Documents

  1. Pears M, Konstantinidis S. Disruptive Technologies in Education and Workforce Development. View
  2. Karakose T, Polat H. Social Robots in Education. View
  3. Adams L, Alzaabi I. Blending Human Intelligence With Technology in the Classroom. View

Conference Proceedings

  1. Gavhane J, Pagare R. 2025 IEEE Global Engineering Education Conference (EDUCON). Revolutionizing Academic Evaluation: Bloom's Taxonomy Meets Deep Learning and NLP View
  2. Sánchez-Rodriguez E, Merino-Soto C, Chen M, Zavala G, Chans G. 2025 IEEE Global Engineering Education Conference (EDUCON). AI-Supported Learning: Integrating ChatGPT to Enhance Cognitive Skills in STEM Education View