Accessibility settings

Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/84120, first published .
Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Journals

  1. Lin X, Yang Y, Ren Y. Making Chatbots more human: deep reasoning large language models in ophthalmology. Frontiers in Medicine 2026;12 View
  2. Spieser J, Balapour A, Meller J, Patra K, Shamsaei B. A Review of Multi-Agent AI Systems for Biological and Clinical Data Analysis. Methods and Protocols 2026;9(2):33 View
  3. Zhu Q, Li Q, Zan Y, Lu Y, Xia L, Xia Y, Xu T. Patient-centered gastrointestinal function assessment technologies: a paradigm shift from traditional approaches to non-invasive innovations. Frontiers in Physiology 2026;17 View
  4. Prause M. No skin in the game: why agentic AI requires principal-agent governance. AI and Ethics 2026;6(2) View