Published in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/64452.
Performance of Large Language Models in Numerical Versus Semantic Medical Knowledge: Cross-Sectional Benchmarking Study on Evidence-Based Questions and Answers


Eden Avnat 1,2, MPH, MD; Michal Levy 3,4, BCS, MD; Daniel Herstain 1, MD; Elia Yanko 5, BSc; Daniel Ben Joya 2,6, MD; Michal Tzuchman Katz 2, MD; Dafna Eshel 2, MD; Sahar Laros 1,2, BMedSci; Yael Dagan 1,2, BMedSci; Shahar Barami 1,2, BMedSci; Joseph Mermelstein 2, BCS; Shahar Ovadia 2, MCS; Noam Shomron 1, PhD; Varda Shalev 1, MD, MPH; Raja-Elie E Abdulnour 7, MD

1 Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel

2 Kahun Medical Ltd, Givatayim, Israel

3 Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel

4 School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel

5 The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel

6 Kaplan Medical Center, Rehovot, Israel

7 Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States

Corresponding Author: