Abstract
Background: Global digitalization continues to advance, extending its influence into medicine and health care systems worldwide. In recent years, substantial advancements have been made in the research and development of artificial intelligence (AI), raising questions about its potential in medicine. The integration and application of AI in intensive care medicine, particularly in sepsis treatment, presents significant potential for advancing patient outcomes and enhancing patient-relevant benefits. However, a comprehensive and systematic overview of the full spectrum of patient-relevant benefits associated with AI-based clinical decision support systems (CDSS) remains lacking.
Objective: This scoping review aimed to identify and categorize evidence on patient-relevant benefits of AI-based CDSS in sepsis care.
Methods: Systematic research was conducted in 4 electronic databases: MEDLINE via PubMed, Embase, the ACM Digital Library, and IEEE Xplore. In addition, a comprehensive search on the websites of relevant international organizations, along with a citation search of the included articles, was conducted. Articles were included if they (1) focused on sepsis and (2) described patient-relevant benefits of AI-based CDSS. Articles published between January 1, 2008, and March 2, 2023, were considered for inclusion. Study selection was performed independently by 2 reviewers. The manuscript was drafted in accordance with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) checklist. The analysis of the included articles was conducted using the program MAXQDA (VERBI Software GmbH), with systemization finalized in a consensus workshop.
Results: A total of 3368 records were identified across the 4 databases, of which 24 met the inclusion criteria and were included in the scoping review. The additional search on international websites and in reference lists identified 6 more relevant articles, resulting in 30 included studies. Of these, 20 were quantitative, comprising 7 prospective and 13 retrospective designs. In addition, 1 qualitative study, 1 mixed methods study, 6 review articles, and 2 articles from institutional websites were included. Patient-relevant benefits were systematized in six main categories: (1) prediction, (2) earlier treatment and prioritization of high-risk patients, (3) individualized therapy, (4) improved patient outcomes (including improved Sequential Organ Failure Assessment score, reduced length of stay, and reduced mortality), (5) general improvements in care, and (6) reduced readmission rate.
Conclusions: This scoping review underscores the potential of AI-based CDSS to positively impact patient-relevant benefits, particularly in sepsis care, where they demonstrate considerable promise for improving intensive care. However, the majority of the identified studies rely on retrospective database analyses. Future research should focus on validating these findings through prospective studies.
doi:10.2196/76772
Keywords
Introduction
The treatment of infectious diseases has historically resulted in medical progress, exemplified by antibiotics and vaccines. Despite all medical advances, infections remain a major global cause of morbidity and mortality [,]. Sepsis, defined as “life-threatening organ dysfunction caused by a dysregulated host response to infection” [], remains among the top contributors to worldwide mortality. It accounts for 30%‐50% of all hospital deaths in high-income countries, such as the United States [], and approximately 11 million annual deaths worldwide []. Sepsis is a heterogeneous syndrome with variable phenotypes and outcomes. Thus, the interpretation of initial symptoms can be difficult for health care providers [,].
The effectiveness and accuracy of established rule-based scoring systems used for the assessment of patients in the intensive care unit (ICU), such as the systemic inflammatory response syndrome (SIRS) criteria, which were historically of importance, the sequential organ failure assessment (SOFA) score or the quickSOFA (qSOFA) score, the acute physiology and chronic health evaluation II (APACHE II) score, or the national early warning score 2 (NEWS2), is limited. This is partly because these scoring systems are not always specifically developed for sepsis patients and are therefore of limited use to health care providers in this context [,]. Nevertheless, timely identification and treatment are crucial to enhance patient outcomes [-], as untreated sepsis can progress to septic shock, exacerbating the patient’s condition [] and leading to multiple organ failure, which carries an even higher mortality rate than sepsis itself [].
This is where recent developments in artificial intelligence (AI) become particularly relevant, as they are considered to hold substantial potential for improving sepsis diagnostics. Especially machine learning (ML), a branch of AI, has the ability to rapidly analyze vast amounts of data, exceeding human capacity to process. By evaluating numerous data points, ML can derive conclusions and recognize correlations that a human health care provider would be incapable of identifying. This is why ML is well-suited as a technological foundation for clinical decision support systems (CDSS), particularly in the complex clinical picture of sepsis []. The use of ML in the development of CDSS can make the sepsis diagnosis more reliable, with the prospect of long-term improvements in patient outcomes. Machine learning algorithms (MLAs) demonstrated potential to enhance patient-relevant benefits in distinct studies. Documented benefits include, for example, reductions in sepsis-related mortality and the average hospital length of stay (LOS). Additionally, MLAs facilitate earlier interventions, such as the timely administration of antibiotics [-].
Despite the high clinical relevance of sepsis and significant advancements in both the availability of digital patient data and in the field of ML, the real-world application of AI-based CDSS remains negligible. The majority of these algorithms remain in the prototype phase, with deployment limited to a single hospital or a single hospital operator. This gap is highlighted by an analysis of the Food and Drug Administration’s database of medical devices using AI or ML. As of April 2025, none of the over 1000 listed products are specifically dedicated to intensive care [], the medical field at the forefront of sepsis treatment. This illustrates the discrepancy between technological progress and its real-world implementation in the critical care environment. For AI-based CDSS to be successfully implemented in clinical practice, it is a necessary prerequisite that they demonstrate tangible added value. Accordingly, patient-relevant benefits should constitute a primary focus.
The research objective of the present study differs from those of previous scoping reviews on AI-based CDSS in sepsis care. Certain reviews focused specifically on neonatal [] or pediatric [] sepsis, whereas others concentrated on tasks for which MLAs were designed—such as risk assessment, treatment planning, or process support—and thus focused on the process of medical service delivery rather than on actual patient-relevant benefits [] or on the actual design of the CDSS and its intended users []. Importantly, none of the aforementioned reviews [-] focused exclusively on patient-relevant benefits. Furthermore, several existing scoping reviews used narrow methodological approaches, for example, being restricted to a single ML method [] or considering only antibiotic treatment of sepsis []. To the authors’ knowledge, no other scoping review has explicitly examined the patient-relevant benefits of AI-based CDSS in the context of sepsis while applying a broad and exploratory methodological approach, without restrictions regarding the ML methods used or the types of patient-relevant benefits assessed. Accordingly, the objective of the present study is to identify patient-relevant benefits of AI-based CDSS in sepsis care compared with the current standard of care, thereby addressing this research gap, as patient-relevant benefits constitute a meaningful benchmark for evaluating the value of any medical innovation. In this context, a taxonomy of benefits comprising 6 main categories has been developed.
This scoping review was conducted within the framework of the KI@work (User-Oriented Requirements for AI-Based Clinical Decision Support Systems) project, which is funded by the German Federal Joint Committee (funding code: 01VSF22050). The research project is led by the Institute for Health Care Management and Research at the University of Duisburg-Essen. Consortium partners include the Department of Anesthesiology, Intensive Care Medicine and Pain Therapy at the University Hospital Knappschaftskrankenhaus Bochum, the Knappschaft Kliniken GmbH, the Department of Medical Informatics, Biometry and Epidemiology at the Ruhr University Bochum and the German Sepsis Society. This scoping review addressed 2 additional research questions. However, to ensure a coherent presentation of the findings, this article focuses exclusively on patient-relevant benefits.
Methods
Overview
This scoping review is based on the methodology framework of the Joanna Briggs Manual for evidence synthesis [], a further development of the work of Arksey and O’Malley [] and Levac et al []. The review process followed the five stages originally described by Arksey and O’Malley: (1) identifying the research question, (2) identifying relevant studies, (3) study selection, (4) charting the data, and (5) collating, summarizing, and reporting the results []. The manuscript was prepared according to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) checklist by Tricco et al [] (). As scoping reviews encompass a broad range of study types in order to present a comprehensive overview of the research field [-], comparability between studies is limited. Consequently, no formal quality appraisal was conducted. Although no distinct protocol for the scoping review was published, the methodology was described in detail in a protocol for the overarching multimethod research project [].
Search Strategy
The development of the search strategy commenced with an initial limited search in MEDLINE via PubMed and Embase to identify relevant search terms. Subsequently, the identified terms were discussed in recurring team discussions within the consortium. In a third step, the consented search terms were combined into search queries.
The electronic databases MEDLINE via PubMed, Embase, as well as the ACM Digital Library and IEEE Xplore, were searched for relevant literature on March 2, 2023. The databases were selected to ensure that the interdisciplinary research question could be adequately addressed from both a medical and a computer science perspective. The search string was developed using the PCC (population=persons with or at risk of sepsis, concept=CDSS, and context=AI) framework. The MEDLINE via PubMed search string was quality-assured by the chief librarian at the library of the University Medical Centre Essen before the database search was conducted. The other 3 search strings were developed based on the same quality assurance principles as the MEDLINE via PubMed search query. The individual search terms were limited to occurrences in title, abstract, and keyword searches but were supplemented by indexing terms (MeSH and Emtree) and truncations. The final search strategies for each database can be found in -.
In agreement with ML experts (NT, HN), the search was limited to articles published in the last 15 years. Further explanation for the time restriction is provided in the discussion of this article. The search was restricted to English and German. In cases of missing full texts, the interlibrary loan service of the University of Duisburg-Essen was used. If that approach was not successful, the reviewers contacted the respective authors of the papers of interest. The identified citations were imported into the reference management program Endnote 20 (Clarivate Analytics).
In addition to the systematic search of electronic databases, a structured search for gray literature (eg, working papers and guidelines) from various governmental and nongovernmental stakeholders was conducted via their websites. The selection of countries included in the search was based on the results of the Bertelsmann #SmartHealthSystem study, which examined the degree of digitalization of various health care systems in 2018. It was assumed that the prospect of identifying information on AI-based CDSS would be particularly high in countries with highly digitalized health care systems. According to the Bertelsmann study, this applies to the health care systems of Canada, Denmark, Estonia, Israel, and Spain. In addition, 3 large economies—Germany, the United Kingdom, and the United States—were included in the structured research. Alongside institutional websites from these countries, websites of relevant international stakeholders were also examined. These included the World Health Organization (WHO) and the Organisation for Economic Co-operation and Development (OECD), as well as websites of international sepsis, intensive care, and medical informatics associations. Further information about included websites can be found in . To supplement further evidence, reference lists of articles identified through the systematic and structured search were screened, and the cited articles were subsequently assessed for eligibility. If eligible, the referenced articles were included in the scoping review.
Eligibility Criteria
Exploratory research and internal discussions contributed to the development of inclusion and exclusion criteria, which were refined iteratively during the initial stages of the research process. The search strategy was designed to address 3 different research questions. Studies were considered for inclusion if they described (1) patient-relevant benefits of AI-based CDSS in the context of sepsis as well as (2) problems in their development, implementation, or application, or (3) suggestions for improving these processes. Patient-relevant benefits were identified entirely exploratively and categorized independently of existing frameworks, allowing AI-based CDSS benefits to be classified without reliance on established definitions or patient-relevant endpoints. This approach provides a comprehensive and complete overview of the potential benefits of this emerging technology, without constraining the findings of this paper to predefined frameworks and definitions. Patient-relevant benefits were defined as the positive impact of an intervention on patients, irrespective of whether these comprise general qualitative observations or specific, measurable quantitative endpoints. Specific inclusion and exclusion criteria were developed for each research question to ensure a tailored approach to the unique scope of each question. AI was defined as ML-based algorithms that operate as a “black box” for the user (physician or caregiver), meaning their output is not directly interpretable for health care providers. Consequently, all ML-based technologies developed through data-driven training and sufficiently complex to preclude full comprehension by the user were eligible for inclusion. In contrast, rule-based algorithms, such as those relying on SIRS or SOFA criteria, did not meet this definition and were therefore excluded in this review. Moreover, earlier diagnosis facilitated by AI-based CDSS was not considered a patient-relevant benefit, as earlier diagnosis itself has no impact on patient outcomes. It is the interventions that follow an earlier diagnosis—such as increased attention by health care providers to patients developing sepsis or earlier initiation of treatment—that positively influence patient-relevant benefits. Accordingly, these parameters are pertinent to the scope of this review. Articles were selected regardless of the research method used. The inclusion criteria are presented in .
Exclusion criteria for this review were not answering the research question, an exclusively technical description of the algorithms developed, or exclusively mathematical approaches not providing evidence for patient-relevant benefit. In addition, articles were excluded if they focused only on the evaluation of binary classifiers such as sensitivity, specificity, positive predictive value, or negative predictive value, as the superiority of AI-based algorithms over rule-based scores was considered a prerequisite for such systems. AI-based CDSS developed exclusively for neonates and/or children or for animals were also not included, because (1) the treatment of neonatal or pediatric sepsis patients differs significantly from the treatment of adult patients [-] and (2) the focus of the study is on human sepsis. Articles published before 2008 were also excluded, as were those written in languages other than English or German. Research protocols, conference abstracts, letters to the editor, and articles that were only expressions of opinions were also excluded. The exclusion criteria are listed in .
Inclusion criteria
- Articles focusing on sepsis and
- Involving AI-based CDSS, that
- Describe patient-relevant benefits, or
- Describe problems with development, implementation, or application, or
- Describe strategies for success
Exclusion criteria
- Exclusively technical description of systems, or
- Focus on description of the evaluation of binary classifiers, or
- Articles describing AI-based CDSS for neonates and children or animals, or
- Not addressing any of the research questions in more detail, or
- Research protocols, conference abstracts, theses, letters to the editor, or expression of opinions, or
- Article published before 2008, or
- Language other than English or German
Evidence Screening, Selection, and Data Extraction
After identification and deletion of duplicates, title and abstract screening was conducted independently by 2 reviewers (PR and GDG) to decide whether an article was eligible for full-text screening. In a second step, the same 2 reviewers conducted a full-text screening of the included articles against the inclusion and exclusion criteria. In case of disagreement between the 2 reviewers during step 2 of the screening process, other members of the study team (NB, HN, and NT) were involved to decide whether an article was eligible for inclusion.
MAXQDA (VERBI Software GmbH) software was used to identify and tag relevant content in the included articles and to precategorize the patient-relevant benefit categories (PR) using an inductive coding approach. The preliminary categories were discussed and further refined in an in-person workshop based on the affinity mapping technique (PR, NB, and GDG). For this purpose, all relevant text passages were printed as snippets and physically assigned to the respective preliminary categories before being refined and finalized during the workshop. Each assignment was discussed in detail until full consensus among all 3 team members was reached. The results of the workshop were subsequently digitalized in Microsoft Excel. In addition to the patient-relevant benefits of AI-based CDSS, metadata, such as participating authors, year of publication, country of study, database for MLA, or study type, were extracted and summarized (see ).
Analysis and Presentation of Results
The results of the included studies were summarized descriptively, and analysis was conducted to derive implications for policy, practice, and research. The patient-relevant benefits were grouped into 6 main categories. The main categories were presented in tabular form in an Excel file and diagrammatically. The patient-relevant benefit categories are presented chronologically in .
Results
Selection of Sources of Evidence
Selection Process
In the systematic search, a total of 3368 titles and abstracts were retrieved. After removing 850 duplicates, 2518 articles remained for screening (). Of these, 141 articles were screened for full text, and 39 met the inclusion criteria. Among these, 24 provided statements on patient-relevant benefits [-]. In addition, reference lists of the articles identified through the systematic search were analyzed, resulting in the identification of 5 additional articles, 2 of which reported information on patient-relevant benefits [,]. A complementary search of institutional websites led to the inclusion of 5 additional articles, 4 of which contained relevant information on patient-relevant benefits [-]. In total, 30 articles were included in the scoping review about patient-relevant benefits. The full-text screening process, including a detailed account of the reasons for exclusion, is presented in .

Included Studies
Of the 30 articles included, 16 originated from North America, all of which are from the United States (53.3%) [,-,,-,,,,,,,,]. Seven articles stem from Europe (23.3%); 3 from the Netherlands (10%) [,,], 2 from Spain (6.6%) [,], 1 from Austria (3.3%) [], and 1 from the United Kingdom (3.3%) []. Five articles are from Asia (16.7%), including 2 each from China [,] and Taiwan [,] (6.6% each), and 1 from Singapore (3.3%) []. There is 1 article from Australia (3.3%) [] and 1 article from South America (Brazil) (3.3%) [].
The study designs used in the included articles cover a wide range. Overall, 20 quantitative articles were identified. Of these, 7 used a prospective study design, of which 2 are multicenter studies [,] and 5 are single-center studies [,,,,]. Thirteen of the quantitative studies used a retrospective approach, comprising 6 research database studies [,,,,,] and 7 electronic health record database studies [,,,,,,]. In addition to the quantitative articles, 1 article used a qualitative approach [] and another applied a mixed methods approach []. Additionally, 6 review articles [,,,,,] and 2 articles from news sections of institutional websites were identified [,]. All articles are listed in .
Synthesis of Results
In total, 6 main categories of patient-relevant benefit were identified. The 6 main categories identified reflect the patient pathway from pretreatment to posttreatment period. They include (1) prediction, (2) earlier treatment and prioritization of high-risk patients, (3) individualized therapy (which encompasses patient-centered care), (4) improved patient outcomes (which includes improved SOFA score, reduced length of stay, and reduced mortality), (5) general improvements in care, and (6) reduced readmission rate (see ). gives a detailed overview of the benefit categories addressed in each study.

Pretreatment Period (Prediction)
Prediction of sepsis, septic shock, or sepsis-related organ dysfunction was addressed in 12 articles [,,,,,,,,,,,], comprising 7 quantitative studies (1 prospective [] and 6 retrospective [,,,,,]), 1 qualitative study [], 3 reviews [,,], and 1 institutional news report []. The MLAs identified in this review indicate predictive capacity [], which may be further optimized through algorithm fine-tuning []. Findings suggest that these models may predict sepsis between 4 and 48 hours prior to its onset [,,,,], even before significant changes in vital or laboratory parameters become apparent []. MLAs were reported to support the identification of appropriate preventive measures []. Such predictions may have the potential to improve patient outcomes by providing timely warning of sepsis onset [,]. Beyond sepsis, the studies also reported the prediction of septic shock, with MLA predictions occurring between 4 and 7 hours before the onset of septic shock [,]. Compared with traditional rule-based routine screening protocols, predictive MLAs demonstrated superior early warning performance, identifying 58.6% more patients before organ dysfunction [] and potentially contributing to a reduction in septic shock incidence []. Moreover, MLAs were shown to predict sepsis-related organ dysfunction approximately 7.5 hours earlier than rule-based routine screening protocols [].
Treatment Period
Earlier Treatment and Prioritization of High-Risk Patients
Earlier treatment facilitated by MLAs was reported in 10 articles [,,,,,,,,,], including 7 quantitative studies (4 prospective [,,,] and 3 retrospective [,,]), 1 mixed methods study [], 1 qualitative study [], and 1 institutional news report []. One retrospective study reported that the used MLA enabled earlier treatment up to 40 hours before the onset of sepsis [], while another indicated that the use of ML may reduce the time to treatment, not providing a specific time reduction []. Earlier treatment was reported to enable intervention before or during clinical deterioration [] and potentially prevent sepsis progression []. It may allow for early identification and intervention of patients at high risk for severe sepsis prior to clinical onset []. Additionally, the literature highlighted early identification and control of the pathogen causing sepsis []. Detecting patients before the onset of septic shock may facilitate earlier clinical assessment, diagnostic testing, therapeutic interventions, and transfer to appropriate levels of care []. Ultimately, earlier treatment may improve patient outcomes [] and lead to an alteration in the prevalence of septic shock through timely intervention by health care providers from 5.3% in the control group to 1.5% in the experimental group (–71.7%) of the corresponding study []. MLAs were also associated with shorter times to obtain blood cultures (0.98‐2.79 hours) [,], fluid administration (1.05 hours) [] and earlier administration or adjustment of antibiotics (0.55‐2.76) [,,,]. A positive correlation between timely evaluation of MLA alerts and quicker administration of antibiotics was reported, as earlier evaluation of alerts leads to faster use of antibiotics []. Finally, a quantitative prospective study reported that MLAs allow prioritization of high-risk patients, with the targeted real-time early warning system (TREWS) identifying who is most likely to benefit from timely treatment [].
Individualized Therapy
MLAs were reported to support individualized, patient-centered therapy in 11 articles [,,,,,-,,], comprising 10 quantitative studies (1 prospective [] and 9 retrospective [,,,,,-,]) and 1 review []. Four main approaches for individualization were identified: (1) subgroup analyses and clustering, (2) optimized substance administration, (3) personalized nursing care, and (4) general statements. Three articles reported subgroup analyses and clustering of patients [,,], which may enable hospitals to provide targeted treatments tailored to the specific needs of defined subgroups [] and classify patients according to their diverging mortality risk due to factors such as fluid overload or norepinephrine overdose. Such classification might support the development of tailored resuscitation strategies for patients with septic shock []. Furthermore, subgroup analyses applied to populations with differing disease severity and progression allow MLAs to adjust the intensity of therapy []. The use of MLAs for optimal substance administration was reported in 6 articles [,,,,,]. Applications include personalized antibiotic dosing [], faster adjustment to the most effective antibiotics, and drug resistance prediction. The comprehensive Intelligent Antimicrobial System demonstrated potentially faster drug resistance prediction times compared with conventional methods, requiring 39.8 hours for carbapenem-resistant Klebsiella pneumonia and 40.9 hours for methicillin-resistant Staphylococcus aureus, compared with 99.5 and 106.4 hours, respectively, using traditional methods []. MLA use was also associated with an 8% reduction in antibiotic resistance [], may shorten the time to antimicrobial resistance detection by 37 hours [], and was reported to reduce the duration of antibiotic treatment []. Furthermore, MLAs may support physicians in selecting appropriate antibiotic therapy [], with AI-based antibiotic stewardship linked to decreased Clostridium difficile infections []. Beyond antibiotics, MLAs have demonstrated utility in optimizing dosing strategies for norepinephrine [], vasopressors [], corticosteroids [], and fluid volume management []. MLAs also reported to enhance nursing competence and support more evidence-based, personalized nursing care []. General statements on individualized therapy were identified in 4 articles [,,,], including personalized treatment to support physicians in diagnosing and managing bacteremia [], facilitation of shared decision-making through preoperative discussions [], improved physician adherence [], and more precise treatment tailored to individual patients [].
Improved Patient Outcomes
Improved SOFA Score
Improved SOFA scores associated with the application of and timely response to MLAs were reported in 1 quantitative prospective study []. The SOFA score, the predominant measure for assessing the severity of organ dysfunction, is closely linked to the probability of mortality, with a higher score indicating an increased probability of death [,]. Using the TREWS algorithm, Adams et al [] reported a SOFA score progression of –0.8 in their intervention group, compared to –0.4 in the control group. The article highlights a disproportionate reduction in the SOFA score for high-risk patients compared to nonhigh-risk patients. Additionally, timely evaluation and confirmation of the TREWS alerts is associated with improvements in SOFA score progression.
Reduced Length of Stay
Seven quantitative articles reported reductions in LOS [,,,,,,], including 6 prospective [,,,,,] and 1 retrospective study []. Based on the identified literature, a distinction can be drawn between (1) specific reductions, reported in absolute or relative terms [,,,,,], and (2) general statements without precise quantification [,,]. Reported specific reductions in hospital LOS ranged from 0.43 to 8.1 days [,,,,], corresponding to decreases of 12.84%-45.25% [,,,]. Reported reductions in ICU LOS varied between 2.09 and 10.5 days [,]. One study also highlighted that shorter ICU stays may contribute to an overall reduction in hospital LOS, although LOS on the general ward increased by 2.4 days []. Another article reported a potential annual reduction of 1100 days in emergency department stays and the prevention of 34 ICU stays associated with MLA usage in the examined hospital []. General statements indicated a disproportionate, though not statistically significant, reduction in LOS among high-risk patients as well as reduced LOS when MLA-generated alarms were evaluated and confirmed timely []. AI-based antibiotic stewardship was also associated with shorter LOS [] and MLAs were reported to significantly shorten hospital LOS compared to rule-based systems []. Furthermore, 1 study suggested that timely physician responses to MLA-generated alerts may contribute to reduced LOS [].
Reduced Mortality
A reduction in mortality was reported in 14 articles [,,-,,,,,,-], including 9 quantitative studies (6 prospective [,,,,,] and 3 retrospective [,,]) as well as 3 reviews [,,] and 2 institutional news reports [,]. Reported mortality reductions varied in type and presentation, encompassing (1) specific quantitative statements, expressed in relative or absolute terms [,,,,,,,,,], and (2) general statements without numerical specifications [,,,,,,,]. Relative reductions of mortality ranged from 13.19% to 74.94% [,,,,,,], whereas absolute reductions ranged from 1.33% points to 26.4% points [,,,,,,]. One study reported an increase in absolute survival rate of 11.7% and 23.7%, depending on the type of bacteria responsible for the sepsis []. Two articles provided reductions in natural numbers; one projected 22 potentially preventable annual deaths in the emergency department of the China Medical University Hospital [], while another estimated several thousand preventable deaths in the United States alone []. General statements suggested that MLAs may disproportionately reduce mortality among high-risk patient cohorts, particularly when outputs are promptly evaluated and confirmed by physicians []. Improved survival rates may also be linked to the use of MLA-guided antibiotic recommendations [] and the application of the 3PM (predictive, preventive, and personalized medicine) principles []. MLAs are associated with lower mortality compared to traditional physician assessments [,] and predictions generated by rule-based tools []. Additionally, literature provided general statements, offering limited informational depth and indicating that the use of ML may contribute to reduced mortality [,,,,].
General Improvements in Care
Eight articles reported improvements in care associated with MLAs [,,,,-,], including 6 quantitative studies (5 prospective [,,,,] and 1 retrospective []) as well as 2 reviews [,]. Reported benefits can be divided into 2 domains: (1) statements related to time and (2) statements on patient care enhancements. One study reported a reduced duration of septic shock []. Within the patient care enhancement category, MLAs were described as posing no risk to patients and offering potential benefits to patients and health care providers [], reducing events of clinical deterioration [], improving care accuracy [,], and increasing sepsis awareness among physicians [,]. Physicians and nurses also reported perceived improvements in care [].
Posttreatment Period (Reduced Readmission Rate)
Predictive AI-based CDSS were associated with reduced 30-day readmission rates, as reported in 2 quantitative prospective studies [,]. In Burdick et al [], implementation of an MLA reduced the 30-day readmissions from 36.4% to 28.12%, representing a 22.74% reduction compared to the baseline period. McCoy and Das [] reported a decline from 46.19% (188/407) during the preimplementation baseline period to 29.8% (100/336) in a first postimplementation period and further to 25.2% (96/381) in a second postimplementation period. In a subsequent steady-state period, the 30-day readmission rate was further reduced to 7.84% (16/204). Across all surveyed months after implementation, the 30-day readmission rate was 23.03%, representing a 50.14% reduction in the sepsis-related 30-day readmission rate.
Discussion
Principal Findings
This scoping review presents the evidence on the patient-relevant benefits of AI-based CDSS in sepsis care. All articles focusing on sepsis and presenting the influence of AI-based CDSS on patient-relevant benefits, identified through the comprehensive search strategy, were included. In total, 30 articles were identified and integrated into the review. Investigating the literature, there is a number of AI-based CDSS for sepsis treatment developed in the past or currently under development. However, research typically has no or only limited reference to patient-relevant benefits and (1) mostly focuses on problems and/or success strategies [,,-] and/or (2) is indication-independent []. To the best of the authors’ knowledge, this represents the first scoping review on this specific topic.
The findings of this scoping review, systematized into the 6 main categories, (1) prediction, (2) earlier treatment and prioritization of high-risk patients, (3) individualized therapy (which encompasses patient-centered care), (4) improved patient outcomes (which includes improved SOFA score, reduced length of stay, and reduced mortality), (5) general improvements in care, and (6) reduced readmission rate, underscore the potential patient-relevant benefits of AI-based CDSS in sepsis care across the entire inpatient pathway. The literature indicates that MLAs can potentially predict sepsis before its clinical onset [,,,,]. Additionally, septic shock [,] and sepsis-related organ dysfunction [] may be predicted in advance. These predictive capabilities can contribute to reducing the incidence of septic shock [] and supporting decreased mortality rates among sepsis patients []. Sepsis prediction may facilitate timely treatment initiation through the use of MLAs []. This was associated with improved patient outcomes and a decreased prevalence of septic shock []. Furthermore, individualized therapy can potentially have a positive impact on patient-relevant benefits by reducing the time to treatment or LOS for each individual patient []. Moreover, the disproportionate reduction in the SOFA score through the use of ML compared to a control group whose treatment was not supported by MLAs should be mentioned. According to the Sepsis-3 definition, the level of the SOFA score positively correlates with the probability of death [], and a SOFA score of ≥2 points corresponds to a mortality risk of over 10% in hospitalized patients outside the ICU []. The TREWS algorithm presented by Adams et al was able to reduce the SOFA score by 0.8 points, while a reduction of only 0.4 points was observed in the control group. Accordingly, the use of this MLA may contribute to the reduction in mortality. In general, the usage of MLAs was associated with a mortality reduction of up to 74.94% [,,,,,,], with faster response times being associated with greater reductions in mortality []. This demonstrates the medical potential of ML in the treatment of sepsis, particularly when clinical recommendations are accepted and promptly implemented by physicians. With approximately 11 million deaths annually from sepsis according to the WHO [], a corresponding reduction in mortality could translate into a substantial global health impact. Additionally, MLAs were linked to reduced hospital LOS [,,,] and ICU LOS [,]. Beyond their predictive capabilities, facilitation of timely treatment, mortality, and LOS reductions, AI-based CDSS in sepsis care provide further patient benefits, including shortened duration of septic shock [], reduced antibiotic resistance, and reduced duration of antibiotic treatment []. MLAs may also contribute to a reduction of events of clinical deterioration [] and increased physician awareness of sepsis [,]. Finally, the literature indicates that AI-based CDSS in sepsis care can contribute to reducing hospital readmission rates [], further demonstrating their potential to improve patient-relevant benefits.
Comparison With Prior Work
While 6 reviews were included in this work, they primarily focused on other topics and predominantly used less systematic approaches [,,,,,]. Among the included reviews, 4 adopted a narrative review methodology [,,,]. By design, this approach is inherently less systematic than systematic reviews or scoping reviews, and this was evident in the search and selection process of the included narrative reviews. Two relied exclusively on limited, nonsystematic keyword searches, one using 8 keywords across 4 search engines [] and another restricted to 3 keywords in a single database []. Moreover, the review conducted by Ferreira et al [] focused primarily on problems and success strategies related to AI-based CDSS, thereby addressing a different thematic focus than the present scoping review. Another narrative review applied a brief and partial search string without predefined inclusion and exclusion criteria and was limited to a single database [], representing considerable methodological limitations relative to the present comprehensive scoping review. The narrative review conducted by Schinkel et al [] adopted a more systematic approach, using a predefined search string and assessing the clinical value of AI-based systems by evaluating the AUROC as a criterion for article selection. While methodologically more robust, this review nonetheless differed from the present article, as it primarily evaluated the advantage of MLAs over rule-based scores, reflecting the status quo using a binary classifier. The advantage of MLAs over rule-based scores was considered a prerequisite for AI-based CDSS in the present study. With the exception of 1 review, where a manual search of reference lists was conducted [], none of the narrative reviews [,,] undertook a comprehensive search for gray literature or an analysis of the reference lists. Furthermore, only 1 narrative review reported a screening process conducted by 2 independent reviewers [], whereas the other 3 reviews did not provide methodological detail [,,]. By contrast, the present scoping review implemented a rigorous screening process with 2 independent reviewers to enhance objectivity, reliability, and reproducibility. Beyond these narrative reviews, 1 study followed an integrative review approach, explicitly focusing on predictive algorithms and embedding this narrow focus within a brief predefined search string []. In contrast, the present exploratory scoping review aimed to inductively derive patient benefit categories associated with AI-based CDSS in sepsis care. This integrative review relied on a single reviewer for screening [], representing a methodological limitation in comparison with the dual-reviewer approach of the present scoping review. Finally, 1 systematic review included in this study used a largely rigorous and systematic methodology, with the notable exception of a gray literature search, which was not reported. In addition, this systematic review focused primarily on problems and success strategies [], thereby diverging from the present scoping reviews’ explicit focus on patient-relevant benefits. In sum, the present scoping review can be clearly distinguished from the included reviews both methodologically and thematically. By applying a comprehensive, exploratory design, centered on patient-relevant benefits, it makes a substantive and valuable contribution to closing the research gap regarding patient-relevant benefits of AI-based CDSS in sepsis care.
Implications and Recommendations
Patient-relevant benefits identified in the literature are not sufficient to ensure successful implementation of AI-based CDSS. Equally critical is the acceptance of the underlying technology by health care providers and their belief that its use possesses tangible benefits. The unified theory of acceptance and use of technology (UTAUT) provides a framework to understand factors influencing behavioral intention and use behavior using four constructs: (1) performance expectancy, (2) effort expectancy, (3) social influence, and (4) facilitating conditions. In this context, effective design of AI-based CDSS should ensure that providers perceive the system as both beneficial and easy to use, corresponding to the first 2 constructs of the UTAUT. Specifically, (1) users should believe that using AI-based CDSS enhances gains in job performance, and (2) the system is intuitive and easy to operate. Equally important are contextual factors: health care providers should perceive that (3) important others endorse system use, and (4) organizational and technical infrastructure is in place to support usage []. A meta-analysis by Dingel et al [] applying the UTAUT to health care practitioners’ intention to use AI-enabled CDSS confirms that implementation must address not only technical and organizational aspects but also psychological and social factors, particularly fostering user trust. Successful implementation of AI-based CDSS therefore depends only partly on system performance; it is largely contingent on user attitudes and framework conditions.
Beyond the 4 UTAUT constructs, specific barriers [] and facilitators [] must be considered when evaluating AI-based CDSS. A nuanced understanding of these factors is essential to accurately evaluate the potential impact of AI-based CDSS on sepsis care. The current evidence demonstrates a pronounced lack of prospective studies investigating the optimal integration of such systems []. This paucity of implementation-oriented research, coupled with limited clinician acceptance [] and insufficient knowledge of AI among health care providers [], constitutes a substantial barrier to clinical adoption. Concurrently, extant literature highlights pivotal facilitators, emphasizing the importance of prioritizing research on effective integration strategies []. For instance, low acceptance may be mitigated by involving health care providers directly in the design and development of CDSS [-], while targeted training and educational programs could address knowledge gaps among service providers and enhance trust in this technology [,]. These factors must therefore be carefully considered by all stakeholders involved in implementation (eg, caregivers, physicians, and researchers) before real-world adoption can occur. For clinicians, the findings provide insights into realistic benefits, current limitations, and evidence gaps that may guide expectations in clinical decision-making. For researchers, this review underscores the importance of conducting prospective studies and fostering user-centered development to ensure that CDSS effectively translate into clinical practice. In addition, although patient-relevant benefits—and not only measurable patient-relevant outcomes—have been investigated, the findings may contribute to the development of a consistent set of generic patient-relevant outcomes, as proposed by Kersting et al []. This could, in turn, facilitate a shared understanding and enhance comparability across studies targeting patient-relevant outcomes, particularly given the absence of a clear, widely accepted definition and standardized criteria for selecting such outcomes.
Strengths and Limitations
This scoping review was conducted by an interdisciplinary team comprising computer scientists, physicians, statisticians, and (health) economists. This diverse expertise facilitated a comprehensive examination of all relevant aspects across these fields, ensuring a thorough evaluation of the reviewed literature. To address the interdisciplinary research question comprehensively, 4 databases focusing on medicine and informatics were included in the review. In addition, the structured search for gray literature targeted various institutions in 8 countries, including each country’s Ministry of Health, diverse sepsis and intensive care associations for each country, and diverse health informatics associations for each country. The 8 countries were selected based on two criteria: (1) having highly digitalized health care systems and/or (2) holding the status of industrialized nations. These countries were presumed to have a higher likelihood of using AI-based systems. Furthermore, the search encompassed internationally active stakeholders, such as the WHO and OECD, alongside globally active health informatics organizations and sepsis and critical care associations.
Despite all efforts, this scoping review is not free of limitations. Given the exploratory nature of the methodology, publication bias must be considered a potential limitation []. Studies in which AI-based CDSS do not demonstrate improvements in patient-relevant benefits compared with conventional scores may not be submitted in peer-reviewed journals, potentially leading to an overestimation of their true patient benefit. Although no formal risk of bias assessment was conducted, the included studies demonstrated considerable heterogeneity in design. Moreover, 65% (13/20) of the quantitative studies relied solely on retrospective methodologies, in which evidence of patient benefits was demonstrated only theoretically. Consequently, the findings of this scoping review should be interpreted with caution, as the reported effects may be overestimated in the context of real-world care. The overall strength of evidence was limited by the predominance of retrospective study designs and the theoretical nature of reported benefits. In contrast, the included prospective studies provided more robust support for the identified benefit categories. Importantly, each benefit category has been substantiated in prospective studies, thereby affirming its validity in real-world clinical contexts rather than solely theoretically in retrospective or descriptive studies (). Detailed information on the study designs of all included articles is provided in . summarizes benefit categories identified across the respective study designs. Furthermore, the comparability of the reported MLA performance across articles is limited due to varying definitions of sepsis (eg, different causative pathogens, divergent sepsis definitions, and variations in the examined indications such as sepsis, septic shock, or sepsis-related organ dysfunction). The same limitation applies to the databases used for training and validation, which differed substantially in size. In addition, no assessment of the applied MLA methods was conducted, nor was the level of maturity of the individual MLAs explicitly considered. Furthermore, due to the heterogeneity of the included studies, no formal quality assessment was conducted. Rather, the present review was designed to exploratively map and comparatively present the entirety of available evidence in order to identify research gaps, without imposing methodological restrictions on the literature to be included [-]. Finally, a methodological limitation should be noted: Research conducted on institutional websites could only be partially conducted for Estonia, Denmark, and Spain due to language restrictions (English and German), as some stakeholder websites were available exclusively in the respective national languages. The utilization of translation tools was deliberately avoided, as the inclusion of material that none of the authors could fully comprehend and critically appraise in the original language was considered methodologically inappropriate.
The search restriction of 15 years should not be considered a limitation. The inclusion period was defined in consultation with ML experts (NT, HN), and algorithms developed prior to the review period (January 1, 2008-March 2, 2023) were predominantly anticipated to be (1) rule-based systems and/or nonblack-box systems for the users. Both types of algorithms are outside the scope of this review. Moreover, an initial limited search in the databases MEDLINE via PubMed and Embase, which accounted for approximately 75% of the screened literature (), indicated that only a marginal proportion of articles relevant to the research question were published before 2008. Consequently, the time restriction is unlikely to have affected the identification of relevant literature.
Conclusion
The findings of this scoping review highlight the considerable medical relevance of AI-based CDSS in sepsis care. These systems offer benefits across the entire patient care pathway, from early detection and risk stratification to individualized therapy and various improved outcomes. AI-based CDSS has shown the ability to predict sepsis, septic shock, and sepsis-related organ dysfunction, enabling earlier initiation of treatment, prioritization of high-risk patients, and tailored therapeutic strategies. In addition to supporting earlier and more targeted interventions, AI-based CDSS contribute to better clinical outcomes, including improved SOFA scores, reduced LOS both in general wards and ICUs, and lower mortality rates. They may also help reduce readmission rates among sepsis patients, further enhancing long-term care quality. With their transformative potential, AI-based CDSS could fundamentally improve the global management of sepsis. However, further research is needed to optimize the development, implementation, and clinical application of these systems to maximize patient benefits and further improve outcomes for sepsis patients in the future. This is particularly important given the highly heterogeneous evidence base, with a substantial proportion of studies relying on retrospective data, as the results of the included studies cannot be directly generalized or applied without caution.
Acknowledgments
The authors acknowledge the support of the Open Access Publication Fund of the University of Duisburg-Essen and thank Mrs Katrin Wibker for her assistance in quality control during the development of the search string for the scoping review.
Funding
This research is part of a wider research project (Multimethod Research Project on Requirements for Artificial Intelligence-Based Clinical Decision Support Systems Using the Medical Example of Sepsis (="“KI@work"”)) funded by the German Federal Joint Committee (funding code: 01VSF22050 –KI@work). The funders had no influence in the study design, conduct of the study, or the decision to publish or prepare the manuscript.
Authors' Contributions
Conceptualization: PR (lead), GDG (equal), NB (equal), MA (supporting), HN (supporting), LP (supporting), PH (supporting), NT (supporting), MT (supporting), FMB (supporting), JW (supporting)
Data curation: PR (lead), GDG (equal)
Formal analysis: PR (lead), GDG (supporting), NB (supporting)
Funding acquisition: GDG, JW, MA, HN, NT, MT, FMB, NB
Investigation: PR (lead), GDG (equal)
Methodology: PR (lead), GDG (supporting), NB (supporting), NT (supporting), HN (supporting)
Project administration: NB
Supervision: NB
Validation: PR (lead), GDG (equal), NB (equal)
Visualization: PR
Writing – original draft: PR
Writing – review & editing: PR (lead), GDG (supporting), NB (supporting), MA (supporting), HN (supporting), LP (supporting), PH (supporting), NT (supporting), MT (supporting), FMB (supporting), JW (supporting)
Conflicts of Interest
None declared.
References
- Rhee C, Jones TM, Hamad Y, et al. Prevalence, underlying causes, and preventability of sepsis-associated mortality in US acute care hospitals. JAMA Netw Open. Feb 1, 2019;2(2):e187571. [CrossRef] [Medline]
- Global report on the epidemiology and burden of sepsis current evidence, identifying gaps and future directions. World Health Organization. 2020. URL: https://iris.who.int/server/api/core/bitstreams/d4ce3613-bf94-4205-85c8-14f3fc0609db/content [Accessed 2025-12-04]
- Singer M, Deutschman CS, Seymour CW, et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. Feb 23, 2016;315(8):801-810. [CrossRef] [Medline]
- Ericson O, Hjelmgren J, Sjövall F, Söderberg J, Persson I. The potential cost and cost-effectiveness impact of using a machine learning algorithm for early detection of sepsis in intensive care units in Sweden. J Health Econ Outcomes Res. 2022;9(1):101-110. [CrossRef] [Medline]
- Arriaga-Pizano LA, Gonzalez-Olvera MA, Ferat-Osorio EA, et al. Accurate diagnosis of sepsis using a neural network: Pilot study using routine clinical variables. Comput Methods Programs Biomed. Oct 2021;210:106366. [CrossRef] [Medline]
- Vincent JL, Martin GS, Levy MM. qSOFA does not replace SIRS in the definition of sepsis. Crit Care. Jul 17, 2016;20(1):210. [CrossRef] [Medline]
- Dellinger RP, Levy MM, Rhodes A, et al. Surviving sepsis campaign: international guidelines for management of severe sepsis and septic shock: 2012. Crit Care Med. Feb 2013;41(2):580-637. [CrossRef] [Medline]
- Kumar A, Roberts D, Wood KE, et al. Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock. Crit Care Med. Jun 2006;34(6):1589-1596. [CrossRef] [Medline]
- Brunkhorst FM, Weigand MA, Pletz M, et al. S3 Guideline Sepsis-prevention, diagnosis, therapy, and aftercare: long version. Med Klin Intensivmed Notfmed. May 2020;115(Suppl 2):37-109. [CrossRef] [Medline]
- Sakr Y, Jaschinski U, Wittebole X, et al. Sepsis in intensive care unit patients: worldwide data from the intensive care over nations audit. Open Forum Infect Dis. Dec 2018;5(12):ofy313. [CrossRef] [Medline]
- Arwyn-Jones J, Brent AJ. Sepsis. Surgery (Oxford). Jan 2019;37(1):1-8. [CrossRef]
- Boussina A, Shashikumar SP, Malhotra A, et al. Impact of a deep learning sepsis prediction model on quality of care and survival. NPJ Digit Med. Jan 23, 2024;7(1):14. [CrossRef] [Medline]
- Alanazi A, Aldakhil L, Aldhoayan M, Aldosari B. Machine learning for early prediction of sepsis in intensive care unit (ICU) patients. Med Bogota Colomb. Jul 9, 2023;59(7):1276. [CrossRef]
- Nemati S, Holder A, Razmi F, Stanley MD, Clifford GD, Buchman TG. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med. Apr 2018;46(4):547-553. [CrossRef] [Medline]
- Artificial intelligence and machine learning (AI/ML)-enabled medical devices. US Food and Drug Administration. 2025. URL: https://www.fda.gov/medical-devices/software-medical-device-samd/artificial-intelligence-and-machine-learning-aiml-enabled-medical-devices [Accessed 2025-04-29]
- O’Sullivan C, Tsai DHT, Wu ICY, et al. Machine learning applications on neonatal sepsis treatment: a scoping review. BMC Infect Dis. Jun 29, 2023;23(1):441. [CrossRef] [Medline]
- Heneghan JA, Walker SB, Fawcett A, et al. The pediatric data science and analytics subgroup of the pediatric acute lung injury and sepsis investigators network: use of supervised machine learning applications in pediatric critical care medicine research. Pediatr Crit Care Med. Apr 1, 2024;25(4):364-374. [CrossRef] [Medline]
- Susanto AP, Lyell D, Widyantoro B, Berkovsky S, Magrabi F. Effects of machine learning-based clinical decision support systems on decision-making, care delivery, and patient outcomes: a scoping review. J Am Med Inform Assoc. Nov 17, 2023;30(12):2050-2063. [CrossRef] [Medline]
- Wan YKJ, Wright MC, McFarland MM, et al. Information displays for automated surveillance algorithms of in-hospital patient deterioration: a scoping review. J Am Med Inform Assoc. Dec 22, 2023;31(1):256-273. [CrossRef] [Medline]
- Aromataris E, Lockwood C, Porritt K, Pilla B, Jordan Z, editors. JBI Manual for Evidence Synthesis JBI. 2024. [CrossRef] ISBN: 978-0-6488488-0-6
- Arksey H, O’Malley L. Scoping studies: towards a methodological framework. Int J Soc Res Methodol. Feb 2005;8(1):19-32. [CrossRef]
- Levac D, Colquhoun H, O’Brien KK. Scoping studies: advancing the methodology. Implement Sci. Sep 20, 2010;5(5):69. [CrossRef] [Medline]
- Tricco AC, Lillie E, Zarin W, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 2, 2018;169(7):467-473. [CrossRef] [Medline]
- Raszke P, Giebel GD, Abels C, et al. User-oriented requirements for artificial intelligence-based clinical decision support systems in sepsis: protocol for a multimethod research project. JMIR Res Protoc. Jan 30, 2025;14:e62704. [CrossRef] [Medline]
- S3-leitlinie: sepsis – prävention, diagnose, therapie und nachsorge – update 2025 [Website in German]. AWMF-register der leitlinien der wissenschaftlichen medizinischen fachgesellschaften. 2025. URL: https://register.awmf.org/assets/guidelines/079-001l_S3_Sepsis-Praevention-Diagnose-Therapie-Nachsorge_2025-07.pdf [Accessed 2025-12-12]
- S2k-leitlinie 024-025 „sepsis bei kindern jenseits der neonatalperiode [Website in German]. AWMF-Register der Leitlinien der wissenschaftlichen medizinischen Fachgesellschaften. 2015. URL: https://register.awmf.org/assets/guidelines/024-025l_S2k_Sepsis_nach_Neonatalperiode_2016-04-abgelaufen.pdf [Accessed 2025-01-20]
- S2k-leitlinie 024-008: bakterielle infektionen bei neugeborenen [Website in German]. AWMF-Register der Leitlinien der wissenschaftlichen medizinischen Fachgesellschaften. 2021. URL: https://register.awmf.org/assets/guidelines/024-008l_S2k_Bakterielle_Infektionen_Neugeborene_2021-03-abgelaufne.pdf [Accessed 2025-01-20]
- Adams R, Henry KE, Sridharan A, et al. Prospective, multi-site study of patient outcomes after implementation of the TREWS machine learning-based early warning system for sepsis. Nat Med. Jul 2022;28(7):1455-1460. [CrossRef] [Medline]
- Bologheanu R, Kapral L, Laxar D, et al. Development of a reinforcement learning algorithm to optimize corticosteroid therapy in critically ill patients with sepsis. J Clin Med. Feb 14, 2023;12(4):1513. [CrossRef] [Medline]
- Bunn C, Kulshrestha S, Boyda J, et al. Application of machine learning to the prediction of postoperative sepsis after appendectomy. Surgery. Mar 2021;169(3):671-677. [CrossRef] [Medline]
- Burdick H, Pino E, Gabel-Comeau D, et al. Effect of a sepsis prediction algorithm on patient mortality, length of stay and readmission: a prospective multicentre clinical outcomes evaluation of real-world patient data from US hospitals. BMJ Health Care Inform. Apr 2020;27(1):e100109. [CrossRef] [Medline]
- Ferreira LD, McCants D, Velamuri S. Using machine learning for process improvement in sepsis management. J Healthc Qual Res. 2023;38(5):304-311. [CrossRef] [Medline]
- Garnica O, Gómez D, Ramos V, Hidalgo JI, Ruiz-Giardín JM. Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers. EPMA J. Sep 2021;12(3):365-381. [CrossRef] [Medline]
- Ginestra JC, Giannini HM, Schweickert WD, et al. Clinician perception of a machine learning-based early warning system designed to predict severe sepsis and septic shock. Crit Care Med. Nov 2019;47(11):1477-1484. [CrossRef] [Medline]
- Goh KH, Wang L, Yeow AYK, et al. Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat Commun. Jan 29, 2021;12(1):711. [CrossRef] [Medline]
- Henry KE, Adams R, Parent C, et al. Factors driving provider adoption of the TREWS machine learning-based early warning system and its effects on sepsis treatment timing. Nat Med. Jul 2022;28(7):1447-1454. [CrossRef] [Medline]
- Joshi M, Mecklai K, Rozenblum R, Samal L. Implementation approaches and barriers for rule-based and machine learning-based sepsis risk prediction tools: a qualitative study. JAMIA Open. Jul 2022;5(2):ooac022. [CrossRef] [Medline]
- Kausch SL, Moorman JR, Lake DE, Keim-Malpass J. Physiological machine learning models for prediction of sepsis in hospitalized adults: an integrative review. Intensive Crit Care Nurs. Aug 2021;65:103035. [CrossRef] [Medline]
- Komorowski M, Celi LA, Badawi O, Gordon AC, Faisal AA. The artificial intelligence clinician learns optimal treatment strategies for sepsis in intensive care. Nat Med. Nov 2018;24(11):1716-1720. [CrossRef] [Medline]
- Kuo YY, Huang ST, Chiu HW. Applying artificial neural network for early detection of sepsis with intentionally preserved highly missing real-world data for simulating clinical situation. BMC Med Inform Decis Mak. Oct 22, 2021;21(1):290. [CrossRef] [Medline]
- Ma P, Liu J, Shen F, et al. Individualized resuscitation strategy for septic shock formalized by finite mixture modeling and dynamic treatment regimen. Crit Care. Jul 12, 2021;25(1):243. [CrossRef]
- Mao Q, Jay M, Hoffman JL, et al. Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU. BMJ Open. Jan 26, 2018;8(1):e017833. [CrossRef] [Medline]
- McCoy A, Das R. Reducing patient mortality, length of stay and readmissions through machine learning-based sepsis prediction in the emergency department, intensive care unit and hospital floor units. BMJ Open Qual. 2017;6(2):e000158. [CrossRef] [Medline]
- Ocampo-Quintero N, Vidal-Cortés P, del Río Carbajo L, Fdez-Riverola F, Reboiro-Jato M, Glez-Peña D. Enhancing sepsis management through machine learning techniques: a review. Medicina Intensiva (English Edition). Mar 2022;46(3):140-156. [CrossRef]
- Rogers P, Boussina AE, Shashikumar SP, Wardi G, Longhurst CA, Nemati S. Optimizing the implementation of clinical predictive models to minimize national costs: sepsis case study. J Med Internet Res. Feb 13, 2023;25:e43486. [CrossRef] [Medline]
- Roggeveen L, El Hassouni A, Ahrendt J, et al. Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis. Artif Intell Med. Feb 2021;112:102003. [CrossRef] [Medline]
- Scherer JDS, Pereira JS, Debastiani MS, Bica CG. Beyond technology: can artificial intelligence support clinical decisions in the prediction of sepsis? Rev Bras Enferm. 2022;75(5):e20210586. [CrossRef] [Medline]
- Schinkel M, Paranjape K, Nannan Panday RS, Skyttberg N, Nanayakkara PWB. Clinical applications of artificial intelligence in sepsis: a narrative review. Comput Biol Med. Dec 2019;115:103488. [CrossRef] [Medline]
- Shimabukuro DW, Barton CW, Feldman MD, Mataraso SJ, Das R. Effect of a machine learning-based severe sepsis prediction algorithm on patient survival and hospital length of stay: a randomised clinical trial. BMJ Open Respir Res. 2017;4(1):e000234. [CrossRef] [Medline]
- Voermans AM, Mewes JC, Broyles MR, Steuten LMG. Cost-effectiveness analysis of a procalcitonin-guided decision algorithm for antibiotic stewardship using real-world U.S. hospital data. OMICS. Oct 2019;23(10):508-515. [CrossRef] [Medline]
- Wu M, Du X, Gu R, Wei J. Artificial intelligence for clinical decision support in sepsis. Front Med (Lausanne). 2021;8:665464. [CrossRef] [Medline]
- Burdick H, Pino E, Gabel-Comeau D, et al. Evaluating a sepsis prediction machine learning algorithm in the emergency department and intensive care unit: a before and after comparative study. Clinical Trials. [CrossRef]
- Henry KE, Hager DN, Pronovost PJ, Saria S. A targeted real-time early warning score (TREWScore) for septic shock. Sci Transl Med. Aug 5, 2015;7(299):299ra122. [CrossRef] [Medline]
- Hsu KC, Cho DY, Hsueh PR, Yu J, Sun PR. Development of a comprehensive intelligent antimicrobial system: an epochal, fast, and digitally precise prediction of therapeutic antibiotics case study. China Medical University Hospital, Taiwan. 2023. URL: https://oregon.himss.org/sites/hde/files/media/file/2023/06/16/3.-iams-davies-award_061423guanwangbanv2.pdf
- van der Vegt AH, Scott IA, Dermawan K, Schnetler RJ, Kalke VR, Lane PJ. Deployment of machine learning algorithms to predict sepsis: systematic review and application of the SALIENT clinical AI implementation framework. J Am Med Inform Assoc. Jun 20, 2023;30(7):1349-1361. [CrossRef] [Medline]
- Dascena pledges support for sepsis allianc clinical community. Sepsis alliance. 2021. URL: https://www.sepsis.org/news/dascena-pledges-support-for-sepsis-alliance-clinical-community/ [Accessed 2025-01-20]
- Saving Lives With Enhanced Sepsis Treatment Protocol. Healthcare information and management systems society (HIMSS). 2020. URL: https://oregon.himss.org/news/saving-lives-enhanced-sepsis-treatment-protocol [Accessed 2025-01-20]
- Vincent JL, de Mendonca A, Cantraine F, et al. Use of the SOFA score to assess the incidence of organ dysfunction/failure in intensive care units. Crit Care Med. Nov 1998;26(11):1793-1800. [CrossRef]
- Sutton RT, Pincock D, Baumgart DC, Sadowski DC, Fedorak RN, Kroeker KI. An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ Digit Med. 2020;3:17. [CrossRef] [Medline]
- Moor M, Rieck B, Horn M, Jutzeler CR, Borgwardt K. Early prediction of sepsis in the ICU using machine learning: a systematic review. Front Med (Lausanne). 2021;8:607952. [CrossRef] [Medline]
- Venkatesh V, Morris MG, Davis GB, Davis FD. User acceptance of information technology: toward a unified view1. MIS Q. Sep 1, 2003;27(3):425-478. [CrossRef]
- Dingel J, Kleine AK, Cecil J, Sigl AL, Lermer E, Gaube S. Predictors of health care practitioners’ intention to use AI-enabled clinical decision support systems: meta-analysis based on the unified theory of acceptance and use of technology. J Med Internet Res. Aug 5, 2024;26:e57224. [CrossRef] [Medline]
- Giebel GD, Raszke P, Nowak H, et al. Problems and barriers related to the use of AI-based clinical decision support systems: interview study. J Med Internet Res. Feb 3, 2025;27:e63377. [CrossRef] [Medline]
- Giebel GD, Raszke P, Nowak H, et al. Improving AI-based clinical decision support systems and their integration into care from the perspective of experts: interview study among different stakeholders. JMIR Med Inform. Jul 7, 2025;13:e69688. [CrossRef] [Medline]
- Sandhu S, Lin AL, Brajer N, et al. Integrating a machine learning system into clinical workflows: qualitative study. J Med Internet Res. Nov 19, 2020;22(11):e22421. [CrossRef] [Medline]
- Davis FD, Bagozzi RP, Warshaw PR. User acceptance of computer technology: a comparison of two theoretical models. Manage Sci. Aug 1989;35(8):982-1003. [CrossRef]
- Manetti S, Cumetti M, De Benedictis A, Lettieri E. Adoption of novel biomarker test parameters with machine learning-based algorithms for the early detection of sepsis in hospital practice. J Nurs Manag. Nov 2022;30(8):3754-3764. [CrossRef] [Medline]
- Sendak M, Elish MC, Gao M, et al. “The human body is a black box”: supporting clinical decision-making with deep learning. 2020. Presented at: FAT* ’20: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency; Jan 27-30, 2020:99-109; Barcelona, Spain. URL: https://dl.acm.org/doi/10.1145/3351095.3372827 [Accessed 2026-01-02]
- Kersting C, Kneer M, Barzel A. Patient-relevant outcomes: what are we talking about? A scoping review to improve conceptual clarity. BMC Health Serv Res. Jun 29, 2020;20(1):596. [CrossRef] [Medline]
- Nair AS. Publication bias - importance of studies with negative results! Indian J Anaesth. Jun 2019;63(6):505-507. [CrossRef] [Medline]
Abbreviations
| 3PM: predictive, preventive, and personalized medicine |
| AI: artificial intelligence |
| APACHE II: Acute Physiology and Chronic Health Evaluation II |
| CDSS: clinical decision support system |
| ICU: intensive care unit |
| LOS: length of stay |
| ML: machine learning |
| MLA: machine learning algorithm |
| NEWS2: National Early Warning Score 2 |
| OECD: Organisation for Economic Co-operation and Development |
| PCC: Population, Concept, Context |
| PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews |
| qSOFA score: Quick Sequential Organ Failure Assessment Score |
| SIRS: systemic inflammatory response syndrome |
| SOFA score: Sequential Organ Failure Assessment Score |
| TREWS: targeted real-time early warning system |
| UTAUT: unified theory of acceptance and use of technology |
| WHO: World Health Organization |
Edited by Javad Sarvestan; submitted 30.Apr.2025; peer-reviewed by Ahmed G Gad, Alessio Staffini, Sven Meister; final revised version received 25.Nov.2025; accepted 26.Nov.2025; published 26.Jan.2026.
Copyright© Pascal Raszke, Godwin Denk Giebel, Jürgen Wasem, Michael Adamzik, Hartmuth Nowak, Lars Palmowski, Philipp Heinz, Nina Timmesfeld, Marianne Tokic, Frank Martin Brunkhorst, Nikola Blase. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 26.Jan.2026.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

