Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/74119, first published .
Advancing Real-World Evidence Through a Federated Health Data Network (EHDEN): Descriptive Study

Advancing Real-World Evidence Through a Federated Health Data Network (EHDEN): Descriptive Study

Advancing Real-World Evidence Through a Federated Health Data Network (EHDEN): Descriptive Study

1OHDSI Collaborators, New York, NY, United States

2Department of Medical Informatics, Erasmus MC, Rotterdam, The Netherlands

3Johnson & Johnson (United States), 920 US Route 202, Raritan, NJ, United States

4Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA, United States

5Synapse (Spain), Madrid, Spain

6Department of Biomedical Informatics, Columbia University, New York, NY, United States

*these authors contributed equally

Corresponding Author:

Clair Blacketer, MPH


Background: Real-world data (RWD) are increasingly used in health research and regulatory decision-making to assess the effectiveness, safety, and value of interventions in routine care. However, the heterogeneity of European health care systems, data capture methods, coding standards, and governance structures poses challenges for generating robust and reproducible real-world evidence. The European Health Data & Evidence Network (EHDEN) was established to address these challenges by building a large-scale federated data infrastructure that harmonizes RWD across Europe.

Objective: This study aims to describe the composition and characteristics of the databases harmonized within EHDEN as of September 2024. We seek to provide transparency regarding the types of RWD available and their potential to support collaborative research and regulatory use.

Methods: EHDEN recruited data partners through structured open calls. Selected data partners received funding and technical support to harmonize their data to the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM), with assistance from certified small-to-medium enterprises trained through the EHDEN Academy. Each data source underwent an extract-transform-load process and data quality assessment using the data quality dashboard. Metadata—including country, care setting, capture method, and population criteria—were compiled in the publicly accessible EHDEN Portal.

Results: As of September 1, 2024, the EHDEN Portal includes 210 harmonized data sources from 30 countries. The highest representation comes from Italy (13%), Great Britain (12.5%), and Spain (11.5%). The mean number of persons per data source is 2,147,161, with a median of 457,664 individuals. Regarding care setting, 46.7% (n=98) of data sources reflect data exclusively from secondary care, 42.4% (n=89) from mixed care settings (both primary and secondary), and 11% (n=23) from primary care only. In terms of population inclusion criteria, 55.7% (n=117) of data sources include individuals based on health care encounters, 32.9% (n=69) through disease-specific data collection, and 11.4% (n=24) via population-based sources. Data capture methods also vary, with electronic health records (EHRs) being the most common. A total of 74.7% (n=157) of data sources use EHRs, and more than half of those (n=85) rely on EHRs as their sole method of data collection. Laboratory data are used in 29.5% (n=62) of data sources, although only one relies exclusively on laboratory data. Most laboratory-based data sources combine this method with other forms of data capture.

Conclusions: EHDEN is the largest federated health data network in Europe, enabling standardized, General Data Protection Regulation–compliant analysis of RWD across diverse care settings and populations. This descriptive summary of the network’s data sources enhances transparency and supports broader efforts to scale federated research. These findings demonstrate EHDEN’s potential to enable collaborative studies and generate trusted evidence for public health and regulatory purposes.

J Med Internet Res 2025;27:e74119

doi:10.2196/74119

Keywords



Real-world data (RWD) has become a cornerstone in health care research, especially in regulatory science, due to its ability to capture insights from diverse patient populations and clinical settings. Unlike data generated through traditional randomized controlled trials, which often have stringent inclusion criteria, RWD reflects the everyday health care experiences of a broader patient base [1-5]. This breadth offers a richer context for understanding drug safety and effectiveness, guiding postauthorization safety monitoring, informing risk-benefit evaluations, and supporting regulatory decisions [6]. Regulators, industry, and academics alike rely on real-world evidence (RWE) derived from RWD to answer critical questions about health care interventions in clinical care settings that are more representative of routine practice [7-9].

Europe’s health care landscape presents both challenges and opportunities for generating RWD [10]. Its diversity spans many different health systems, terminology systems, and data collection practices, with variability in health care delivery and data availability across countries. This heterogeneity complicates large-scale representative research but also offers a unique opportunity to study diverse populations [8,11-13]. However, capturing this potential requires overcoming technical, operational, and methodological barriers to ensure data harmonization and quality. A federated network is particularly well suited to Europe’s fragmented health care landscape, where legal, linguistic, and governance diversity necessitate a model that supports local control while enabling cross-border collaboration.

Federated data networks, like the European Health Data & Evidence Network (EHDEN), are well-suited for Europe’s decentralized data landscape [14-19]. In the context of EHDEN, a federated network refers to a collaboration of independently governed data sources that retain full control of their data locally, preserving the autonomy and governance policies of individual data holders. This approach allows for multidatabase studies across diverse populations without requiring centralized data access or query execution, ensuring that personal health information does not leave its original source. By design, this method complies with the General Data Protection Regulation, as it avoids centralizing or transferring personal data and supports the principles of data minimization, purpose limitation, and local control. It is important to note that privacy-preserving practices are also in place for studies conducted using the network. Data partners (DPs) only share aggregate results, typically high-level outputs such as hazard ratios, after applying a minimum cell size threshold (k-anonymity, commonly set to 5) to suppress potentially re-identifiable results.

EHDEN was established as an Innovative Medicines Initiative (IMI), now Innovative Health Initiative, public-private partnership in November 2018 to overcome the challenges and transform how health data is used in Europe [20,21]. The project built a federated data network that standardizes health data across participating sources, making data analysis more feasible and consistent. By harmonizing data and implementing quality assurance protocols, EHDEN enhances the usability and comparability of RWD across Europe. This paper provides an overview of the EHDEN network, examining its data harmonization efforts, quality control processes, and the range of data sources included in the network. Through this discussion, we aim to highlight the scope of RWD available across Europe and its potential for advancing health care research and regulatory decision-making.


Common Data Model

As the foundation for its network, EHDEN adopted the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) [22-24]. The OMOP CDM is widely recognized for its “structure + content” approach whereby the tables and fields (structure) as well as the vocabulary (content) are standardized, allowing for integration of data across multiple systems while maintaining data integrity. The model also supports a wide range of data types, including electronic health records (EHRs), claims data, and patient registries.

The OMOP CDM is maintained by the Observational Health Data Science and Informatics (OHDSI) community, an open science effort that aims to improve health by empowering a community to collaboratively generate the evidence that promotes better health decisions and better care [25]. The open-source nature of OHDSI allows for continuous community-driven improvements, making it adaptable to emerging health care needs [26-28].

Data Partner Calls

Any organization with access to a data source in Europe could apply to be included in the EHDEN network. In this context, a data source is defined as a distinct repository of health care-related data pertaining to a specific set of individuals. Except for the COVID-19 Rapid Collaboration Call, the 7 DP calls executed between September 2019 and October 2022 were aligned to similar timelines for DP identification, grant awarding, and initiation of data harmonization (Figure 1). In each call, candidate partner organizations with access to one or more electronic health care databases applied to the EHDEN Harmonization Fund for a grant to implement or enhance their database (Multimedia Appendix 1). DPs were selected based on 3 criteria: data impact (size, coverage, quality), network impact (track record, uniqueness within network), and readiness (willingness to participate, governance) (Multimedia Appendix 2), reviewed by a Data Source Prioritization Committee. Each application was reviewed and scored by 2 reviewers, and the top applicants per round were awarded a grant.

Figure 1. Timeline of data call 7 (2022‐2023) within the European Health Data & Evidence Network. The figure illustrates key milestones in the selection, contracting, and data harmonization process for data partners onboarded during this call. SME: small-to-medium enterprise

Data Standardization

Once DPs were identified and grants awarded, each data source underwent standardization to the OMOP CDM. A crucial factor in EHDEN’s long-term sustainability and success was the recruitment and training of local small-to-medium enterprises (SMEs). These SMEs were brought on board through separate calls from the EHDEN consortium and certified via the EHDEN Academy education program concluded by an onsite or web-based training. SMEs played a pivotal role in supporting DPs throughout the extract, transform, and load (ETL) process by providing guidance and expertise. In total, 64 SMEs across 22 countries were certified by EHDEN to support DPs.

The ETL process followed by the DPs and supported by the SMEs was largely uniform, as outlined by Voss et al [24], and involved four key steps: (1) summarizing the native data, (2) creating the ETL specification, (3) mapping source vocabulary codes, and (4) implementing the ETL. This standardized approach ensured transparency in the followed procedure and adherence to the conventions in converting data sources to the CDM, while also allowing DPs to benefit from the SMEs’ specialized knowledge.

To promote semantic interoperability across heterogeneous health care systems, all source codes for diagnoses, medications, procedures, and measurements were mapped to standardized vocabularies (eg, SNOMED CT [Systematized Nomenclature of Medicine – Clinical Terms], RxNorm [Prescription Normalized Names], LOINC [Logical Observation Identifiers Names and Codes]) as required by the OMOP CDM. For example, the UK Biobank contributed data that included SNOMED CT-coded diagnoses from EHRs alongside custom-coded fields for self-reported conditions and blood pressure measurements. Mapping these nonstandard elements involved a combination of automated matching and manual curation followed by expert validation. This process facilitated consistent interpretation of clinical concepts across countries and enhanced the analytical interoperability of the network [29].

Payments were structured based on output; to receive full funding, DPs were required to meet 3 different milestones. The ETL specification document entitled DPs to 30%, ETL implementation and infrastructure released the next 40%, and the final 30% was received by the DP after final inspection of the harmonized data (Multimedia Appendix 3).

Data Quality

Each milestone was reviewed by an EHDEN consortium member who was part of the Milestone review committee. The ETL specification document required by milestone 1 was evaluated to ensure the mapping adhered to the OMOP CDM conventions and that the DPs or SMEs had a good understanding of the CDM and their own native data [23,30]. Milestone 2, the ETL implementation, had multiple review steps. The infrastructure was investigated to be sure the DPs were using a supported database platform [31]. The vocabulary mapping was evaluated to ensure most, if not all, source codes were included. The data quality dashboard (DQD) was developed by EHDEN Work Package 5 to provide a standard structure for quality assurance [32]. It was used by DPs throughout the ETL process to continually improve the standardized data sources [33]. As described by Voss et al [24], the median number of times a DP ran the DQD was 3 (IQR 2‐7). In general, conformance issues to the OMOP CDM were identified and addressed in initial runs, with more complex site-specific vocabulary mapping issues addressed in subsequent runs. DPs used the default failure thresholds in the first run. These were updated to reflect the nuances of each data source in later runs of the software. In milestone 3, final DQD results as well as the CDM Inspection Report were reviewed [34]. Once approved, the DP then entered their information into the EHDEN portal, an online platform open to the public designed to catalog metadata on each data source [35].

Analyses

An individual data source was considered one entry in the EHDEN portal. Data sources were categorized based on country, person count, the levels of care represented (primary, secondary, or mixed), why a person was included, and how data were captured. These categories were ascertained from the data source description and metadata provided to the EHDEN portal and verified with the DPs.

There are 3 reasons persons could be included in an EHDEN data source (person inclusion), as defined by population, where a person enters the data source because they live in a certain geographical location or because they are registered with a practice or insurer; encounter, where a person enters the data source upon a visit to a health care provider for any medical reason; or disease, where a person enters the data source when satisfying specific criteria (ie, a person has a specific medical condition). The most restrictive reason was chosen as the classification for each data source.

We identified which types of data capture methods each source contained; it could be one or more of the following: EHR, a bill or adjudicated claim record for health services rendered (claim), measurements taken and results recorded (laboratory), a set of required information collected about participants in a registry (case report form), patient-reported data (survey), documents analyzed by pulling structured data from unstructured data using a natural language processing algorithm, or death information from an official source or government entity (death certificate). If the data source did not provide this information, they were categorized as unknown.

Ethical Considerations

Patients or the public were not involved in the design, conduct, reporting, or dissemination plans of this research.


As of September 1, 2024, there are 210 data sources in the EHDEN portal. Figure 2 shows all countries and the number of data sources available in each. The data sources span 30 countries, with the largest representation from Italy, Great Britain, and Spain, with 13%, 12.5%, and 11.5% of the total data sources in the network, respectively. The mean number of persons per data source is 2,147,161 and the median number of persons is 457,664.

Figure 2. Geographic distribution and characteristics of data sources included in the European Health Data & Evidence Network as of September 1, 2024. The map displays country-level data density based on the number of data sources relative to national population size. Overlaid symbols represent key metadata for each source, including total person count, care setting (primary, secondary, or mixed), data capture methods (eg, electronic health records, laboratory, or claims), and the reason for person inclusion (eg, encounter-based, disease-specific, or population-based).

Table 1 provides the complete list of data sources and their attributes. One row in the table equates to 1 data source. The first column lists the DP, which is the name of the institution or organization that is the custodian of the data source. The individual data sources are identified by an acronym, which is also how they are identified in the EHDEN portal. Country of origin is represented by the 2-digit country code. The number of persons, the person inclusion method, and care level are also provided. Each data capture category has its own column in the table. If a data source uses one of the capture methods, that box is filled with a check mark symbol in the table.

Table 1. Overview of 210 standardized real-world data sources in the European Health Data & Evidence Network as of September 1, 2024. The table includes the full list of data sources by country, total number of persons represented, person inclusion, care level, and data capture methods. These attributes provide essential context for understanding the scope and scale of data available for real-world evidence generation within the European Health Data & Evidence Network.
Data partnerData source acronymCountryPerson countPerson inclusionCare levelEHRaClaimLabCase report formSurveyNLPbDeath certificateUnknown
Centro Clínico Académico – Braga, Associação (2CA-Braga)2CA-BragaPortugal10,70,217EncounterSecondary
INCLIVAABUCASISSpain40,14,819EncounterMixed
The wellbeing services county of Southwest Finland, VarHaACIFinland7,65,000EncounterSecondary
Innovative Medical Research SAADWH IMRGreece6,00,000EncounterPrimary
Fondazione Casa Sollievo della SofferenzaaGMSItaly2140DiseaseSecondary
Akrivia HealthAKRDBGreat Britain30,85,560EncounterSecondary
Amsterdam UMCAmsterdamUMCdbNetherlands20,109EncounterSecondary
Stichting VUmcAMYPAD PNHSNetherlands3368DiseaseMixed
AZIENDA OSPEDALIERO UNIVERSITARIA SAN LUIGI GONZAGAAOU-SANLUIGIItaly1,84,520EncounterSecondary
University Hospital of ParmaAOUPRItaly5,73,205EncounterSecondary
Azienda Ospedaliera Universitaria Integrata VeronaAOVRItaly5,12,000EncounterSecondary
Assistance Publique - Hopitaux de MarseilleAP-HMFrance27,92,497EncounterMixed
APDPAPDPPortugal2,42,000EncounterSecondary
Azienda Ospedaliero-Universitaria di ModenaAPUMItaly3272DiseaseMixed
Servei Català de la SalutAQUAS - CatSalut CMBDSpain68,81,752EncounterSecondary
FONDAZIONE TOSCANA GABRIELE MONASTERIO PER LA RICERCA MEDICA E DI SANITA PUBBLICA (FTGM)ARCAItaly4,64,194EncounterMixed
ASL Roma 1ASL Roma 1Italy11,98,036EncounterMixed
Assuta medical centersAssutaSurgicalIsrael7,76,538EncounterSecondary
ATS BergamoATS-BGItaly13,00,000PopulationMixed
Institute of RheumatologyATTRACzech Republic8006DiseaseMixed
Marco Massari (IRCSSE)AUSL-REItaly43,564DiseaseMixed
Az OostendeAZ OostendeBelgium3,71,097EncounterSecondary
AZ DeltaAZDDBBelgium9,90,559EncounterSecondary
VZW AZ GroeningeAZGBelgium18,571EncounterSecondary
AZ KlinaAZKBelgium5,06,770EncounterSecondary
AZ Maria MiddelaresAZMMBelgium95,341EncounterSecondary
Servicio Navarro de Salud Osasunbidea (SNS-O)BARDENASpain19,72,272EncounterMixed
Barts Health NHS TrustBartsGreat Britain23,12,983EncounterSecondary
National Scientific Program “E-Health in Bulgaria”BDRBulgaria5,01,065DiseaseMixed
Agencia Española de Medicamentos y Productos Sanitarios, AEMPSBIFAPSpain########PopulationPrimary
Instituto Aragonés de Ciencias de la Salud (IACS)BIGANSpain22,91,148EncounterMixed
Instituto de Medicina MolecularBiobank_iMM_ReumaPortugal592DiseaseMixed
Bnai Zion Medical Research Foundation and Infrastructure Development Health ServicesBZMCIsrael10,68,599EncounterSecondary
Inspire-srlCasertaDBItaly13,02,318PopulationMixed
Connected BradfordcBradfordGreat Britain12,00,677EncounterMixed
Clinical Center of MontenegroCCMEMontenegro2,02,322EncounterSecondary
Casa di Cura Privata del Policlinico (CCPP)CCPPItaly16,218EncounterMixed
ISMETTcdm_ismettItaly24,269EncounterSecondary
Bordeaux University HospitalCDWbordeauxFrance19,85,011EncounterSecondary
Charité - UniversitätsmedizinCHA-CANGermany2,14,443EncounterSecondary
Charité - UniversitätsmedizinCHA-DIAGermany60,138EncounterSecondary
Charité - UniversitätsmedizinCHA-IBDGermany2471EncounterSecondary
Institute of Social and Preventive Medicine, University of BernChCR and SCCSSSwitzerland12,000DiseaseMixed
Clinical-hospital center ZvezdaraCHCZSerbia5,15,000EncounterSecondary
Clinical Hospital DubravaCHDubrava–IN2Croatia3,11,754EncounterSecondary
Centro Hospitalar Universitário de Coimbra (CHUC)CHUC OphtalmologyPortugal31,507EncounterSecondary
Center Hospitalier Universitaire de ToulouseCHUTFrance30,59,340EncounterSecondary
Modena Oncology Center - Azienda Ospedaliera ModenaCOMNetItaly89,300EncounterSecondary
Clinical Practice Research Datalink (CPRD)CPRD AURUMGreat Britain########PopulationPrimary
Clinical Practice Research Datalink (CPRD)CPRD HESAPC AURUMGreat Britain########PopulationMixed
The Norwegian Cancer RegistryCRNNorway11,56,806DiseaseSecondary
Basilicata Cancer RegistryCROBItaly54,265DiseasePrimary
Krebsregister Rheinland-PfalzCRRLPGermany2,16,174DiseaseMixed
CUFCUF_CRCPortugal1485DiseaseSecondary
DataLochDataLochGreat Britain4,14,038DiseaseSecondary
Center for Surgical Science (CSS)DCCGDenmark76,849DiseaseSecondary
Amsterdam UMCDDWNetherlands1834DiseaseSecondary
Stockholm CREAtinine Measurements ProjectDH-SCREAMSweden30,85,764EncounterMixed
University of Southern DenmarkDHCRDenmark99,930PopulationSecondary
DIGITAL HEALTH SOLUTIONS SADHS BIOGreece21,02,509EncounterPrimary
German Cancer Society (DKG)DKG EDIUMGermany8680DiseaseMixed
German Cancer Society (DKG)DKG PCOGermany49,300DiseaseMixed
Hospital de DeniaDptoSalud-DENIASpain3,56,723EncounterMixed
University of Ulm, ZIBMTDPVGermany6,38,031DiseaseMixed
Research Institute - Hospital de la Santa Creu i Sant PauDW HSCSPSpain13,15,128EncounterMixed
Primary Healthcare Center ZemunDZ ZemunSerbia3,55,000PopulationPrimary
EBMT: The European Society for Blood and Marrow TransplantationEBMTNetherlands8,99,425DiseaseSecondary
European Clinical Research Alliance on Infectious Diseases (ECRAID)ECRAID-Base POS VAPNetherlands563DiseaseSecondary
Center Hospitalier Universitaire de MontpelliereDOL Entrepôt de DOnnées du LanguedocFrance19,30,844EncounterSecondary
Fondazione IRCCS Policlinico San MatteoELISAItaly4,37,482EncounterMixed
EGAS MONIZ HEALTH ALLIANCEEMHA ULSEDVPortugal563EncounterSecondary
EGAS MONIZ HEALTH ALLIANCEEMHA ULSGEPortugal728EncounterSecondary
EGAS MONIZ HEALTH ALLIANCEEMHA ULSRAPortugal5,14,000EncounterSecondary
European Rare Kidney Disease Registry (ERKReg)ERKRegGermany17,079DiseaseMixed
University of TartuEstonian BiobankEstonia2,02,102PopulationMixed
FIIBAPFIIBAP-COVID19Spain3,38,303EncounterPrimary
Fondazione IRCCS Istituto Neurologico Carlo BestaFINCB - DatasetItaly1,32,408EncounterMixed
Fondazione IRCCS Istituto Neurologico Carlo Besta FINCBFINCB-COVID19Italy766DiseaseSecondary
FinRegistry (Institute of Molecular Medicine Finland (FIMM), University of Helsinki)FinRegistryFinland53,43,204PopulationMixed
Queen Mary University of LondonFLSGreat Britain27,000DiseaseMixed
Fondazione Poliambulanza Istituto OspedalieroFPIOItaly23,116DiseaseSecondary
Geneva Cancer RegistryGCRSwitzerland1,48,929DiseaseMixed
Telavi Regional HospitalGE TelaviGeorgia41,059EncounterSecondary
GENERAL HOSPITAL OF KAVALAGHKGreece1,83,024EncounterSecondary
MS Forschungs- und Projektentwicklungs-GmbHGMSRGermany82,300DiseaseMixed
Grande Ospedale Metropolitano “Bianchi-Melacrino-Morelli”GOM-RCItaly1,99,645PopulationMixed
GOSHGOSH DREGreat Britain1,35,511EncounterMixed
Fundacion de Investigacion Biomedica del Hospital Universitario 12 de OctubreH12OSpain28,09,436EncounterMixed
Hadassah OBGYNHadassahOBGYNIsrael1,19,753EncounterSecondary
Hospital Distrital de Santarém (HDS)HDS Oncology and Obesity EHRPortugal5000DiseaseMixed
Harvey Walsh LtdHESGreat Britain########EncounterSecondary
Health Informatics Center (HIC)HICGreat Britain12,53,625EncounterMixed
SIMGHSDItaly23,99,088EncounterPrimary
Hospital Sant Joan de DéuHSJDSpain12,47,603EncounterSecondary
Fundación para la Investigación del Hospital Universitario La Fe de la Comunidad Valenciana (HULAFE)HULAFESpain22,74,159EncounterMixed
Hospital District of Helsinki and UusimaaHUSFinland33,33,798EncounterSecondary
Virgen Macarena University HospitalHUVMSpain10,89,615EncounterMixed
DIAGNOSTIC & THERAPEUTIC CENTER OF ATHENS “HYGEIA” SINGLE MEMBER SOCIETE ANONYMEHYGEIA-EHDENGreece5,66,798EncounterMixed
IcometrixIcometrixBelgium4595EncounterMixed
Lancashire and South Cumbria Integrated Care BoardIDRIL-1Great Britain14,99,205EncounterSecondary
Fundacio Institut d’Investigacions Mèdiques (FIMIM)IMASISSpain18,00,000EncounterMixed
Lille University HospitalINCLUDEFrance1,914.68EncounterSecondary
InGef - Institute for Applied Health Research Berlin GmbHInGef RDBGermany91,11,064PopulationMixed
Institut Català d’OncologiaInstitut Català d’OncologiaSpain4,06,877DiseaseMixed
Fondazione Istituto Nazionale dei TumoriINTItaly7,21,861DiseaseMixed
NO GRANTIPCINetherlands28,70,221PopulationPrimary
Consorci Corporació Sanitària Parc TaulíIRISSpain12,86,363EncounterMixed
Istanbul UniversityITFTurkey8,99,515EncounterMixed
IUC Cerrahpaşa TIP FakületesiIU-CTFTurkey5,84,043EncounterMixed
E-MEDIT D.O.O. & Hospital TravnikJU TravnikBosnia and Herzegovina52,479EncounterSecondary
IN2 d.o.o. & Clinical Hospital Center OsijekKBC OsijekCroatia3,81,105EncounterMixed
IGEA d.o.o. & University Hospital Center Sestre milosrdniceKBC SMCroatia6,00,000EncounterPrimary
Hierarchia & University Hospital Center ZagrebKBCZgCroatia9,61,568EncounterMixed
MEB KIKI-MEBSweden13,00,000PopulationMixed
Bács-Kiskun Megyei Kórház a Szegedi Tudományegyetem Általános Orvostudományi Kar Oktató KórházaKkh_EMRHungary5,00,000EncounterSecondary
The Directorate of Government Medical Centers at the Israeli Ministry Of HealthKMC-EHRIsrael65,85,681EncounterSecondary
Lambeth DataNetLDNGreat Britain13,50,835EncounterPrimary
Hospital da Luz Learning HealthLH_MMOPortugal7245DiseaseSecondary
Leeds Teaching HospitalsLTHTGreat Britain19,25,447EncounterSecondary
OAKS Consulting s.r.o.LUCASCzech Republic8507DiseaseMixed
MCS Grupa d.o.o. & Health Care Center of Primorje-Gorski Kotar CountyM-DZPGZCroatia2,77,128EncounterPrimary
Azienda Ospedaliera SS Antonio e Biagio e Cesare ArrigoMACADAMItaly879DiseaseMixed
MedamanMHDBelgium1,17,105EncounterSecondary
Hanover Medical SchoolMHHGermany22,52,576EncounterSecondary
CancerDataNet GmbHMM_CDNGermany7006DiseaseMixed
University MS CenterMSDCBelgium872DiseaseMixed
Medical University of ViennaMUVAustria33,948EncounterSecondary
Medical University of ViennaMUV-H2O-BCAustria3508DiseaseSecondary
Medical University of ViennaMUV-H2O-DMAustria170DiseaseSecondary
IKNLNCRNetherlands23,39,983DiseaseMixed
National Institute of Health Insurance Fund Management HungaryNEAKHungary14,140,982PopulationMixed
AO Card. G. Panico - Center for Neurodegenerative Diseases and Aging BrainNeurage-DBItaly552DiseaseSecondary
Queen Mary University of LondonNHFDGreat Britain5,90,584DiseaseSecondary
King’s College LondonNHIC RENAL GSTTGreat Britain2149DiseaseSecondary
University of OsloNHR@UiONorway73,43,868PopulationMixed
National Intensive Care Evaluation foundationNICENetherlands10,72,259EncounterSecondary
UK National Neonatal Research DatabaseNNRDGreat Britain11,80,103EncounterSecondary
Szabolcs-Szatmár-Bereg Megyei Kórházak és Egyetemi OktatókórházNyir_EMRHungary9,67,000EncounterSecondary
Bambino Gesù Children’s HospitalOBG-POHDItaly3901DiseaseSecondary
Onze-Lieve-Vrouwziekenhuis Aalst-Asse-NinoveOLVZ_LUNGBelgium364DiseaseMixed
GermanOncologyOncalizerRegGermany4159DiseaseSecondary
Optimum Patient Care LimitedOPCRDGreat Britain25,953,068EncounterPrimary
Royal College of General Practitioners (RCGP)ORCHIDGreat Britain80,00,000PopulationPrimary
Rioja SaludPASCALSpain7,89,371EncounterMixed
University of Turku (Prostate Cancer Registry of South West Finland)PcaSFFinland22,232DiseaseMixed
ASST Papa Giovanni XXIIIPG23Italy4,51,135EncounterMixed
Papageorgiou General HospitalPGHGreece14,12,857EncounterSecondary
STIZONPHARMONetherlands44,41,048EncounterPrimary
BCB Medical LtdPirha BCB IBDFinland4516DiseaseSecondary
Fondazione IRCCS Ca’ Granda Ospedale Maggiore PoliclinicoPOLIMIItaly14,70,942EncounterSecondary
UZ BrusselPRIMUZBelgium5594EncounterSecondary
Fundació Institut d´Investigació Sanitària Illes BalearsPRISIBSpain24,98,226EncounterMixed
IRCCS Policlinico San DonatoPSDItaly4,85,174EncounterMixed
Finnish Clinical Biobank TamperePSHP OncologyFinland1,14,697DiseaseSecondary
Parc Sanitari Sant Joan de DéuPSSJDSpain6,59,817EncounterMixed
Harm SlijperPulseHandWristNetherlands49,903DiseaseSecondary
QuironsaludQuirónSaludSpain2,98,839EncounterSecondary
Registo Portugues de Doentes ReumaticosReuma.ptPortugal28,325DiseaseSecondary
Czech Myeloma GroupRMGCzech Republic9802DiseaseMixed
Registre National du Cancer du LuxembourgRNCLuxembourg8892DiseaseMixed
Vaud Cancer RegistryRVTSwitzerland1,54,043DiseaseMixed
LynxCareRWEHub_CardiologyBelgium18,296EncounterSecondary
LynxCareRWEHubHFBelgium26,500DiseaseSecondary
SAIL DatabankSAIL - ADDEGreat Britain7,93,300PopulationSecondary
SAIL DatabankSAIL - NCCHGreat Britain19,07,900PopulationPrimary
SAIL DatabankSAIL - PATDGreat Britain11,05,100DiseasePrimary
SAIL DatabankSAIL - PEDWGreat Britain34,23,200EncounterSecondary
SAIL DatabankSAIL - WDSDGreat Britain55,69,400PopulationPrimary
SAT HealthSATHEALTHBulgaria12,451EncounterSecondary
Gothenburg UniversitySCIFI-PEARLSweden11,700,000PopulationMixed
Servicio Cántabro de Salud and IDIVALSCIVALSpain13,77,099EncounterMixed
HUG and SCQMSCQMSwitzerland20,355DiseaseMixed
Consellería de SanidadeSERGASSpain29,16,773EncounterMixed
University of EdinburghSESCDGreat Britain35,395DiseaseSecondary
SIDIAP - The Information System for Research in Primary CareSIDIAPSpain78,87,308EncounterPrimary
King’s College LondonSLSRGreat Britain6242DiseaseMixed
Health Data HubSNDSFrance671,610cPopulationMixed
Bordeaux PharmacoEpiSNDS (BPE)France72,520cPopulationMixed
SWIBREGSWIBREGSweden60,000DiseaseMixed
Pirkanmaa Hospital DistrictTaUHFinland8,93,817EncounterSecondary
CEGEDIM HEALTH DATATHIN FRANCEFrance########EncounterPrimary
CEGEDIM HEALTH DATATHIN RomaniaRomania10,48,994EncounterPrimary
CEGEDIM HEALTH DATATHIN UKGreat Britain########EncounterPrimary
Finnish Institute of Health and WelfareTHL-AVOHILMOFinland72,94,000EncounterPrimary
Finnish Institute for Health and Welfare (THL)THL-HILMOFinland71,02,953EncounterMixed
Trinity St James’s Cancer InstituteTSJCI BREIreland1020DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI COLIreland624DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI GYNIreland922DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI HANIreland1363DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI LNGIreland1920DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI SKNIreland663DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI UGIIreland1785DiseaseMixed
Trinity St James’s Cancer InstituteTSJCI UROIreland2047DiseaseMixed
Clinical center of NisUCCNisSerbia3400EncounterSecondary
Clinical Center of SerbiaUCCSSerbia8,60,000EncounterSecondary
University College London HospitalsUCLHGreat Britain1,88,970EncounterSecondary
University College London (UCL) (UK Biobank)UK BiobankGreat Britain5,02,504PopulationMixed
University Medicine DresdenUKDresdenGermany6,24,697EncounterSecondary
National Cancer InstituteULRUkraine1112DiseaseMixed
ULS AC CardiovascularULS AC CardiovascularPortugal2180EncounterSecondary
ULSMULSM COVIDPortugal9750DiseaseSecondary
Unidade Local de Saúde de MatosinhosULSM RT-DBPortugal6,79,804EncounterSecondary
University of PécsUP-HCDBHungary10,12,198EncounterSecondary
Semmelweis UniversityUSN_EMRHungary20,75,672EncounterSecondary
University Hospital AntwerpUZA_NLP_ONCOBelgium4562DiseaseSecondary
Universitaire Ziekenhuizen KU LeuvenUZLDBBelgium5,82,709DiseaseSecondary
Vall d’Hebrón Hospital CampusVHSpain17,99,398EncounterSecondary
FISABIO-HSRUVID-CONSIGNSpain19,64,588PopulationMixed
VieCuri Medisch CentrumViecuriNetherlands4918EncounterSecondary
Ziekenhuis Oost-LimburgZOL-EPDexport-DBBelgium12,209EncounterSecondary

aEHR: electronic health record.

bNLP: natural language processing.

cThis is a subset of the full data source.

Looking at care settings, 46.7% (98/210) of data sources represent data from the secondary setting only, while 42.4% (89/210) represent data from mixed settings (primary and secondary). A comparatively smaller set of 11.0% (23/210) represents data only from the primary care setting (Table 2). Looking at the ways in which persons are included in the data sources, 55.7% (n=117) do so through health care encounters, 32.9% (n=69) through disease-specific data collection, and 11.4% (n=24) through population-based sources.

Table 2. Stratification of data sources in the European Health Data & Evidence Network by method of person inclusion and care level. Person inclusion reflects the basis by which individuals are represented in the database: health care encounters, disease-specific inclusion, or population-based inclusion. Care settings indicate whether data were captured in primary care, secondary (hospital) care, or across both (mixed).
Care levelPerson inclusionValues, n (%)
DiseaseEncounterPopulation
Mixed, n40341589 (42.4)
Primary, n214723 (11)
Secondary, n2769298 (46.7)
Total, n (%)69 (32.9)117 (55.7)24 (11.4)210 (100)

Figure 3 shows the number of data sources that receive information through each capture method and each combination of capture methods. EHR is the most common, with 74.7% (157/210) of data sources reporting at least 1 capture method as EHR. Over half of those data sources (85/210) report EHR as their only method for receiving data. Laboratory is the second most common way data sources capture information, as it is reported in 29.5% (62/210) of data sources. Unlike EHR, laboratory data is more likely to be coupled with another data capture method, as only one data source lists laboratory as the singular way they receive information.

Figure 3. The frequency and overlap of different data capture methods used across 210 standardized real-world data sources in the European Health Data & Evidence Network as of September 1, 2024. EHR: electronic health record; NLP: natural language processing.

Principal Findings

The varied health care data across Europe, as demonstrated by the summary of 210 data sources in EHDEN from 30 countries, underscores the critical need to generate evidence from more than one data source to comprehensively represent the health care needs or experiences of the entire European population. Across the person inclusion and care levels represented in the network, the data sources are well distributed, emphasizing how health care systems, populations, and data capture methods can differ substantially. While 74.7% (157/210) of the data sources report EHR as at least one of their data capture methods, only 40.4% (85/210) report EHR as their only data capture method. The other 34.3% (72/210) report some combination of EHR, laboratory, case report form, claim, natural language processing, and death register data, showcasing the tremendous heterogeneity of data available in Europe.

Prior Initiatives

Prior initiatives like European Union–Adverse Drug Reactions (EU-ADR) and Innovative Medicines Initiative–European Medical Information Framework (IMI-EMIF) laid the groundwork for EHDEN, with learnings from those projects directly impacting this project [15,18,36,37]. EU-ADR demonstrated the feasibility of building a federated data network for large-scale drug safety monitoring in Europe using common data analysis files. IMI-EMIF made the first transition from using common input files like those in EU-ADR to the OMOP CDM, but it was not scalable due to the lack of funds and need for trained SMEs, both problems which EHDEN addressed.

Sustainability and Success of EHDEN

The sustainability of the EHDEN initiative has been achieved through a combination of mechanisms that foster shared leadership, collaboration, and long-term value creation. One key factor has been the stimulation and enablement of both national and European collaborations. The establishment of OHDSI National Nodes has provided a platform for DPs within individual countries to collaborate, share best practices, and enhance data quality [38]. These nodes facilitate national-level harmonization while ensuring compliance with local regulations and coding systems, thereby strengthening the network’s integrity. Beyond this, EHDEN’s adoption in multiple European projects has further expanded its influence, including its pivotal role in enabling large-scale initiatives such as the Data Analysis and Real World Interrogation Network (DARWIN EU). This has also influenced how the European Federation of Pharmaceutical Industries and Associations (EFPIA) is standardizing its data, demonstrating EHDEN’s impact across sectors.

EHDEN has also delivered economic value by creating local ecosystems that support SMEs and DPs. Through the Harmonization Fund, EHDEN has injected resources into the European health care data landscape, with the return on investment yielding a multiplier effect. By recruiting and training SMEs through the EHDEN Academy, the initiative has built local expertise to support DPs throughout the ETL process, ensuring decentralized and sustainable support for the network.

One of the goals of EHDEN has been to standardize health data, akin to utilities like electricity or the internet, essential and accessible to a rapidly growing number of stakeholders across Europe. Now that EHDEN has transitioned from a project under IHI to the nonprofit EHDEN Foundation, the focus has shifted to sustaining, expanding, and improving the network while leveraging the harmonized data for evidence generation. This next phase aims to generate meaningful RWE for research and regulatory purposes. A recent report by The European Commission on the future of European competitiveness highlights EHDEN’s foundational role in shaping the future of the European Health Data Space, further solidifying its legacy as a critical driver of innovation and collaboration in European health care [39].

The success of EHDEN in harmonizing data to the OMOP CDM has led to significant advances in methods research and evidence generation [40-43]. Many of the DPs involved in EHDEN have used their standardized data to conduct analyses across a broad spectrum of use cases. For instance, several studies have been conducted to describe the natural history of diseases, the safety and effectiveness of treatments, and health care utilization patterns across diverse populations [44-46]. One clear example is a multinational network cohort study by Li et al [45], which used evidence generated from EHDEN DPs to characterize background incidence rates of adverse events of special interest related to COVID-19 vaccines. EHDEN’s standardized data has also enabled and improved the development of predictive models, allowing for personalized predictions of treatment outcomes and disease progression [47,48]. The harmonization of data has facilitated large-scale population-level studies, which are crucial for understanding trends in public health and informing health care policy decisions [49,50]. These examples of evidence generation illustrate the broad applicability of the data in the network, which serves both academic researchers and regulatory agencies. A regularly updated list of EHDEN-supported studies and publications is maintained on the EHDEN website, providing a comprehensive overview of the diverse applications of the network in regulatory, clinical, and methodological research.

A notable demonstration of EHDEN’s success is the network’s use in providing timely information on medicines under surveillance due to shortages in multiple European countries. More than 50 DPs contributed data to a study titled “Incidence, Prevalence, and Characterization of Medicines with Suggested Drug Shortages in Europe” [51]. This study represents the largest observational database study conducted across Europe, both in terms of the number of databases involved and its geographic scope. The findings will support European efforts to monitor the use of critical medicines, contributing to the global fight against medicine shortages.

The EHDEN data network has also affected other European collaboratives around RWD. IMI projects like PIONEER, BigData@Heart, EU-PEARL, and HARMONY also use the OMOP CDM and have partly continued the mapping work done in EHDEN [52-55]. In the EMA-commissioned DARWIN EU initiative, among the 20 DPs onboarded in the first 2 years, 16 are also EHDEN DPs [56].

Future Directions

Building on the progress achieved through the EHDEN network, several key areas offer opportunities for future development. One priority is fostering sustained engagement with DPs. Continuous collaboration will be essential to ensure that DPs remain active contributors to the network by regularly updating and improving their data contributions. Strategies to incentivize engagement, provide ongoing support, and ensure mutual value will be vital for the network’s long-term success, particularly as efforts shift toward more robust evidence generation and ongoing enhancements in data quality.

Expanding the network’s reach and optimizing its databases for specific research use cases are also key areas for growth. With 210 data sources currently included, there is a significant opportunity to onboard additional DPs and expand the network’s coverage across Europe. Future studies will also help identify gaps where further data optimization is required, such as refining mappings or addressing specific quality issues to ensure that the evidence generated is robust, reproducible, and generalizable.

Finally, the newly established EHDEN Foundation will play a critical role in these efforts. By securing funding and fostering collaborations, the Foundation can drive the onboarding of new DPs, address emerging research questions, and ensure that EHDEN continues to adapt to the evolving health care landscape. It also serves as a point of entry for external researchers, who may engage with the network and propose studies through its federated framework. These directions will position the network to remain a cornerstone for RWE generation in Europe, supporting both research and regulatory innovation.

Conclusions

The results of this study demonstrate that the identification, harmonization, and standardization of data sources through EHDEN have contributed significantly to understanding the diverse RWD landscape and advancement of evidence generation across Europe. These efforts are not only improving observational health research but are also influencing broader regulatory initiatives, such as DARWIN EU, which builds on the foundational work of EHDEN to leverage RWD for regulatory decision-making. Now that the initiative has transitioned to the EHDEN Foundation, there is an opportunity to focus even more on generating high-quality evidence, further solidifying the role of real-world data in improving health care and informing policy decisions across Europe.

Acknowledgments

The authors would like to express their deepest gratitude to all European Health Data & Evidence Network (EHDEN) data partners who contributed their time, effort, and expertise in harmonizing their data to the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM). Their commitment to data quality and transparency has been fundamental to the success of this initiative. We extend our appreciation to the small-to-medium enterprises that played a pivotal role in supporting data partners through the extract-transform-load process. Their expertise in data mapping and technical implementation has been invaluable in ensuring high-quality and standardized data across the network. We would also like to acknowledge the contributions of the EHDEN work packages and their leads, whose dedication to data harmonization, quality assessment, infrastructure development, training, and sustainability efforts has made this project possible. Their leadership and vision have shaped EHDEN into a robust and scalable federated data network. Finally, we extend our thanks to all members of the EHDEN Consortium who have worked tirelessly to build and sustain this network. This includes researchers, data scientists, software engineers, governance and regulatory experts, and the broader Observational Health Data Sciences and Informatics (OHDSI) community, whose open-science collaboration and innovation continue to drive the success of EHDEN. We recognize that the continued success of EHDEN is the result of collective contributions from numerous stakeholders across Europe, and we sincerely appreciate the efforts of everyone involved. This project has received support from the EHDEN project. EHDEN received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement 806968. The Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation program and European Federation of Pharmaceutical Industries and Associations.

Data Availability

All data supporting this work can be found on the European Health Data & Evidence Network (EHDEN) portal at [57].

Authors' Contributions

All authors (CB, MJS, MM, EAV, MC, PRR, and PBR) were involved in data collection. CB, MJS, PBR, MM, and PRR were involved in the study design, analysis, and interpretation of results. CB, MJS, PBR, and PRR contributed to writing, and all authors revised and approved the final draft.

Conflicts of Interest

CB, MJS, EAV, and PBR are employees of Johnson & Johnson and hold stock and stock options. PRR works for a department that receives/received unconditional research grants from Amgen, Chiesi, Johnson & Johnson, UCB Biopharma, the European Medicines Agency, and the Innovative Medicines Initiative.

Multimedia Appendix 1

European Health Data & Evidence Network data partner call description.

PDF File, 771 KB

Multimedia Appendix 2

European Health Data & Evidence Network framework for quality benchmarking.

PDF File, 961 KB

Multimedia Appendix 3

European Health Data & Evidence Network subgrant agreement model.

PDF File, 380 KB

  1. Framework for FDA’s real-world evidence program. US Food and Drug Administration. 2018. URL: https://www.fda.gov/media/120060/download [Accessed 2025-07-25]
  2. HMA-EMA joint big data taskforce summary report. European Medicines Agency. 2019. URL: https:/​/www.​ema.europa.eu/​en/​documents/​minutes/​hma-ema-joint-task-force-big-data-summary-report_en.​pdf [Accessed 2025-07-31]
  3. Methodological guide. The European Network of Centres for Pharmacoepidemiology and Pharmacovigilance (ENCePP) Guide on Methodological Standards in Pharmacoepidemiology. URL: https://encepp.europa.eu/encepp-toolkit/methodological-guide_en [Accessed 2024-09-06]
  4. Jonker CJ, Bakker E, Kurz X, Plueschke K. Contribution of patient registries to regulatory decision making on rare diseases medicinal products in Europe. Front Pharmacol. 2022;13:924648. [CrossRef] [Medline]
  5. Eskola SM, Leufkens HGM, Bate A, De Bruin ML, Gardarsdottir H. The role of real-world data and evidence in oncology medicines approved in EU in 2018-2019. J Cancer Policy. Jun 2023;36:100424. [CrossRef] [Medline]
  6. Bolislis WR, Fay M, Kühler TC. Use of real-world data for new drug applications and line extensions. Clin Ther. May 2020;42(5):926-938. [CrossRef] [Medline]
  7. Guidance on good pharmacovigilance practices (GVP) module VIII post-authorisation safety studies. European Medicines Agency. URL: https:/​/www.​ema.europa.eu/​en/​documents/​scientific-guideline/​guideline-good-pharmacovigilance-practices-gvp-module-viii-post-authorisation-safety-studies-rev-3_en.​pdf [Accessed 2024-08-06]
  8. Santoro A, Genov G, Spooner A, Raine J, Arlett P. Promoting and protecting public health: how the European Union pharmacovigilance system works. Drug Saf. Oct 2017;40(10):855-869. [CrossRef] [Medline]
  9. Goedecke T, Morales DR, Pacurariu A, Kurz X. Measuring the impact of medicines regulatory interventions - systematic review and methodological considerations. Br J Clin Pharmacol. Mar 2018;84(3):419-433. [CrossRef] [Medline]
  10. Health at a glance: Europe 2022: state of health in the EU cycle. Organisation for Economic Co-operation and Development. 2022. URL: https:/​/www.​oecd-ilibrary.org/​social-issues-migration-health/​health-at-a-glance-europe-2022_507433b0-en [Accessed 2024-08-22]
  11. Cave A, Kurz X, Arlett P. Real-world data for regulatory decision making: challenges and possible solutions for Europe. Clin Pharmacol Ther. Jul 2019;106(1):36-39. [CrossRef] [Medline]
  12. Bakker E, Plueschke K, Jonker CJ, Kurz X, Starokozhko V, Mol PGM. Contribution of real-world evidence in European Medicines Agency’s regulatory decision making. Clin Pharmacol Ther. Jan 2023;113(1):135-151. [CrossRef] [Medline]
  13. Flynn R, Plueschke K, Quinten C, et al. Marketing authorization applications made to the European Medicines Agency in 2018-2019: what was the contribution of real-world evidence? Clin Pharmacol Ther. Jan 2022;111(1):90-97. [CrossRef] [Medline]
  14. Gini R, Schuemie M, Brown J, et al. Data extraction and management in networks of observational health care databases for scientific research: a comparison of EU-ADR, OMOP, Mini-Sentinel and MATRICE strategies. EGEMS (Wash DC). 2016;4(1):1189. [CrossRef] [Medline]
  15. Coloma PM, Schuemie MJ, Trifirò G, et al. Combining electronic healthcare databases in Europe to allow for large-scale drug safety monitoring: the EU-ADR Project. Pharmacoepidemiol Drug Saf. Jan 2011;20(1):1-11. [CrossRef] [Medline]
  16. Trifirò G, Coloma PM, Rijnbeek PR, et al. Combining multiple healthcare databases for postmarketing drug and vaccine safety surveillance: why and how? J Intern Med. Jun 2014;275(6):551-561. [CrossRef] [Medline]
  17. Stang PE, Ryan PB, Racoosin JA, et al. Advancing the science for active surveillance: rationale and design for the Observational Medical Outcomes Partnership. Ann Intern Med. Nov 2, 2010;153(9):600-606. [CrossRef] [Medline]
  18. Reynolds RF, Kurz X, de Groot MCH, et al. The IMI PROTECT project: purpose, organizational structure, and procedures. Pharmacoepidemiol Drug Saf. Mar 2016;25 Suppl 1(5–10):5-10. [CrossRef] [Medline]
  19. EFPIA recommendations on a connected data system in Europe. European Federation of Pharmaceutical Industries and Associations. URL: https:/​/www.​efpia.eu/​news-events/​the-efpia-view/​efpia-news/​efpia-recommendations-on-a-connected-data-system-in-europe/​ [Accessed 2024-09-06]
  20. European Health Data & Evidence Network. URL: https://www.ehden.eu/ [Accessed 2024-03-12]
  21. Innovative Health Initiative. 2024. URL: http://www.ihi.europa.eu/ [Accessed 2024-09-12]
  22. Voss EA, Makadia R, Matcho A, et al. Feasibility and utility of applications of the common data model to multiple, disparate observational health databases. J Am Med Inform Assoc. May 2015;22(3):553-564. [CrossRef] [Medline]
  23. OMOP Common Data Model. URL: https://ohdsi.github.io/CommonDataModel/ [Accessed 2025-07-25]
  24. Voss EA, Blacketer C, van Sandijk S, et al. European Health Data & Evidence Network – learnings from building out a standardized international health data network. J Am Med Inform Assoc. Dec 22, 2023;31(1):209-219. [CrossRef] [Medline]
  25. Hripcsak G, Duke JD, Shah NH, et al. Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers. Stud Health Technol Inform. 2015;216(574–578):574-578. [Medline]
  26. Schuemie M, Reps J, Black A, et al. Health-Analytics Data to Evidence Suite (HADES): open-source software for observational research. Stud Health Technol Inform. Jan 25, 2024;310:966-970. [CrossRef] [Medline]
  27. Chapter 4: the common data model. In: The Book of OHDSI.; URL: https://ohdsi.github.io/TheBookOfOhdsi/ [Accessed 2025-07-25]
  28. Reich C, Ostropolets A, Ryan P, et al. OHDSI standardized vocabularies – a large-scale centralized reference ontology for international data harmonization. J Am Med Inform Assoc. Feb 16, 2024;31(3):583-590. [CrossRef] [Medline]
  29. Papez V, Moinat M, Voss EA, et al. Transforming and evaluating the UK Biobank to the OMOP Common Data Model for COVID-19 research and beyond. J Am Med Inform Assoc. Dec 13, 2022;30(1):103-111. [CrossRef] [Medline]
  30. THEMIS convention repository for the OMOP CDM. THEMIS. URL: https://ohdsi.github.io/Themis/ [Accessed 2024-09-13]
  31. OHDSI/databaseconnector. GitHub. 2025. URL: https://github.com/OHDSI/DatabaseConnector [Accessed 2024-05-07]
  32. Blacketer C, Defalco FJ, Ryan PB, Rijnbeek PR. Increasing trust in real-world evidence through evaluation of observational data quality. J Am Med Inform Assoc. Sep 18, 2021;28(10):2251-2257. [CrossRef] [Medline]
  33. Blacketer C, Voss EA, DeFalco F, et al. Using the data quality dashboard to improve the EHDEN network. Appl Sci (Basel). Jan 2021;11(24):11920. [CrossRef]
  34. EHDEN/cdminspection: r package to support quality control inspection of an OMOP-CDM instance. GitHub. URL: https://github.com/EHDEN/CdmInspection [Accessed 2024-09-13]
  35. EHDEN Portal. URL: https://portal.ehden.eu/wordpress/index.php/landing-page-2/ [Accessed 2024-09-13]
  36. Klungel OH, Kurz X, de Groot MCH, et al. Multi-centre, multi-database studies with common protocols: lessons learnt from the IMI PROTECT project. Pharmacoepidemiol Drug Saf. Mar 2016;25 Suppl 1:156-165. [CrossRef] [Medline]
  37. Lovestone S, EMIF Consortium. The European medical information framework: a novel ecosystem for sharing healthcare data across Europe. Learn Health Syst. Apr 2020;4(2):e10214. [CrossRef] [Medline]
  38. Observational health data sciences and informatics national nodes. Observational Health Data Sciences and Informatics. URL: https://www.ohdsi-europe.org/index.php/national-nodes [Accessed 2024-09-27]
  39. The Draghi report on EU competitiveness. European Union. 2024. URL: https:/​/commission.​europa.eu/​topics/​strengthening-european-competitiveness/​eu-competitiveness-looking-ahead_en#paragraph_47059 [Accessed 2025-07-25]
  40. Yang C, Fridgeirsson EA, Kors JA, Reps JM, Rijnbeek PR. Impact of random oversampling and random undersampling on the performance of prediction models developed using observational health data. J Big Data. Jan 3, 2024;11(1):7. [CrossRef]
  41. Fridgeirsson EA, Williams R, Rijnbeek P, Suchard MA, Reps JM. Comparing penalization methods for linear models on large observational health data. J Am Med Inform Assoc. Jun 20, 2024;31(7):1514-1521. [CrossRef] [Medline]
  42. Williams RD, den Otter S, Reps JM, Rijnbeek PR. The DELPHI library: improving model validation, transparency and dissemination through a centralised library of prediction models. Stud Health Technol Inform. May 18, 2023;302:139-140. [CrossRef] [Medline]
  43. Markus AF, Kors JA, Rijnbeek PR. The role of explainability in creating trustworthy artificial intelligence for health care: a comprehensive survey of the terminology, design choices, and evaluation strategies. J Biomed Inform. Jan 2021;113:103655. [CrossRef] [Medline]
  44. Morales DR, Ostropolets A, Lai L, et al. Characteristics and outcomes of COVID-19 patients with and without asthma from the United States, South Korea, and Europe. J Asthma. Jan 2023;60(1):76-86. [CrossRef] [Medline]
  45. Li X, Ostropolets A, Makadia R, et al. Characterising the background incidence rates of adverse events of special interest for Covid-19 vaccines in eight countries: multinational network cohort study. BMJ. Jun 14, 2021;373:n1435. [CrossRef] [Medline]
  46. Voss EA, Shoaibi A, Yin Hui Lai L, et al. Contextualising adverse events of special interest to characterise the baseline incidence rates in 24 million patients with COVID-19 across 26 databases: a multinational retrospective cohort study. EClinicalMedicine. Apr 2023;58:101932. [CrossRef] [Medline]
  47. Williams RD, Reps JM, OHDSI/EHDEN Knee Arthroplasty Group, Rijnbeek PR, Ryan PB, Prieto-Alhambra D. 90-Day all-cause mortality can be predicted following a total knee replacement: an international, network study to develop and validate a prediction model. Knee Surg Sports Traumatol Arthrosc. Sep 2022;30(9):3068-3075. [CrossRef] [Medline]
  48. Reps JM, Williams RD, You SC, et al. Feasibility and evaluation of a large-scale external validation approach for patient-level prediction in an international data network: validation of models predicting stroke in female patients newly diagnosed with atrial fibrillation. BMC Med Res Methodol. May 6, 2020;20(1):102. [CrossRef] [Medline]
  49. Lane JCE, Weaver J, Kostka K, et al. Risk of depression, suicide and psychosis with hydroxychloroquine treatment for rheumatoid arthritis: a multinational network cohort study. Rheumatology (Oxford). Jul 1, 2021;60(7):3222-3234. [CrossRef] [Medline]
  50. Burn E, Weaver J, Morales D, et al. Opioid use, postoperative complications, and implant survival after unicompartmental versus total knee replacement: a population-based network study. Lancet Rheumatol. Dec 2019;1(4):e229-e236. [CrossRef] [Medline]
  51. Pineda-Moncusí M, Rekkas A, Pérez ÁM, et al. Trends of drug use with suggested shortages and their alternatives across 41 real world data sources and 18 countries in Europe and North America. medRxiv. Preprint posted online on 2024. [CrossRef]
  52. PIONEER - European Network of Excellence for Big Data in Prostate Cancer. URL: https://prostate-pioneer.eu/ [Accessed 2024-11-07]
  53. BigData@Heart > Home. URL: https://www.bigdata-heart.eu/ [Accessed 2024-11-07]
  54. HARMONY is part of the IMI “Big data for better outcomes. HARMONY Alliance. URL: https://www.harmony-alliance.eu/en/harmony-is-part-of-the-imi-big-data-for-better-outcomes-program [Accessed 2024-11-07]
  55. Innovative patient centric clinical trial platforms. EU-PEARL. URL: https://eu-pearl.eu/ [Accessed 2024-11-07]
  56. Darwin EU. European Medicines Agency. URL: https://www.darwin-eu.org/ [Accessed 2024-09-26]
  57. European Health Data & Evidence Network. EHDEN platform. URL: https://portal.ehden.eu/summary [Accessed 2025-07-15]


DARWIN EU: Data Analysis and Real World Interrogation Network
DP: data partner
DQD: data quality dashboard
EFPIA: European Federation of Pharmaceutical Industries and Associations Declarations
EHDEN: European Health Data & Evidence Network
EHR: electronic health record
EMIF: European Medical Information Framework
ETL: extract-transform-load
EU-ADR: European Union–Adverse Drug Reactions
IMI: Innovative Medicines Initiative
LOINC: Logical Observation Identifiers Names and Codes
OHDSI: Observational Health Data Sciences and Informatics
OMOP CDM : Observational Medical Outcomes Partnership Common Data Model
RWD: real-world data
RWE: real-world evidence
RxNorm: Prescription Normalized Names
SME: small-to-medium enterprise
SNOMED CT: Systematized Nomenclature of Medicine – Clinical Terms


Edited by Andrew Coristine; submitted 18.03.25; peer-reviewed by Chibuzo Onah, Mehmet Burcu, Odumbo Oluwole; final revised version received 30.05.25; accepted 30.05.25; published 07.08.25.

Copyright

© Clair Blacketer, Martijn J Schuemie, Maxim Moinat, Erica A Voss, Montse Camprubi, Peter R Rijnbeek, Patrick B Ryan. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 7.8.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.