Karel G. M. Moons

ORCID: 0009-0004-7797-1056
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Artificial Intelligence in Healthcare and Education
  • Venous Thromboembolism Diagnosis and Management
  • Health Systems, Economic Evaluations, Quality of Life
  • Meta-analysis and systematic reviews
  • Machine Learning in Healthcare
  • Diagnosis and Treatment of Venous Diseases
  • Statistical Methods and Bayesian Inference
  • Radiomics and Machine Learning in Medical Imaging
  • Heart Failure Treatment and Management
  • Central Venous Catheters and Hemodialysis
  • Clinical practice guidelines implementation
  • Reliability and Agreement in Measurement
  • Explainable Artificial Intelligence (XAI)
  • Acute Ischemic Stroke Management
  • Health Promotion and Cardiovascular Prevention
  • COVID-19 diagnosis using AI
  • COVID-19 Clinical Research Studies
  • Frailty in Older Adults
  • COVID-19 and healthcare impacts
  • Artificial Intelligence in Healthcare
  • Clinical Reasoning and Diagnostic Skills
  • Atrial Fibrillation Management and Outcomes
  • Sepsis Diagnosis and Treatment
  • Neonatal and fetal brain pathology
  • Delphi Technique in Research

University Medical Center Utrecht
2006-2025

Utrecht University
2001-2025

Art Innovation (Netherlands)
2024

University Hospital Heidelberg
2016-2022

Heidelberg University
2016-2022

Oklahoma State University Center for Health Sciences
2014-2022

University of Oxford
2020

Nuffield Orthopaedic Centre
2020

Huisarts en Wetenschap
2016

Cancer Research And Biostatistics
2010

Carl Moons and colleagues provide a checklist background explanation for critically appraising extracting data from systematic reviews of prognostic diagnostic prediction modelling studies. Please see later in the article Editors' Summary

10.1371/journal.pmed.1001744 article EN cc-by PLoS Medicine 2014-10-14

Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine minimal sample size required and maximum number candidate predictors that can be examined. We present extensive simulation study in which we studied influence EPV, events fraction, predictors, correlations distributions predictor variables, area under ROC...

10.1177/0962280218784726 article EN cc-by-nc Statistical Methods in Medical Research 2018-07-03

Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this EPV only one supports the use of minimum 10 EPV. In paper, we examine reasons substantial differences between these extensive studies. The current study uses Monte Carlo simulations to evaluate small bias, coverage confidence intervals and mean square error logit coefficients. Logistic models fitted by...

10.1186/s12874-016-0267-3 article EN cc-by BMC Medical Research Methodology 2016-11-24

Evaluating the performance of a clinical prediction model is crucial to establish its predictive accuracy in populations and settings intended for use. In this article, first three part series, Collins colleagues describe importance meaningful evaluation using internal, internal-external, external validation, as well exploring heterogeneity, fairness, generalisability performance.

10.1136/bmj-2023-074819 article EN cc-by BMJ 2024-01-08

The medical field has seen a rapid increase in the development of artificial intelligence (AI)-based prediction models. With introduction such AI-based model tools and software cardiovascular patient care, researcher healthcare professional are challenged to understand opportunities as well limitations predictions. In this article, we present 12 critical questions for health professionals ask when confronted with an model. We aim support distinguish models that can add value care from AI does not.

10.1093/eurheartj/ehac238 article EN cc-by-nc European Heart Journal 2022-04-27

ABSTRACT Introduction Risk prediction models are increasingly used in healthcare to aid clinical decision‐making. In most contexts, model calibration (i.e., assessing the reliability of risk estimates) is critical. Data available for development often not perfectly balanced with modeled outcome individuals vs. without event interest equally prevalent data). It common researchers correct class imbalance, yet, effect such imbalance corrections on machine learning largely unknown. Methods We...

10.1002/sim.10320 article EN cc-by-nc-nd Statistics in Medicine 2025-01-26

To evaluate the possible effects of outpatient preoperative evaluation (OPE) for new surgical patients who will be inpatients, we conducted an observational study at a university hospital in The Netherlands. Various outcomes before and after introduction OPE clinic were compared. population comprised all 21,553 elective adult inpatients operated on between January 1, 1997 December 31, 1999. Cardiac surgery, obstetric pediatric patients, same-day surgery excluded. main outcome measures cases...

10.1097/00000539-200203000-00030 article EN Anesthesia & Analgesia 2002-03-01

The increasing availability of large combined datasets (or big data), such as those from electronic health records and individual participant data meta-analyses, provides new opportunities challenges for researchers developing validating (including updating) prediction models. These typically include individuals multiple clusters (such centres, geographical locations, or different studies). Accounting clustering is important to avoid misleading conclusions enables explore heterogeneity in...

10.1136/bmj-2022-071018 article EN cc-by BMJ 2023-02-07

Large Language Models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present TRIPOD-LLM, an extension of the TRIPOD+AI statement, addressing unique challenges LLMs biomedical applications. TRIPOD-LLM provides a comprehensive checklist 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce modular format accommodating various LLM research designs tasks, with 14 32 subitems applicable across...

10.1101/2024.07.24.24310930 preprint EN medRxiv (Cold Spring Harbor Laboratory) 2024-07-25

To determine the accuracy of gadolinium-enhanced breath-hold magnetic resonance (MR) angiography in diagnosis renal artery stenosis and visualization accessory arteries.Forty-four patients suspected having 10 potential kidney donors, all whom were scheduled to undergo elective intraarterial digital subtraction (DSA), studied. Three-dimensional gradient-echo MR was performed at 1.5 T with following parameters: repetition time, 13.5 msec; echo 3.5 flip angle, 60 degrees; 195 x 512 matrix;...

10.1148/radiology.207.2.9577501 article EN Radiology 1998-05-01

Background: Up to 90% of patients referred for ultrasonography with suspected deep venous thrombosis (DVT) the leg do not have disease. Objective: To evaluate safety and efficiency using a clinical decision rule that includes point-of-care d-dimer assay at initial presentation in primary care exclude DVT. Design: A prospective management study. Setting: Approximately 300 practices 3 regions Netherlands (Amsterdam, Maastricht, Utrecht). Patients: 1028 consecutive clinically Intervention:...

10.7326/0003-4819-150-4-200902170-00003 article EN Annals of Internal Medicine 2009-02-17

Combining several tests is a common way to improve the final classification of disease status in diagnostic accuracy studies but often used ambiguously. This article gives advice on proper use and reporting composite reference standards

10.1136/bmj.f5605 article EN BMJ 2013-10-25

Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers...

10.48550/arxiv.2206.01653 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Objectives The Wells rule is often used in primary care to out pulmonary embolism (PE), but its efficiency low as many referred patients do not have PE. In this study, we evaluated an alternative and potentially more efficient diagnostic strategy—the YEARS algorithm; a simplified three-item version of the combined with pretest probability adjusted D-dimer interpretation. Design comprehensive prospective validation suspected PE were enrolled by their general practitioner. All three items...

10.1136/bmjopen-2024-091543 article EN cc-by-nc-nd BMJ Open 2025-02-01

Abstract The disease course and outcome of COVID-19 greatly varies between individuals. To explore which biological systems may contribute to this variation, we examined how individual metabolites three metabolic scores relate outcomes in hospitalized patients. metabolome 346 patients was measured using the 1H-NMR Nightingale platform. association metabolomic features multi-biomarker scores, i.e. MetaboHealth, MetaboAge, Infectious Disease Score (IDS) (higher reflect poorer health), with...

10.1007/s11357-025-01591-z article EN cc-by GeroScience 2025-03-11

Expert panels are often used as a reference standard when no gold is available in diagnostic test accuracy research. It unclear what study and expert panel characteristics produce the best estimates of accuracy. We simulated large range scenarios to assess impact on index estimates. Simulations were performed which an was estimate sensitivity specificity test. Diagnostic determined by combining probability target condition presence, provided experts using four component tests, through...

10.1186/s12874-025-02557-7 article EN cc-by-nc-nd BMC Medical Research Methodology 2025-04-23

A universal challenge in studies that quantify the accuracy of diagnostic tests is establishing whether each participant has disease interest. Ideally, same preferred reference standard would be used for all participants; however, practical or ethical reasons, alternative standards are often less accurate frequently instead. The use different across participants a single study known as differential verification. Differential verification can cause severely biased estimates test model being...

10.7326/0003-4819-159-3-201308060-00009 article EN Annals of Internal Medicine 2013-08-06

While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment validation used algorithms, but relatively little attention has been given practical pitfalls when using specific a task. These typically related (1) disregard inherent metric properties, such as behaviour in presence class...

10.48550/arxiv.2104.05642 preprint EN cc-by arXiv (Cornell University) 2021-01-01
Coming Soon ...