NFDI4DS | UHH-SEMS - Publication Details

Imputation of Clinical Covariates in Time Series

Imputation (statistics) Categorical variable

DOI: 10.48550/arxiv.1812.00418 Publication Date: 2018-01-01

Abstract Supplemental Material References Cited by

AUTHORS (3)

Dimitris Bertsimas

Agni Orfanoudaki

Colin Pawlowski

ABSTRACT

Missing data is a common problem in real-world settings and particularly relevant healthcare applications where researchers use Electronic Health Records (EHR) results of observational studies to apply analytics methods. This issue becomes even more prominent for longitudinal sets, multiple instances the same individual correspond different observations time. Standard imputation methods do not take into account patient specific information incorporated multivariate panel data. We introduce novel algorithm MedImpute that addresses this problem, extending flexible framework OptImpute suggested by Bertsimas et al. (2018). Our provides imputations sets with missing continuous categorical features, we present formulation implement scalable first-order $K$-NN model. test performance our on from Framingham Heart Study when are completely at random (MCAR). demonstrate leads significant improvements both accuracy downstream model AUC compared state-of-the-art

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENALEX - Publications OPENAIRE - Products

PlumX Metrics

Imputation of Clinical Covariates in Time Series

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....