Distributional data analysis via quantile functions and its application to modeling digital biomarkers of gait in Alzheimer’s Disease

Data Analysis FOS: Computer and information sciences Statistics - Applications 01 natural sciences Methodology (stat.ME) Wearable Electronic Devices Alzheimer Disease Humans Applications (stat.AP) 0101 mathematics Gait Statistics - Methodology
DOI: 10.1093/biostatistics/kxab041 Publication Date: 2021-10-21T19:10:36Z
ABSTRACT
SummaryWith the advent of continuous health monitoring with wearable devices, users now generate their unique streams of continuous data such as minute-level step counts or heartbeats. Summarizing these streams via scalar summaries often ignores the distributional nature of wearable data and almost unavoidably leads to the loss of critical information. We propose to capture the distributional nature of wearable data via user-specific quantile functions (QF) and use these QFs as predictors in scalar-on-quantile-function-regression (SOQFR). As an alternative approach, we also propose to represent QFs via user-specific L-moments, robust rank-based analogs of traditional moments, and use L-moments as predictors in SOQFR (SOQFR-L). These two approaches provide two mutually consistent interpretations: in terms of quantile levels by SOQFR and in terms of L-moments by SOQFR-L. We also demonstrate how to deal with multi-modal distributional data via Joint and Individual Variation Explained using L-moments. The proposed methods are illustrated in a study of association of digital gait biomarkers with cognitive function in Alzheimers disease. Our analysis shows that the proposed methods demonstrate higher predictive performance and attain much stronger associations with clinical cognitive scales compared to simple distributional summaries.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (65)
CITATIONS (19)