Jianhui Zhou

ORCID: 0000-0003-3488-4070
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Statistical Methods and Inference
  • Statistical Methods and Bayesian Inference
  • Bayesian Methods and Mixture Models
  • Advanced Statistical Methods and Models
  • Statistical Methods in Epidemiology
  • Machine Learning and ELM
  • Statistical Methods in Clinical Trials
  • Child Nutrition and Water Access
  • Genetic and phenotypic traits in livestock
  • Optimal Experimental Design Methods
  • Spatial and Panel Data Analysis
  • Functional Brain Connectivity Studies
  • Genetics and Neurodevelopmental Disorders
  • Advanced Causal Inference Techniques
  • Medical Image Segmentation Techniques
  • Advanced MIMO Systems Optimization
  • Memory and Neural Mechanisms
  • Caching and Content Delivery
  • Network Traffic and Congestion Control
  • Tree Root and Stability Studies
  • Advanced Data Storage Technologies
  • Advanced Multi-Objective Optimization Algorithms
  • Morphological variations and asymmetry
  • Sparse and Compressive Sensing Techniques
  • Birth, Development, and Health

University of Virginia
2011-2022

Shanghai University of Engineering Science
2018-2021

University of Michigan–Ann Arbor
2012

Johns Hopkins University
2012

U.S. National Science Foundation
2010

North Carolina State University
2009

Fudan University
2009

University of Illinois Urbana-Champaign
2006-2008

University of Illinois System
2008

Exact Sciences (United States)
2006

Semiparametric models are often considered for analyzing longitudinal data a good balance between flexibility and parsimony. In this paper, we study class of marginal partially linear quantile with possibly varying coefficients. The functional coefficients estimated by basis function approximations. estimation procedure is easy to implement, it requires no specification the error distributions. asymptotic properties proposed estimators established as well constant We develop rank score tests...

10.1214/09-aos695 article EN The Annals of Statistics 2009-10-23

We consider the penalized generalized estimating equations (GEEs) for analyzing longitudinal data with high-dimensional covariates, which often arise in microarray experiments and large-scale health studies. Existing regression procedures assume independent rely on likelihood function. Construction of a feasible joint function is challenging, particularly correlated discrete outcome data. The GEE procedure only requires specifying first two marginal moments working correlation structure....

10.1111/j.1541-0420.2011.01678.x article EN Biometrics 2011-09-28

Abstract We consider the generalized additive model when responses from same cluster are correlated. Incorporating correlation in estimation of nonparametric components for is important because it improves efficiency and increases statistical power selection. In our setting, there no specified likelihood function model, outcomes could be nonnormal discrete, which makes selection very challenging problems. propose consistent that incorporate structure. establish an asymptotic property with...

10.1198/jasa.2010.tm10128 article EN Journal of the American Statistical Association 2010-12-01

Identifying an informative correlation structure is important in improving estimation efficiency for longitudinal data. We approximate the empirical estimator of matrix by groups known basis matrices that represent different structures, and transform selection problem to a covariate problem. To address both complexity informativeness matrix, we minimize objective function consists two parts: difference between information model approximation penalty penalizes models with too many matrices....

10.1080/01621459.2012.682534 article EN Journal of the American Statistical Association 2012-06-01

We propose a shrinkage method to estimate the coefficient function in functional linear regression model when value of is zero within certain sub-regions. Besides identifying null region which zero, we also aim perform estimation and inferences for nonparametrically estimated without over-shrinking values. Our proposal consists two stages. In stage one, Dantzig selector employed provide initial location region. two, group SCAD approach refine inference procedures function. considerations...

10.5705/ss.2010.237 article EN Statistica Sinica 2012-03-14

Abstract Sociability is crucial for survival, whereas social avoidance a feature of disorders such as Rett syndrome, which caused by loss-of-function mutations in MECP2 . To understand how preference interactions encoded, we used vivo calcium imaging to compare medial prefrontal cortex (mPFC) activity female wild-type and Mecp2 -heterozygous mice during three-chamber tests. We found that mPFC pyramidal neurons -deficient are hypo-responsive both nonsocial stimuli. Hypothesizing this limited...

10.1038/s41467-022-31578-9 article EN cc-by Nature Communications 2022-07-06

Quantile regression has emerged as a powerful tool in survival analysis it directly links the quantiles of patients' times to their demographic and genomic profiles, facilitating identification important prognostic factors. In view limited work on variable selection context, we develop new adaptive-lasso-based procedure for quantile with censored outcomes. To account random censoring data multivariate covariates, employ ideas redistribution-of-mass e ective dimension reduction....

10.5705/ss.2011.100 article EN Statistica Sinica 2013-01-01

The ``curse of dimensionality'' has remained a challenge for high-dimensional data analysis in statistics. sliced inverse regression (SIR) and canonical correlation (CANCOR) methods aim to reduce the dimensionality by replacing explanatory variables with small number composite directions without losing much information. However, estimated generally involve all variables, making their interpretation difficult. To simplify direction estimates, Ni, Cook Tsai [Biometrika 92 (2005) 242--247]...

10.1214/07-aos529 article EN The Annals of Statistics 2008-07-16

In this paper, we introduce a family of robust estimates for the parametric and nonparametric components under generalized partially linear model, where data are modeled by $y_i|(\mathbf{x}_i,t_i)\sim F(\cdot,\mu_i)$ with $\mu_i=H(\eta(t_i)+\mathbf{x}_i^{$\mathrm{T}$}\beta)$, some known distribution function F link H. It is shown that $\beta$ root-n consistent asymptotically normal. Through Monte Carlo study, performance these estimators compared classical ones.

10.1214/009053606000000858 article EN The Annals of Statistics 2006-12-01

Environmental Enteropathy (EE) is a subclinical condition caused by constant fecal-oral contamination and resulting in blunting of intestinal villi inflammation. Of primary interest the clinical research to evaluate association between non-invasive EE biomarkers malnutrition cohort Bangladeshi children. The challenges are that number biomarkers/covariates relatively large, some them highly correlated. Many variable selection methods available literature, but which most appropriate for...

10.1186/s40364-017-0089-4 article EN cc-by Biomarker Research 2017-03-09

10.1016/j.jmva.2008.04.003 article EN publisher-specific-oa Journal of Multivariate Analysis 2008-04-17

Early childhood is a critical stage of physical and cognitive growth that forms the foundation future wellbeing. Stunted presented in one every 4 children worldwide contributes to developmental impairment under-five mortality. Better understanding early patterns should allow for detection intervention malnutrition. We aimed characterize child quantify change curves from World Health Organization (WHO) Child Growth Standards. In cohort 626 Bangladesh children, longitudinal height-for-age...

10.1186/s12887-017-0831-y article EN cc-by BMC Pediatrics 2017-03-21

In this paper, we propose a quantile approach to the multi-index semiparametric model for an ordinal response variable. Permitting non-parametric transformation of response, proposed method achieves root-n rate convergence and has attractive robustness properties. Further, allows additional indices remaining correlations between covariates residuals from single-index, considerably reducing error variance thus leading more efficient prediction intervals (PIs). The utility is demonstrated by...

10.1080/02664763.2013.785489 article EN Journal of Applied Statistics 2013-04-03

Covariate-adaptive designs are widely used to balance covariates and maintain randomization in clinical trials. Adaptive for discrete their asymptotic properties have been well studied the literature. However, important continuous often involved studies. Simply discretizing or categorizing can result loss of information. The current understanding adaptive with lacks a theoretical foundation as existing works entirely based on simulations. Consequently, conventional hypothesis testing trials...

10.1177/0962280218770231 article EN Statistical Methods in Medical Research 2018-05-17

10.1016/j.jspi.2010.06.013 article EN Journal of Statistical Planning and Inference 2010-06-14

10.1016/j.jspi.2007.07.007 article EN Journal of Statistical Planning and Inference 2007-08-16

Motivated by a genetic investigation on the progressive decline in renal function clinical trial study of kidney disease, we develop practical test for evaluating group difference trajectories under semi-parametric modeling framework. For temporal patterns or longitudinal data, B-splines are used to approximate non-parametrically. Such approximation asymptotically converts problem testing trajectory into significance regression coefficients that can be simply estimated generalized estimating...

10.1177/0962280215584109 article EN Statistical Methods in Medical Research 2015-05-13

In prior work, we analyzed the GridFTP usage logs collected by data transfer nodes (DTNs) located at national scientific computing centers, and found significant throughput variance even among transfers between same two end hosts. The goal of this work is to quantify impact various factors on variance. Our methodology consisted executing experiments a high-speed research testbed, running large-sized instrumented operational DTNs, creating statistical models from measurements. A non-linear...

10.1145/2534695.2534701 article EN 2013-11-12

Abstract Correlation structure selection for non‐normal longitudinal data is very challenging diverging cluster size because of the high‐dimensional correlation parameters involved and complexity likelihood function data. However identifying correct important it can improve estimation efficiency testing power We propose to approximate inverse empirical matrix using a linear combination candidate basis matrices, select by non‐zero coefficients matrices. This carried out minimizing penalized...

10.1002/cjs.11290 article EN Canadian Journal of Statistics 2016-06-27

Growth curve models have been widely used to analyse longitudinal data in social and behavioural sciences. Although growth with normality assumptions are relatively easy estimate, practical rarely normal. Failing account for non‐normal may lead unreliable model estimation misleading statistical inference. In this work, we propose a robust approach modelling using conditional medians that less sensitive outlying observations. Bayesian methods applied Based on the existing work quantile...

10.1111/bmsp.12216 article EN publisher-specific-oa British Journal of Mathematical and Statistical Psychology 2020-09-14

Alternating direction method of multipliers (ADMM) has been recognized as an efficient approach for solving many large-scale machine learning problems. However, the ADMM under master-slave mode suffers from several limitations, e.g., can't make full use multi-core cluster environment and single master load is too heavy, resulting in huge time overhead. In this paper, we propose a hierarchical communication structure. Since intra-node communications mostly shared memory, divide processes same...

10.1109/bdcloud.2018.00032 article EN 2018-12-01
Coming Soon ...