NFDI4DS | UHH-SEMS - Publication Details

Changgee Chang

ORCID: 0000-0003-3426-1295

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5021164890

Research Areas

Gene expression and cancer classification
Bioinformatics and Genomic Networks
Statistical Methods and Inference
Statistical Methods and Bayesian Inference
Bayesian Methods and Mixture Models
Functional Brain Connectivity Studies
Genetic Associations and Epidemiology
Machine Learning in Bioinformatics
Privacy-Preserving Technologies in Data
COVID-19 and healthcare impacts
Gene Regulatory Network Analysis
Healthcare professionals’ stress and burnout
COVID-19 and Mental Health
Health, Environment, Cognitive Aging
Insurance, Mortality, Demography, Risk Management
Advanced Neuroimaging Techniques and Applications
Single-cell and spatial transcriptomics
Advanced MRI Techniques and Applications
Face and Expression Recognition
Advanced Causal Inference Techniques
Epigenetics and DNA Methylation
Advanced Bandit Algorithms Research
Data-Driven Disease Surveillance
Machine Learning and Data Classification
Markov Chains and Monte Carlo Methods

Indiana University – Purdue University Indianapolis
2023-2025

Indiana University School of Medicine
2023-2025

University of Pennsylvania
2018-2023

Weatherford College
2021

Flint Institute Of Arts
2021

Emory University
2016

University of Chicago
2010-2012

Mental health among otolaryngology resident and attending physicians during theCOVID‐19 pandemic: National study

OPENALEX - Publications

Alyssa M. Civantos Yasmeen M. Byrnes Changgee Chang Aman Prasad Kevin Chorath and 17 more

Abstract Background Otolaryngologists are among the highest risk for COVID‐19 exposure. Methods This is a cross‐sectional, survey‐based, national study evaluating academic otolaryngologists. Burnout, anxiety, distress, and depression were assessed by single‐item Mini‐Z Burnout Assessment, 7‐item Generalized Anxiety Disorder Scale, 15‐item Impact of Event 2‐item Patient Health Questionnaire, respectively. Results A total 349 physicians completed survey. Of them, 165 (47.3%) residents 212...

10.1002/hed.26292 article EN Head & Neck 2020-06-04

Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data

OPENALEX - Publications

Yi Deng Changgee Chang Moges Seyoum Ido Qi Long

Abstract Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression as a natural strategy building models, but limited research conducted general patterns where multiple variables have values. Using idea by chained equations (MICE), we investigate two approaches using to impute values that can handle patterns. We compare our MICE methods with several existing simulation studies. Our results...

10.1038/srep21689 article EN cc-by Scientific Reports 2016-02-12

Mental health among head and neck surgeons in Brazil during the COVID-19 pandemic: A national study

OPENALEX - Publications

Alyssa M. Civantos Antônio Augusto Tupinambá Bertelli Antônio José Gonçalves Emily Getzen Changgee Chang and 2 more

10.1016/j.amjoto.2020.102694 article EN American Journal of Otolaryngology 2020-08-21

Snapshot Impact of COVID‐19 on Mental Wellness in Nonphysician Otolaryngology Health Care Workers: A National Study

OPENALEX - Publications

Aman Prasad Alyssa M. Civantos Yasmeen M. Byrnes Kevin Chorath Seerat Poonia and 16 more

Nonphysician health care workers are involved in high-risk patient during the COVID-19 pandemic, placing them at high risk of mental burden. The impact this crucial population has not been studied thus far. Thus, objective study is to assess psychosocial well-being these providers.National cross-sectional online survey (no control group).Academic otolaryngology programs United States.We distributed a nonphysician departments across States. incorporated variety validated assessment tools...

10.1177/2473974x20948835 article EN cc-by-nc OTO Open 2020-07-01

Multiple imputation for analysis of incomplete data in distributed health data networks

OPENALEX - Publications

Changgee Chang Yi Deng Xiaoqian Jiang Qi Long

Abstract Distributed health data networks (DHDNs) leverage from multiple sources or sites such as electronic records (EHRs) healthcare systems and have drawn increasing interests in recent years, they do not require sharing of subject-level hence lower the hurdles for collaboration between institutions considerably. However, DHDNs face a number challenges analysis, particularly presence missing data. The current state-of-the-art methods handling incomplete pooling into central repository...

10.1038/s41467-020-19270-2 article EN cc-by Nature Communications 2020-10-29

Circulating KRAS G12D but not G12V is associated with survival in metastatic pancreatic ductal adenocarcinoma

OPENALEX - Publications

Jacob E. Till Lee McDaniel Changgee Chang Qi Long Shannon M. Pfeiffer and 37 more

Abstract While high circulating tumor DNA (ctDNA) levels are associated with poor survival for multiple cancers, variant-specific differences in the association of ctDNA and have not been examined. Here we investigate KRAS (ctKRAS) associations overall progression-free (OS/PFS) first-line metastatic pancreatic ductal adenocarcinoma (mPDAC) patients receiving chemoimmunotherapy (“PRINCE”, NCT03214250), an independent cohort standard care (SOC) chemotherapy. For PRINCE, higher baseline plasma...

10.1038/s41467-024-49915-5 article EN cc-by Nature Communications 2024-07-09

Estimation of covariance matrix via the sparse Cholesky factor with lasso

OPENALEX - Publications

Changgee Chang Ruey S. Tsay

10.1016/j.jspi.2010.04.048 article EN Journal of Statistical Planning and Inference 2010-05-06

Scalable Bayesian Variable Selection for Structured High-Dimensional Data

OPENALEX - Publications

Changgee Chang Suprateek Kundu Qi Long

Summary Variable selection for structured covariates lying on an underlying known graph is a problem motivated by practical applications, and has been topic of increasing interest. However, most the existing methods may not be scalable to high-dimensional settings involving tens thousands variables pathways such as case in genomics studies. We propose adaptive Bayesian shrinkage approach which incorporates prior network information smoothing parameters connected graph, so that corresponding...

10.1111/biom.12882 article EN Biometrics 2018-05-08

Integrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis

OPENALEX - Publications

Jingxuan Bao Changgee Chang Qiyiwen Zhang Andrew J. Saykin Li Shen and 1 more

Abstract Motivation With the rapid development of modern technologies, massive data are available for systematic study Alzheimer’s disease (AD). Though many existing AD studies mainly focus on single-modality omics data, multi-omics datasets can provide a more comprehensive understanding AD. To bridge this gap, we proposed novel structural Bayesian factor analysis framework (SBFA) to extract information shared by through aggregation genotyping gene expression neuroimaging phenotypes and...

10.1093/bib/bbad073 article EN Briefings in Bioinformatics 2023-03-01

A genetically informed brain atlas for enhancing brain imaging genomics

OPENALEX - Publications

Jingxuan Bao Junhao Wen Changgee Chang Shizhuo Mu Jiong Chen and 14 more

Brain imaging genomics has manifested considerable potential in illuminating the genetic determinants of human brain structure and function. This propelled us to develop GIANT (Genetically Informed brAiN aTlas) that accounts for neuroanatomical variations simultaneously. Integrating voxel-wise heritability spatial proximity, clusters voxels into genetically informed regions, while retaining fundamental anatomical knowledge. Compared conventional (non-genetics) atlases, exhibits smaller...

10.1038/s41467-025-57636-6 article EN cc-by-nc-nd Nature Communications 2025-04-14

Incorporating graph information in Bayesian factor analysis with robust and adaptive shrinkage priors

OPENALEX - Publications

Qiyiwen Zhang Changgee Chang Li Shen Qi Long

There has been an increasing interest in decomposing high-dimensional multi-omics data into a product of low-rank and sparse matrices for the purpose dimension reduction feature engineering. Bayesian factor models achieve such low-dimensional representation original through different sparsity-inducing priors. However, few these can efficiently incorporate information encoded by biological graphs, which already proven to be useful many analysis tasks. In this work, we propose model with novel...

10.1093/biomtc/ujad014 article EN public-domain Biometrics 2024-01-29

Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data

OPENALEX - Publications

Eun Jeong Min Changgee Chang Qi Long

Integrative clustering is a approach for multiple datasets, which provide different views of common group subjects. It enables analyzing multi-omics data jointly to, example, identify the subtypes diseases, cells, and so on, capturing complex underlying biological processes more precisely. On other hand, there has been great deal interest in incorporating prior structural knowledge on features into statistical analyses over past decade. The gene regulatory network (pathways) can potentially...

10.1109/dsaa.2018.00021 article EN 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA) 2018-10-01

Bayesian generalized biclustering analysis via adaptive structured shrinkage

OPENALEX - Publications

Ziyi Li Changgee Chang Suprateek Kundu Qi Long

Biclustering techniques can identify local patterns of a data matrix by clustering feature space and sample at the same time. Various biclustering methods have been proposed successfully applied to analysis gene expression data. While existing many desirable features, most them are developed for continuous few efficiently handle -omics various types, example, binomial as in single nucleotide polymorphism or negative RNA-seq In addition, none utilize biological information such those from...

10.1093/biostatistics/kxy081 article EN Biostatistics 2018-11-27

Knowledge-Guided Bayesian Support Vector Machine for High-Dimensional Data with Application to Analysis of Genomics Data

OPENALEX - Publications

Wenli Sun Changgee Chang Yize Zhao Qi Long

Support vector machine (SVM) is a popular classification method for the analysis of wide range data including big data. Many SVM methods with feature selection have been developed under frequentist regularization or Bayesian shrinkage frameworks. On other hand, importance incorporating priori known biological knowledge, such as gene pathway information which stems from regulatory network, into statistical genomic has recognized in recent years. In this article, we propose new approach that...

10.1109/bigdata.2018.8622484 article EN 2021 IEEE International Conference on Big Data (Big Data) 2018-12-01

Accounting for network noise in graph-guided Bayesian modeling of structured high-dimensional data

OPENALEX - Publications

Wenrui Li Changgee Chang Suprateek Kundu Qi Long

Abstract There is a growing body of literature on knowledge-guided statistical learning methods for analysis structured high-dimensional data (such as genomic and transcriptomic data) that can incorporate knowledge underlying networks derived from functional genomics proteomics. These have been shown to improve variable selection prediction accuracy yield more interpretable results. However, these typically use graphs extracted existing databases or rely subject matter expertise, which are...

10.1093/biomtc/ujae012 article EN Biometrics 2024-01-29

CEDAR: Communication Efficient Distributed Analysis for Regressions

OPENALEX - Publications

Changgee Chang Zhiqi Bu Qi Long

Abstract Electronic health records (EHRs) offer great promises for advancing precision medicine and, at the same time, present significant analytical challenges. Particularly, it is often case that patient-level data in EHRs cannot be shared across institutions (data sources) due to government regulations and/or institutional policies. As a result, there are growing interests about distributed learning over multiple databases without sharing data. To tackle such challenges, we propose novel...

10.1111/biom.13786 article EN Biometrics 2022-10-27

Bayesian network-driven clustering analysis with feature selection for high-dimensional multi-modal molecular data

OPENALEX - Publications

Yize Zhao Changgee Chang Margaret Hannum Jasme Lee Ronglai Shen

Multi-modal molecular profiling data in bulk tumors or single cells are accumulating at a fast pace. There is great need for developing statistical and computational methods to reveal structures complex types toward biological discoveries. Here, we introduce Nebula, novel Bayesian integrative clustering analysis high dimensional multi-modal identify directly interpretable clusters associated biomarkers unified biologically plausible framework. To facilitate efficiency, variational Bayes...

10.1038/s41598-021-84514-0 article EN cc-by Scientific Reports 2021-03-04

Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features

OPENALEX - Publications

Wenli Sun Changgee Chang Qi Long

Support vector machine (SVM) is a popular classification method for analysis of high dimensional data such as genomics data. Recently number linear SVM methods have been developed to achieve feature selection through either frequentist regularization or Bayesian shrinkage, but the assumption may not be plausible many real applications. In addition, recent work has demonstrated that incorporating known biological knowledge, those from functional genomics, into statistical genomic offers great...

10.1109/bigdata47090.2019.9006473 article EN 2021 IEEE International Conference on Big Data (Big Data) 2019-12-01

Scalable Bayesian Variable Selection for Structured High-dimensional Data

OPENALEX - Publications

Changgee Chang Suprateek Kundu Qi Long

Variable selection for structured covariates lying on an underlying known graph is a problem motivated by practical applications, and has been topic of increasing interest. However, most the existing methods may not be scalable to high dimensional settings involving tens thousands variables pathways such as case in genomics studies. We propose adaptive Bayesian shrinkage approach which incorporates prior network information smoothing parameters connected graph, so that corresponding...

10.48550/arxiv.1604.07264 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Robust knowledge-guided biclustering for multi-omics data

OPENALEX - Publications

Qiyiwen Zhang Changgee Chang Qi Long

Abstract Biclustering is a useful method for simultaneously grouping samples and features has been applied across various biomedical data types. However, most existing biclustering methods lack the ability to integratively analyze multi-modal such as multi-omics genome, transcriptome epigenome. Moreover, potential of leveraging biological knowledge represented by graphs, which demonstrated be beneficial in statistical tasks variable selection prediction, remains largely untapped context...

10.1093/bib/bbad446 article EN cc-by Briefings in Bioinformatics 2023-11-22

A Bayesian Latent Class Model to Predict Kidney Obstruction in the Absence of Gold Standard

OPENALEX - Publications

Changgee Chang Jeong Hoon Jang Amita K. Manatunga Andrew Taylor Qi Long

Kidney obstruction, if untreated in a timely manner, can lead to irreversible loss of renal function. A widely used technology for evaluations kidneys with suspected obstruction is diuresis renography. However, it generally very challenging radiologists who typically interpret renography data practice build high level competency due the low volume studies and insufficient training. Another challenge that there currently no gold standard detection kidney obstruction. Seeking develop...

10.1080/01621459.2019.1689983 article EN Journal of the American Statistical Association 2019-11-08

Genetic Underpinnings of Brain Structural Connectome for Young Adults

OPENALEX - Publications

Yize Zhao Changgee Chang Jingwen Zhang Zhengwu Zhang

With distinct advantages in power over behavioral phenotypes, brain imaging traits have become emerging endophenotypes to dissect molecular contributions behaviors and neuropsychiatric illnesses. Among different features, structural connectivity (i.e., connectome) which summarizes the anatomical connections between regions is one of most cutting edge while under-investigated traits; genetic influence on connectome variation remains highly elusive. Relying a landmark genetics study for young...

10.1080/01621459.2022.2156349 article EN Journal of the American Statistical Association 2022-12-07

A Bayesian multiple imputation approach to bivariate functional data with missing components

OPENALEX - Publications

Jeong Hoon Jang Amita K. Manatunga Changgee Chang Qi Long

Existing missing data methods for functional mainly focus on reconstructing measurements along a single function—a univariate setting. Motivated by renal study, we bivariate setting, where each sampling unit is collection of two distinct component functions, one which may be missing. Specifically, propose Bayesian multiple imputation approach based latent factor model that exploits the joint changing patterns functions to allow accurate and stable given other. We further extend framework...

10.1002/sim.9093 article EN Statistics in Medicine 2021-06-08

Coming Soon ...