Xuan Bi

ORCID: 0000-0002-4683-1411
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Recommender Systems and Techniques
  • Tensor decomposition and applications
  • Advanced Neuroimaging Techniques and Applications
  • Privacy-Preserving Technologies in Data
  • Computational Physics and Python Applications
  • Statistical Methods and Bayesian Inference
  • Advanced Text Analysis Techniques
  • Intelligent Tutoring Systems and Adaptive Learning
  • Seismic Imaging and Inversion Techniques
  • Statistical Methods and Inference
  • Cognitive Science and Mapping
  • Advanced Bandit Algorithms Research
  • Genetic Associations and Epidemiology
  • Energy Load and Power Forecasting
  • Sparse and Compressive Sensing Techniques
  • Face and Expression Recognition
  • Traffic Prediction and Management Techniques
  • Machine Learning and Data Classification
  • Influenza Virus Research Studies
  • Stock Market Forecasting Methods
  • Neural Networks and Applications
  • Urinary Tract Infections Management
  • Blockchain Technology Applications and Security
  • Genetic and phenotypic traits in livestock
  • Music and Audio Processing

University of Minnesota
2017-2024

Decision Sciences (United States)
2019-2024

University of Minnesota System
2024

Twin Cities Orthopedics
2020-2023

University of Illinois Urbana-Champaign
2014-2021

University at Buffalo, State University of New York
2018

Yunnan University
2018

Yale University
2017-2018

University of Illinois System
2018

Anhui Normal University
2015

Recommender systems have been widely adopted by electronic commerce and entertainment industries for individualized prediction recommendation, which benefit consumers improve business intelligence. In this article, we propose an innovative method, namely the recommendation engine of multilayers (REM), tensor recommender systems. The proposed method utilizes structure a response to integrate information from multiple modes, creates additional layer nested latent factors accommodate...

10.1214/17-aos1659 article EN The Annals of Statistics 2018-09-11

Because of the accessibility big data collections from consumers, products, and stores, advanced sales forecasting capabilities have drawn great attention many businesses, especially those in retail, because importance decision making. Improvement accuracy, even by a small percentage, may substantial impact on companies’ production financial planning, marketing strategies, inventory controls, supply chain management. Specifically, our research goal is to forecast each product store near...

10.1287/ijoc.2021.1147 article EN INFORMS journal on computing 2022-02-04

This article provides an overview of tensors, their properties, and applications in statistics. Tensors, also known as multidimensional arrays, are generalizations matrices to higher orders useful data representation architectures. We first review basic tensor concepts decompositions, then we elaborate traditional recent tensors the fields recommender systems imaging analysis. illustrate for network explore relations among interacting units a complex system. Some canonical computational...

10.1146/annurev-statistics-042720-020816 article EN Annual Review of Statistics and Its Application 2020-08-14

In recent years, there has been a growing demand to develop efficient recommender systems which track users' preferences and recommend potential items of interest users. this article, we propose group-specific method use dependency information from users share similar characteristics under the singular value decomposition framework. The new approach is effective for "cold-start" problem, where, in testing set, majority responses are obtained or items, their preference not available training...

10.1080/01621459.2016.1219261 article EN Journal of the American Statistical Association 2016-09-02

Heritability is well documented for psychiatric disorders and cognitive abilities which are, however, complex, involving both genetic environmental factors. Hence, it remains challenging to discover how variations contribute such complex traits. In this article, they propose use mediation analysis bridge gap, where neuroimaging phenotypes were utilized as intermediate variables. The Philadelphia Neurodevelopmental Cohort was investigated using genome-wide association studies (GWAS) analyses....

10.1002/hbm.23650 article EN publisher-specific-oa Human Brain Mapping 2017-05-22

On short video sharing platforms, users often choose background music for their videos. In this paper, we study the problem of recommendation videos on platforms. our setting, item (music) is not recommended directly to user, but created by user. When making recommendations videos, consider three important players: users, and music. We define a unique design novel model address problem. propose based deep learning framework effectively distinctive three-way relationships among Our considers...

10.1287/isre.2022.0093 article EN Information Systems Research 2024-01-31

This work is motivated by multimodality breast cancer imaging data, which quite challenging in that the signals of discrete tumor-associated microvesicles are randomly distributed with heterogeneous patterns. imposes a significant challenge for conventional regression and dimension reduction models assuming homogeneous feature structure. We develop an innovative multilayer tensor learning method to incorporate heterogeneity higher-order decomposition predict disease status effectively...

10.1080/01621459.2019.1585254 article EN Journal of the American Statistical Association 2019-03-22

Recommender systems have been extensively used by the entertainment industry, business marketing and biomedical industry. In addition to its capacity of providing preference-based recommendations as an unsupervised learning methodology, it has also proven useful in sales forecasting, product introduction other production related businesses. Since some consumers companies need a recommendation or prediction for future budget, labor supply chain coordination, dynamic recommender precise...

10.48550/arxiv.2003.05568 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Limited access to large-scale data is a key obstacle building machine learning (ML) applications in practice, partly due reluctance of information exchange among owners out privacy and security concerns. To address this “information silo” problem, federated (FL) techniques have been proposed enable decentralized model training via an orchestrating central server received increasing attention several industries (including healthcare finance). Despite its superior protection property, adoption...

10.1287/mnsc.2023.00611 article EN Management Science 2023-10-17

10.1016/j.jeconom.2022.05.004 article EN Journal of Econometrics 2022-06-18

Brain-imaging data have been increasingly used to understand intellectual disabilities. Despite significant progress in biomedical research, the mechanisms for most of disabilities remain unknown. Finding underlying neurological has proved difficult, especially children due rapid development their brains. We investigate verbal reasoning, which is a reliable measure individuals' general abilities, and develop class high-order imaging regression models identify brain subregions might be...

10.1080/01621459.2020.1766468 article EN Journal of the American Statistical Association 2020-05-15

Correlation structure contains important information about longitudinal data. Existing sufficient dimension reduction approaches assuming independence may lead to substantial loss of efficiency. We apply the quadratic inference function incorporate correlation and transformation method recover central subspace. The proposed estimators are shown be consistent more efficient than ones independence. In addition, esti- mated subspace is also when taken into account. compare with other through...

10.5705/ss.2013.168 article EN Statistica Sinica 2014-06-05

We propose a novel multivariate model for analyzing hybrid traits and identifying genetic factors comorbid conditions. Comorbidity is common phenomenon in mental health which an individual suffers from multiple disorders simultaneously. For example, the Study of Addiction: Genetics Environment (SAGE), alcohol nicotine addiction were recorded through assessments that we refer to as traits. Statistical inference studying basis has not been well developed. Recent rank-based methods have...

10.1214/18-aoas1156 article EN The Annals of Applied Statistics 2018-11-13

Understanding the functional mechanisms of complex biological system as a whole is drawing more and attention in global health care management. Traditional Chinese Medicine (TCM), essentially different from Western (WM), gaining increasing due to its emphasis on individual wellness natural herbal medicine, which satisfies goal integrative medicine. However, with explosive growth biomedical data Web, researchers are now confronted problem large-scale analysis query. Besides that, also has...

10.1155/2014/957231 article EN Computational and Mathematical Methods in Medicine 2014-01-01

Bias in adolescent self-reported height and weight is well documented. Given the importance widespread use of National Longitudinal Study Adolescent to Adult Health (Add Health) data for obesity research, we developed tested feasibility validity an empirically derived statistical correction self-report bias wave 1 (W1) Add Health, a large panel study United States. Participants grades 7–12 with complete at W1 were included (n = 20,175). We used measured (SR) relevant biopsychosocial factors...

10.1186/s12874-016-0227-y article EN cc-by BMC Medical Research Methodology 2016-09-22

In this study, we present an application paradigm in which unsupervised machine learning approach is applied to the high-dimensional influenza genetic sequences investigate whether vaccine a driving force evolution of virus. We first used visualization visualize evolutionary paths vaccine-controlled and non-vaccine-controlled viruses low-dimensional space. then quantified differences between their trajectories through use within- between-scatter matrices computation provide statistical...

10.1089/cmb.2017.0025 article EN Journal of Computational Biology 2017-07-07

How to acquire the most valuable consumers grow your recommender system? We propose a dynamic consumer acquisition model enable value-driven decisions. build of that takes into account value contributes system, cost their participation (e.g., privacy loss), and other (via network externality). also data-driven procedures estimate this informed, On three different data sets, we perform comprehensive simulation-based evaluations demonstrate performance model. find nuanced relationships between...

10.1287/isre.2023.1229 article EN Information Systems Research 2023-05-15

Yu Wanga, Xuan Bib & Annie Quc* a Electrical and Computer Engineering, Duke University, Durham, NC; b Carlson School of Management, University Minnesota, Minneapolis, MN; c Department Statistics, Illinois at Urbana-Champaign, Champaign, IL

10.1080/10618600.2019.1665535 article EN Journal of Computational and Graphical Statistics 2019-09-11

Safeguarding intellectual property and preventing potential misuse of AI-generated images are paramount importance. This paper introduces a robust agile plug-and-play watermark detection framework, dubbed as RAW. As departure from traditional encoder-decoder methods, which incorporate fixed binary codes watermarks within latent representations, our approach learnable directly into the original image data. Subsequently, we employ classifier that is jointly trained with to detect presence...

10.48550/arxiv.2403.18774 preprint EN arXiv (Cornell University) 2024-01-23

Retrieval-Augmented Generation (RAG) has been empirically shown to enhance the performance of large language models (LLMs) in knowledge-intensive domains such as healthcare, finance, and legal contexts. Given a query, RAG retrieves relevant documents from corpus integrates them into LLMs' generation process. In this study, we investigate adversarial robustness RAG, focusing specifically on examining retrieval system. First, across 225 different setup combinations corpus, retriever, targeted...

10.48550/arxiv.2409.17275 preprint EN arXiv (Cornell University) 2024-09-11

Most classification techniques in machine learning are able to produce probability predictions addition class predictions. However, these predicted probabilities often not well calibrated that they deviate from the actual outcome rates (i.e., proportion of data instances actually belong a certain class). A lack calibration can jeopardize downstream decision tasks rely on accurate Although several post hoc methods have been proposed, generally do consider potentially asymmetric costs...

10.1287/ijds.2024.0038 article EN INFORMS Journal on Data Science 2024-12-09

Federated learning (FL) is a privacy-preserving technique that enables distributed computing devices to train shared models across data silos collaboratively. Existing FL works mostly focus on designing advanced algorithms improve the model performance. However, economic considerations of clients, such as fairness and incentive, are yet be fully explored. Without considerations, self-motivated clients may lose interest leave federation. To address this problem, we designed novel incentive...

10.48550/arxiv.2302.08976 preprint EN cc-by arXiv (Cornell University) 2023-01-01

The growing dependence on machine learning in real-world applications emphasizes the importance of understanding and ensuring its safety. Backdoor attacks pose a significant security risk due to their stealthy nature potentially serious consequences. Such involve embedding triggers within model with intention causing malicious behavior when an active trigger is present while maintaining regular functionality without it. This paper evaluates effectiveness any backdoor attack incorporating...

10.48550/arxiv.2310.10780 preprint EN other-oa arXiv (Cornell University) 2023-01-01
Coming Soon ...