Lu Zhang

ORCID: 0000-0002-8972-8799
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Ethics and Social Impacts of AI
  • Genetic Associations and Epidemiology
  • Anomaly Detection Techniques and Applications
  • Bayesian Modeling and Causal Inference
  • Imbalanced Data Classification Techniques
  • Explainable Artificial Intelligence (XAI)
  • Time Series Analysis and Forecasting
  • Evolutionary Algorithms and Applications
  • Traffic Prediction and Management Techniques
  • Rough Sets and Fuzzy Logic
  • Network Security and Intrusion Detection
  • Topic Modeling
  • Data Quality and Management
  • Advanced Causal Inference Techniques
  • Data-Driven Disease Surveillance
  • Advanced Graph Neural Networks
  • Machine Learning and Data Classification
  • Adversarial Robustness in Machine Learning
  • Qualitative Comparative Analysis Research
  • Genetics and Neurodevelopmental Disorders
  • Metaheuristic Optimization Algorithms Research
  • Labor Movements and Unions
  • Statistical Methods and Inference
  • Quantum optics and atomic interactions
  • Privacy-Preserving Technologies in Data

University of Arkansas at Fayetteville
2016-2024

University of Southern California
2024

Chengdu University of Information Technology
2024

Quzhou University
2015-2023

University of Chinese Academy of Sciences
2023

Technology and Engineering Center for Space Utilization
2023

Chinese Academy of Sciences
2018-2023

China Telecom
2023

Beijing Jiaotong University
2014-2023

Anhui University of Traditional Chinese Medicine
2023

Fairness-aware learning is increasingly important in data mining. Discrimination prevention aims to prevent discrimination the training before it used conduct predictive analysis. In this paper, we focus on fair generation that ensures generated free. Inspired by generative adversarial networks (GAN), present fairness-aware networks, called FairGAN, which are able learn a generator producing and also preserving good utility. Compared with naive models, FairGAN further classifiers trained can...

10.1109/bigdata.2018.8622525 article EN 2021 IEEE International Conference on Big Data (Big Data) 2018-12-01

In this paper, we investigate the problem of discovering both direct and indirect discrimination from historical data, removing discriminatory effects before data is used for predictive analysis (e.g., building classifiers). The main drawback existing methods that they cannot distinguish part influence really caused by all correlated influences. our approach, make use causal network to capture structure data. Then model as path-specific effects, which accurately identify two types...

10.24963/ijcai.2017/549 article EN 2017-07-28

Based on imbalanced data, the predictive models for 5-year survivability of breast cancer using decision tree are proposed. After data preprocessing from SEER datasets, it is obviously that category distribution imbalanced. Under-sampling taken to make up disadvantage performance caused by data. The evaluated AUC under ROC curve, accuracy, specificity and sensitivity with 10-fold stratified cross-validation. best while approximately equal. Bagging algorithm used build an integration model...

10.1109/icbbe.2009.5162571 article EN 2009-06-01

Fairness-aware learning studies the problem of building machine models that are subject to fairness requirements. Counterfactual is a notion derived from Pearl's causal model, which considers model fair if for particular individual or group its prediction in real world same as counterfactual where individual(s) had belonged different demographic group. However, an inherent limitation it cannot be uniquely quantified observational data certain situations, due unidentifiability quantity. In...

10.24963/ijcai.2019/199 article EN 2019-07-28

Achieving fairness in learning models is currently an imperative task machine learning. Meanwhile, recent research showed that should be studied from the causal perspective, and proposed a number of criteria based on Pearl's modeling framework. In this paper, we investigate problem building fairness-aware generative adversarial networks (CFGAN), which can learn close distribution given dataset, while also ensuring various graph. CFGAN adopts two generators, whose structures are purposefully...

10.24963/ijcai.2019/201 article EN 2019-07-28

Discrimination discovery and prevention/removal are increasingly important tasks in data mining. aims to unveil discriminatory practices on the protected attribute (e.g., gender) by analyzing dataset of historical decision records, discrimination prevention remove modifying biased before conducting predictive analysis. In this paper, we show that key is find meaningful partitions can be used provide quantitative evidences for judgment discrimination. With support causal graph, present a...

10.1145/3097983.3098167 preprint EN 2017-08-04

10.1007/s41060-017-0058-x article EN International Journal of Data Science and Analytics 2017-05-18

Anti-discrimination is an increasingly important task in data science. In this paper, we investigate the problem of discovering both direct and indirect discrimination from historical data, removing discriminatory effects before are used for predictive analysis (e.g., building classifiers). The main drawback existing methods that they cannot distinguish part influence really caused by all correlated influences. our approach, make use causal graph to capture structure data. Then, model as...

10.1109/tkde.2018.2872988 article EN publisher-specific-oa IEEE Transactions on Knowledge and Data Engineering 2019-10-08

A recent trend of fair machine learning is to define fairness as causality-based notions which concern the causal connection between protected attributes and decisions. However, one common challenge all identifiability, i.e., whether they can be uniquely measured from observational data, a critical barrier applying these real-world situations. In this paper, we develop framework for measuring different fairness. We propose unified definition that covers most previous notions, namely...

10.48550/arxiv.1910.12586 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Most of the spacecraft telemetry anomaly detection methods based on statistical models suffer from problems high false negatives, long time consumption, and poor interpretability. Besides, complex interactions, which may determine propagation anomalous mode between parameters, are often ignored. To discover interaction parameters improve efficiency accuracy detection, we propose an framework parametric causality Double-Criteria Drift Streaming Peaks Over Threshold (DCDSPOT). We Normalized...

10.3390/app12041803 article EN cc-by Applied Sciences 2022-02-09

Predictive models learned from historical data are widely used to help companies and organizations make decisions. However, they may digitally unfairly treat unwanted groups, raising concerns about fairness discrimination. In this paper, we study the fairness-aware ranking problem which aims discover discrimination in ranked datasets reconstruct fair ranking. Existing methods mainly based on statistical parity that cannot measure true discriminatory effect since is causal. On other hand,...

10.1145/3219819.3220087 article EN 2018-07-19

In this paper, we study the fairness-aware classification problem by formulating it as a constrained optimization problem. Several limitations exist in previous works due to lack of theoretical framework for guiding formulation. We propose general address limitations. Our provides: (1) various fairness metrics that can be incorporated into classic models constraints; (2) convex solved efficiently; and (3) lower upper bounds real-world measures are established using surrogate functions,...

10.1145/3308558.3313723 article EN 2019-05-13

Automatic heuristic design through reinforcement learning opens a promising direction for solving computationally difficult problems. Unlike most previous works that aimed at solution construction, we explore the possibility of acquiring local search heuristics massive experiments. To illustrate applicability, an agent is trained to perform walk in space by selecting candidate neighbor each step. Specifically, target floorplanning problem, where generated perturbing sequence pair encoding...

10.1109/iccd50377.2020.00061 article EN 2022 IEEE 40th International Conference on Computer Design (ICCD) 2020-10-01

Investigating causality to establish novel criteria for training robust natural language processing (NLP) models is an active research area. However, current methods face various challenges such as the difficulties in identifying keyword lexicons and obtaining data from multiple labeled environments. In this paper, we study problem of NLP a complementary but different angle: treat behavior attack model complex causal mechanism quantify its algorithmic information using minimum description...

10.3390/e26050354 article EN cc-by Entropy 2024-04-24

In discrimination-aware classification, the pre-process methods for constructing a discrimination-free classifier first remove discrimination from training data, and then learn cleaned data. However, they lack theoretical guarantee potential when is deployed prediction. this paper, we fill gap by mathematically bounding in We adopt causal model modeling data generation mechanism, formally defining population, dataset, obtain two important results: (1) prediction can still exist even if...

10.24963/ijcai.2018/430 preprint EN 2018-07-01

Multi-mode NOON states have been attracting increasing attentions recently for their abilities of obtaining supersensitive and superresolved measurements simultaneous multiple-phase estimation. In this paper, four different methods generating multi-mode with a high photon number were proposed. The first method is linear optical approach that makes use the Fock state filtration to reduce lower-order terms from coherent inputs, which are jointly combined produce triggering multi-fold...

10.1038/s41598-018-29828-2 article EN cc-by Scientific Reports 2018-07-24

Fair machine learning is receiving an increasing attention in fields. Researchers fair have developed correlation or association-based measures such as demographic disparity, mistreatment calibration, causal-based total effect, direct and indirect discrimination, counterfactual fairness, fairness notions equality of opportunity equalized odds that consider both decisions the training data made by predictive models. In this paper, we develop a new notation, called effort. Different from...

10.1145/3366424.3383558 article EN Companion Proceedings of the The Web Conference 2018 2020-04-20

In online recommendation, customers arrive in a sequential and stochastic manner from an underlying distribution the decision model recommends chosen item for each arriving individual based on some strategy. We study how to recommend at step maximize expected reward while achieving user-side fairness customers, i.e., who share similar profiles will receive regardless of their sensitive attributes items being recommended. By incorporating causal inference into bandits adopting soft...

10.1609/aaai.v36i6.20653 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

In the urban transportation system, unbalanced relationship between taxi demand and number of running taxis reduces drivers' income levels passengers' satisfaction. With help vehicular global positioning system (GPS) data, distribution city can be analyzed to provide advice for drivers. A clustering algorithm called Density-Based Spatial Clustering Applications with Noise (DBSCAN) is suitable discovering hotspots. However, execution efficiency still a big challenge when DBSCAN applied on...

10.1109/vtcfall.2016.7881010 article EN 2021 IEEE 94th Vehicular Technology Conference (VTC2021-Fall) 2016-09-01
Coming Soon ...