NFDI4DS | UHH-SEMS - Publication Details

Backdoor Attacks to Graph Neural Networks

OPENALEX - Publications

Zaixi Zhang Jinyuan Jia Binghui Wang Neil Zhenqiang Gong

In this work, we propose the first backdoor attack to graph neural networks (GNN). Specifically, we propose a subgraph based backdoor attack to GNN for graph classification. In our attack, a GNN classifier predicts an attacker-chosen target label for a testing graph once a predefined subgraph is injected to the testing graph. Our empirical results on three real-world graph datasets show that our backdoor attacks are effective with a small impact on a GNN's prediction accuracy for clean graphs. Moreover, we generalize a randomized smoothing based certified defense to defend against our backdoor attacks. Our empirical results show that the defense is effective in some cases but...

10.1145/3450569.3463560 article EN 2021-06-11

FLDetector: Defending Federated Learning Against Model Poisoning Attacks via Detecting Malicious Clients

Zaixi Zhang Xiaoyu Cao Jinyuan Jia Neil Zhenqiang Gong

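Federated learning (FL) is vulnerable to model poisoning attacks, in which malicious clients corrupt the global model via sending manipulated model updates to the server. Existing defenses mainly rely on Byzantine-robust or provably robust FL methods, which aim to learn an accurate global model even if some clients are malicious. However, they can only resist a small number of malicious clients. It is still an open challenge how to defend against model poisoning attacks with a large number of malicious clients. Our FLDetector addresses this challenge via detecting malicious clients. FLDetector aims to detect and remove the majority of the malicious clients such that a Byzantine-robust or provably robust FL method can learn an accurate global model using...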

10.1145/3534678.3539231 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

Motif-based Graph Self-Supervised Learning for Molecular Property Prediction

Zaixi Zhang Qi Liu Hao Wang Chengqiang Lu Chee‐Kong Lee

Predicting molecular properties with data-driven methods has drawn much attention in recent years. Particularly, Graph Neural Networks (GNNs) have demonstrated remarkable success in various molecular generation and prediction tasks. In cases where labeled data is scarce, GNNs can be pre-trained on unlabeled molecular data to first learn the general semantic and structural information before being fine-tuned for specific tasks. However, most existing self-supervised pre-training frameworks for GNNs only focus on node-level or graph-level...

10.48550/arxiv.2110.00987 preprint EN cc-by-sa arXiv (Cornell University) 2021-01-01

ProtGNN: Towards Self-Explaining Graph Neural Networks

Zaixi Zhang Qi Liu Hao Wang Chengqiang Lu Chee‐Kong Lee

Despite the recent progress in Graph Neural Networks (GNNs), it remains challenging to explain the predictions made by GNNs. Existing explanation methods mainly focus on post-hoc explanations where another explanatory model is employed to provide explanations for a trained GNN. The fact that post-hoc methods fail to reveal the original reasoning process of GNNs raises the need of building GNNs with built-in interpretability. In this work, we propose Prototype Graph Neural Network (ProtGNN), which combines prototype learning with GNNs and provides a new perspective...

10.1609/aaai.v36i8.20898 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

FLCert: Provably Secure Federated Learning Against Poisoning Attacks

Xiaoyu Cao Zaixi Zhang Jinyuan Jia Neil Zhenqiang Gong

Due to its distributed nature, federated learning is vulnerable to poisoning attacks, in which malicious clients poison the training process via manipulating their local training data and/or local model updates sent to the cloud server, such that the poisoned global model misclassifies many indiscriminate test inputs or attacker-chosen ones. Existing defenses mainly leverage Byzantine-robust federated learning methods or detect malicious clients. However, these defenses do not have provable security guarantees against poisoning attacks and may be vulnerable to more advanced attacks. In...

10.1109/tifs.2022.3212174 article EN IEEE Transactions on Information Forensics and Security 2022-01-01

FedRecover: Recovering from Poisoning Attacks in Federated Learning using Historical Information

Xiaoyu Cao Jinyuan Jia Zaixi Zhang Neil Zhenqiang Gong

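Federated learning is vulnerable to poisoning attacks in which malicious clients poison the global model via sending malicious model updates to the server. Existing defenses focus on preventing a small number of malicious clients from poisoning the global model via robust federated learning methods and detecting malicious clients when there are a large number of them. However, it is still an open challenge how to recover the global model from poisoning attacks after the malicious clients are detected. A naive solution is to remove the detected malicious clients and train a new global model from scratch using the remaining clients. However, such a train-from-scratch recovery method incurs a large computation and communication cost, which may be...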

10.1109/sp46215.2023.10179336 article EN 2023 IEEE Symposium on Security and Privacy (SP) 2023-05-01

Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense

Yang Yu Qi Liu Likang Wu Runlong Yu Sanshi Lei Yu and 1 more

Federated recommendation (FedRec) can train personalized recommenders without collecting user data, but the decentralized nature makes it susceptible to poisoning attacks. Most previous studies focus on the targeted attack to promote certain items, while the untargeted attack that aims to degrade the overall performance of the FedRec system remains less explored. In fact, untargeted attacks can disrupt the user experience and bring severe financial loss to the service provider. However, existing untargeted attack methods are either inapplicable or ineffective against...

10.1609/aaai.v37i4.25611 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Model Inversion Attacks Against Graph Neural Networks

Zaixi Zhang Qi Liu Zhenya Huang Hao Wang Chee‐Kong Lee and 1 more

Many data mining tasks rely on graphs to model relational structures among individuals (nodes). Since relational data are often sensitive, there is an urgent need to evaluate the privacy risks in graph data. One famous privacy attack against data analysis models is the model inversion attack, which aims to infer sensitive data in the training dataset and leads to great privacy concerns. Despite its success in grid-like domains, directly applying model inversion attacks on non-grid domains such as graph leads to poor attack performance. This is mainly due to the failure to consider the unique properties of graphs. To...

10.1109/tkde.2022.3207915 article EN IEEE Transactions on Knowledge and Data Engineering 2022-09-19

GraphMI: Extracting Private Graph Data from Graph Neural Networks

Zaixi Zhang Qi Liu Zhenya Huang Hao Wang Chengqiang Lu and 2 more

As machine learning becomes more widely used for critical applications, the need to study its implications in privacy becomes urgent. Given access to the target model and auxiliary information, the model inversion attack aims to infer sensitive features of the training dataset, which leads to great privacy concerns. Despite its success in the grid domain, directly applying model inversion techniques on non-grid domains such as graph achieves poor attack performance due to the difficulty to fully exploit the intrinsic properties of graphs and attributes of nodes used in GNN models. To bridge this gap, we...

10.24963/ijcai.2021/516 article EN 2021-08-01

Backdoor Defense via Deconfounded Representation Learning

Zaixi Zhang Qi Liu Zhicai Wang Zepu Lu Qingyong Hu

Deep neural networks (DNNs) are recently shown to be vulnerable to backdoor attacks, where attackers embed hidden backdoors in the DNN model by injecting a few poisoned examples into the training dataset. While extensive efforts have been made to detect and remove backdoors from backdoored DNNs, it is still not clear whether a backdoor-free clean model can be directly obtained from poisoned datasets. In this paper, we first construct a causal graph to model the generation process of the poisoned data and find that the backdoor attack acts as the confounder, which brings spurious...

10.1109/cvpr52729.2023.01177 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Hierarchical Graph Transformer with Adaptive Node Sampling

Zaixi Zhang Qi Liu Qingyong Hu Chee‐Kong Lee

The Transformer architecture has achieved remarkable success in a number of domains including natural language processing and computer vision. However, when it comes to graph-structured data, transformers have not achieved competitive performance, especially on large graphs. In this paper, we identify the main deficiencies of current graph transformers: (1) Existing node sampling strategies in Graph Transformers are agnostic to the graph characteristics and the training process. (2) Most sampling strategies only focus on local neighbors and neglect...

10.48550/arxiv.2210.03930 preprint EN cc-by-sa arXiv (Cornell University) 2022-01-01

An equivariant generative framework for molecular graph-structure Co-design

Zaixi Zhang Qi Liu Chee‐Kong Lee Chang‐Yu Hsieh Enhong Chen

Designing molecules with desirable physiochemical properties and functionalities is a long-standing challenge in chemistry, material science, and drug discovery. Recently, machine learning-based generative models have emerged as promising approaches for...

10.1039/d3sc02538a article EN cc-by-nc Chemical Science 2023-01-01

BioMiner: A Multi-modal System for Automated Mining of Protein-Ligand Bioactivity Data from Literature

Jiaxian Yan Jintao Zhu Yuhang Yang Qi Liu Kai Zhang and 6 more

Protein-ligand bioactivity data published in literature are essential for drug discovery, yet manual curation struggles to keep pace with rapidly growing literature. Automated bioactivity extraction is challenging due to the multi-modal distribution of information (text, tables, figures, structures) and the complexity of chemical representations (e.g., Markush structures). Furthermore, the lack of standardized benchmarks impedes the evaluation and development of extraction methods. In this work, we introduce BioMiner, a multi-modal system designed...

10.1101/2025.04.22.648951 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2025-04-24

Binding-Adaptive Diffusion Models for Structure-Based Drug Design

Zhilin Huang L. Yang Zaixi Zhang Xiangxin Zhou Yu Bao and 4 more

Structure-based drug design (SBDD) aims to generate 3D ligand molecules that bind to specific protein targets. Existing 3D deep generative models including diffusion models have shown great promise for SBDD. However, it is complex to capture the essential protein-ligand interactions exactly in 3D space for molecular generation. To address this problem, we propose a novel framework, namely Binding-Adaptive Diffusion Models (BindDM). In BindDM, we adaptively extract subcomplex, the essential part of binding sites responsible...

10.1609/aaai.v38i11.29162 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Efficient generation of protein pockets with PocketGen

Zaixi Zhang Wan Xiang Shen Qi Liu Marinka Žitnik

Designing protein-binding proteins is critical for drug discovery. However, artificial-intelligence-based design of such proteins is challenging due to the complexity of protein–ligand interactions, the flexibility of ligand molecules and amino acid side chains, and sequence–structure dependencies. We introduce PocketGen, a deep generative model that produces the residue sequence and atomic structure of the protein regions in which protein–ligand interactions occur. PocketGen promotes consistency between protein sequence and structure by using a graph transformer...

10.1038/s42256-024-00920-9 article EN cc-by Nature Machine Intelligence 2024-11-15

A Call for Built-in Biosecurity Safeguards for Generative AI Tools

Mengdi Wang Zaixi Zhang Amrit Singh Bedi Saulo Philipe Sebastião Guerra Sheng Lin-Gibson and 6 more

The rapid adoption of generative AI (GenAI) in biotechnology offers immense potential but also raises serious safety concerns. AI models for protein engineering, genome editing, and molecular synthesis can be misused to enhance viral virulence, design toxins, or modify human embryos, while ethical and policy discussions lag behind technological advances. This Correspondence calls for proactive, built-in, AI-native safeguards within GenAI tools. With more research and development, emerging...

10.20944/preprints202503.1761.v1 preprint EN 2025-03-26

ATOMICA: Learning Universal Representations of Intermolecular Interactions

Ada Fang Zaixi Zhang Xiao‐Hua Zhou Marinka Žitnik

Molecular interactions underlie nearly all biological processes, but most machine learning models treat molecules in isolation or specialize in a single type of interaction, such as protein-ligand or protein-protein binding. This siloed approach prevents generalization across biomolecular classes and limits the ability to model interaction interfaces systematically. We introduce ATOMICA, a geometric deep learning model that learns atomic-scale representations of intermolecular interfaces across diverse modalities, including small...

10.1101/2025.04.02.646906 preprint EN cc-by 2025-04-08

A Call for Built-in Biosecurity Safeguards for Generative AI Tools

Mengdi Wang Zaixi Zhang Amrit Singh Bedi Saulo Philipe Sebastião Guerra Sheng Lin-Gibson and 6 more

10.2139/ssrn.5187173 preprint EN 2025-01-01

Learning Subpocket Prototypes for Generalizable Structure-based Drug Design

Zaixi Zhang Qi Liu

Generating molecules with high binding affinities to target proteins (a.k.a. structure-based drug design) is a fundamental and challenging task in drug discovery. Recently, deep generative models have achieved remarkable success in generating 3D molecules conditioned on the protein pocket. However, most existing methods consider molecular generation for protein pockets independently while neglecting the underlying connections such as subpocket-level similarities. Subpockets are the local protein environments of ligand fragments...

10.48550/arxiv.2305.13997 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

PocketGen: Generating Full-Atom Ligand-Binding Protein Pockets

Zaixi Zhang Wan Xiang Shen Qi Liu Marinka Žitnik

Designing protein-binding proteins is critical for drug discovery. However, the AI-based design of such proteins is challenging due to the complexity of ligand-protein interactions, the flexibility of ligand molecules and amino acid side chains, and sequence-structure dependencies. We introduce PocketGen, a deep generative model that simultaneously produces both the residue sequence and atomic structure of the protein regions where ligand interactions occur. PocketGen ensures consistency between sequence and structure by using a graph transformer for structural...

10.1101/2024.02.25.581968 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-02-28

ProtGNN: Towards Self-Explaining Graph Neural Networks

Zaixi Zhang Qi Liu Hao Wang Chengqiang Lu Chee‐Kong Lee

Despite the recent progress in Graph Neural Networks (GNNs), it remains challenging to explain the predictions made by GNNs. Existing explanation methods mainly focus on post-hoc explanations where another explanatory model is employed to provide explanations for a trained GNN. The fact that post-hoc methods fail to reveal the original reasoning process of GNNs raises the need of building GNNs with built-in interpretability. In this work, we propose Prototype Graph Neural Network (ProtGNN), which combines prototype learning with GNNs and provides a new perspective...

10.48550/arxiv.2112.00911 preprint EN cc-by arXiv (Cornell University) 2021-01-01

A Systematic Survey in Geometric Deep Learning for Structure-based Drug Design

Zaixi Zhang Jiaxian Yan Qi Liu Enhong Chen

Structure-based drug design (SBDD) utilizes the three-dimensional geometry of proteins to identify potential drug candidates. Traditional methods, grounded in physicochemical modeling and informed by domain expertise, are resource-intensive. Recent developments in geometric deep learning, focusing on the integration and processing of 3D geometric data, coupled with the availability of accurate protein 3D structure predictions from tools like AlphaFold, have greatly advanced the field of structure-based drug design. This paper systematically...

10.48550/arxiv.2306.11768 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

RNAGenesis: Foundation Model for Enhanced RNA Sequence Generation and Structural Insights

Zaixi Zhang Chao Liu Ruofan Jin Yikun Zhang Guo‐Wei Zhou and 8 more

RNA molecules play an essential role in a wide range of biological processes. Gaining a deeper understanding of their functions can significantly advance our knowledge of life's mechanisms and drive the development of drugs for various diseases. Recently, advances in RNA foundation models have enabled new approaches to RNA engineering, yet existing methods fall short in generating novel sequences with specific functions. Here, we introduce RNAGenesis, a foundation model that combines RNA sequence understanding and de novo design through...

10.1101/2024.12.30.630826 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-12-31

Backdoor Attacks to Graph Neural Networks

Zaixi Zhang Jinyuan Jia Binghui Wang Neil Zhenqiang Gong

In this work, we propose the first backdoor attack to graph neural networks (GNN). Specifically, we propose a subgraph based backdoor attack to GNN for graph classification. In our attack, a GNN classifier predicts an attacker-chosen target label for a testing graph once a predefined subgraph is injected to the testing graph. Our empirical results on three real-world graph datasets show that our backdoor attacks are effective with a small impact on a GNN's prediction accuracy for clean graphs. Moreover, we generalize a randomized smoothing based certified defense to defend against our backdoor attacks. Our empirical results show that the defense is effective in...

10.48550/arxiv.2006.11165 preprint EN cc-by arXiv (Cornell University) 2020-01-01