NFDI4DS | UHH-SEMS - Publication Details

Zhanxing Zhu

ORCID: 0000-0002-2141-6553

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5045305860

Research Areas

Adversarial Robustness in Machine Learning
Advanced Neural Network Applications
Anomaly Detection Techniques and Applications
Domain Adaptation and Few-Shot Learning
Stochastic Gradient Optimization Techniques
Gaussian Processes and Bayesian Inference
Markov Chains and Monte Carlo Methods
Neural Networks and Applications
Sparse and Compressive Sensing Techniques
Advanced Graph Neural Networks
Generative Adversarial Networks and Image Synthesis
Machine Learning and ELM
Model Reduction and Neural Networks
Multimodal Machine Learning Applications
Machine Learning and Algorithms
Blind Source Separation Techniques
Fault Detection and Control Systems
Statistical Methods and Inference
Traffic Prediction and Management Techniques
Advanced Malware Detection Techniques
Machine Learning and Data Classification
Bayesian Methods and Mixture Models
Topic Modeling
Complex Network Analysis Techniques
Medical Image Segmentation Techniques

University of Southampton
2024-2025

Peking University
2009-2023

Baidu (China)
2023

Beijing Institute of Big Data Research
2018-2021

Shanghai Jiao Tong University
1983-2021

Beijing Haidian Hospital
2019-2021

King University
2019

University of Edinburgh
2013-2016

Aalto University
2010-2013

Beihang University
2009-2011

Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting

OPENALEX - Publications

Bing Yu Haoteng Yin Zhanxing Zhu

Timely accurate traffic forecast is crucial for urban control and guidance. Due to the high nonlinearity complexity of flow, traditional methods cannot satisfy requirements mid-and-long term prediction tasks often neglect spatial temporal dependencies. In this paper, we propose a novel deep learning framework, Spatio-Temporal Graph Convolutional Networks (STGCN), tackle time series problem in domain. Instead applying regular convolutional recurrent units, formulate on graphs build model with...

10.24963/ijcai.2018/505 preprint EN 2018-07-01

Spatial-Temporal Fusion Graph Neural Networks for Traffic Flow Forecasting

OPENALEX - Publications

Mengzhang Li Zhanxing Zhu

Spatial-temporal data forecasting of traffic flow is a challenging task because complicated spatial dependencies and dynamical trends temporal pattern between different roads. Existing frameworks usually utilize given adjacency graph sophisticated mechanisms for modeling correlations. However, limited representations structure with incomplete adjacent connections may restrict effective spatial-temporal learning those models. Furthermore, existing methods were out at elbows when solving data:...

10.1609/aaai.v35i5.16542 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labeled Nodes

OPENALEX - Publications

Ke Sun Zhouchen Lin Zhanxing Zhu

Graph Convolutional Networks (GCNs) play a crucial role in graph learning tasks, however, embedding with few supervised signals is still difficult problem. In this paper, we propose novel training algorithm for Network, called Multi-Stage Self-Supervised (M3S) Training Algorithm, combined self-supervised approach, focusing on improving the generalization performance of GCNs graphs labeled nodes. Firstly, Framework provided as basis M3S method. Then leverage DeepCluster technique, popular...

10.1609/aaai.v34i04.6048 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Reinforced Continual Learning

OPENALEX - Publications

Ju Xu Zhanxing Zhu

Most artificial intelligence models have limiting ability to solve new tasks faster, without forgetting previously acquired knowledge. The recently emerging paradigm of continual learning aims this issue, in which the model learns various a sequential fashion. In work, novel approach for is proposed, searches best neural architecture each coming task via sophisticatedly designed reinforcement strategies. We name it as Reinforced Continual Learning. Our method not only has good performance on...

10.48550/arxiv.1805.12369 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes

OPENALEX - Publications

Lei Wu Zhanxing Zhu E Weinan

It is widely observed that deep learning models with learned parameters generalize well, even much more model than the number of training samples. We systematically investigate underlying reasons why neural networks often and reveal difference between minima (with same error) well those they don't. show it characteristics landscape loss function explains good generalization capability. For for networks, volume basin attraction dominates over poor minima, which guarantees optimization methods...

10.48550/arxiv.1706.10239 preprint EN public-domain arXiv (Cornell University) 2017-01-01

Yet Another Text Captcha Solver

OPENALEX - Publications

Guixin Ye Zhanyong Tang Dingyi Fang Zhanxing Zhu Yansong Feng and 3 more

Despite several attacks have been proposed, text-based CAPTCHAs are still being widely used as a security mechanism. One of the reasons for pervasive use text captchas is that many prior scheme-specific and require labor-intensive time-consuming process to construct. This means change in captcha features like noisier background can simply invalid an earlier attack. paper presents generic, yet effective solver based on generative adversarial network. Unlike machine-learning-based approaches...

10.1145/3243734.3243754 article EN Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security 2018-10-15

Efficient Neural Architecture Search via Proximal Iterations

OPENALEX - Publications

Quanming Yao Ju Xu Wei-Wei Tu Zhanxing Zhu

Neural architecture search (NAS) attracts much research attention because of its ability to identify better architectures than handcrafted ones. Recently, differentiable methods become the state-of-the-arts on NAS, which can obtain high-performance in several days. However, they still suffer from huge computation costs and inferior performance due construction supernet. In this paper, we propose an efficient NAS method based proximal iterations (denoted as NASP). Different previous works,...

10.1609/aaai.v34i04.6143 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

You Only Propagate Once: Accelerating Adversarial Training via Maximal Principle

OPENALEX - Publications

Dinghuai Zhang Tianyuan Zhang Yiping Lu Zhanxing Zhu Bin Dong

Deep learning achieves state-of-the-art results in many tasks computer vision and natural language processing. However, recent works have shown that deep networks can be vulnerable to adversarial perturbations, which raised a serious robustness issue of networks. Adversarial training, typically formulated as robust optimization problem, is an effective way improving the A major drawback existing training algorithms computational overhead generation examples, far greater than network...

10.48550/arxiv.1905.00877 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Understanding and Enhancing the Transferability of Adversarial Examples

OPENALEX - Publications

Lei Wu Zhanxing Zhu Cheng Tai E Weinan

State-of-the-art deep neural networks are known to be vulnerable adversarial examples, formed by applying small but malicious perturbations the original inputs. Moreover, can \textit{transfer across models}: examples generated for a specific model will often mislead other unseen models. Consequently adversary leverage it attack deployed systems without any query, which severely hinder application of learning, especially in areas where security is crucial. In this work, we systematically...

10.48550/arxiv.1802.09707 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Learning with Noise: Enhance Distantly Supervised Relation Extraction with Dynamic Transition Matrix

OPENALEX - Publications

Bingfeng Luo Yansong Feng Zheng Wang Zhanxing Zhu Songfang Huang and 2 more

Bingfeng Luo, Yansong Feng, Zheng Wang, Zhanxing Zhu, Songfang Huang, Rui Yan, Dongyan Zhao. Proceedings of the 55th Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2017.

10.18653/v1/p17-1040 preprint EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2017-01-01

Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

OPENALEX - Publications

He Wang Edmond S. L. Ho Hubert P. H. Shum Zhanxing Zhu

Data-driven modeling of human motions is ubiquitous in computer graphics and vision applications, such as synthesizing realistic or recognizing actions. Recent research has shown that problems can be approached by learning a natural motion manifold using deep on large amount data, to address the shortcomings traditional data-driven approaches. However, previous methods sub-optimal for two reasons. First, skeletal information not been fully utilized feature extraction. Unlike images, it...

10.1109/tvcg.2019.2936810 article EN IEEE Transactions on Visualization and Computer Graphics 2019-08-22

Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction

OPENALEX - Publications

Ruochen Li Tanqiu Qiao Stamos Katsigiannis Zhanxing Zhu Hubert P. H. Shum

10.1109/tcsvt.2025.3539522 article EN IEEE Transactions on Circuits and Systems for Video Technology 2025-01-01

ST-UNet: A Spatio-Temporal U-Network for Graph-structured Time Series Modeling

OPENALEX - Publications

Bing Yu Haoteng Yin Zhanxing Zhu

The spatio-temporal graph learning is becoming an increasingly important object of study. Many application domains involve highly dynamic graphs where temporal information crucial, e.g. traffic networks and financial transaction graphs. Despite the constant progress made on structured data, there still a lack effective means to extract complex features from structures. Particularly, conventional models such as convolutional or recurrent neural are incapable revealing patterns in short long...

10.48550/arxiv.1903.05631 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Informative Dropout for Robust Representation Learning: A Shape-bias Perspective

OPENALEX - Publications

Baifeng Shi Dinghuai Zhang Qi Dai Jingdong Wang Zhanxing Zhu and 1 more

Convolutional Neural Networks (CNNs) are known to rely more on local texture rather than global shape when making decisions. Recent work also indicates a close relationship between CNN's texture-bias and its robustness against distribution shift, adversarial perturbation, random corruption, etc. In this work, we attempt at improving various kinds of universally by alleviating bias. With inspiration from the human visual system, propose light-weight model-agnostic method, namely Informative...

10.48550/arxiv.2008.04254 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Adversarial attacks on Faster R-CNN object detector

OPENALEX - Publications

Yutong Wang Kunfeng Wang Zhanxing Zhu Fei‐Yue Wang

10.1016/j.neucom.2019.11.051 article EN Neurocomputing 2019-12-04

3D Graph Convolutional Networks with Temporal Graphs: A Spatial Information Free Framework For Traffic Forecasting

OPENALEX - Publications

Bing Yu Mengzhang Li Jiyong Zhang Zhanxing Zhu

Spatio-temporal prediction plays an important role in many application areas especially traffic domain. However, due to complicated spatio-temporal dependency and high non-linear dynamics road networks, task is still challenging. Existing works either exhibit heavy training cost or fail accurately capture the patterns, also ignore correlation between distant roads that share similar patterns. In this paper, we propose a novel deep learning framework overcome these issues: 3D Temporal Graph...

10.48550/arxiv.1903.00919 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects

OPENALEX - Publications

Zhanxing Zhu Jingfeng Wu Bing Yu Lei Wu Jinwen Ma

Understanding the behavior of stochastic gradient descent (SGD) in context deep neural networks has raised lots concerns recently. Along this line, we study a general form based optimization dynamics with unbiased noise, which unifies SGD and standard Langevin dynamics. Through investigating dynamics, analyze on escaping from minima its regularization effects. A novel indicator is derived to characterize efficiency through measuring alignment noise covariance curvature loss function. Based...

10.48550/arxiv.1803.00195 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Coming Soon ...