NFDI4DS | UHH-SEMS - Publication Details

Xiao Zhang

ORCID: 0000-0003-4927-5016

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5032417256

Research Areas

Reinforcement Learning in Robotics
Opinion Dynamics and Social Influence
Autonomous Vehicle Technology and Safety
Network Security and Intrusion Detection
Complex Network Analysis Techniques
Advanced Multi-Objective Optimization Algorithms
Transportation and Mobility Innovations
Robotic Path Planning Algorithms
Military Defense Systems Analysis
Advanced Decision-Making Techniques
Vehicular Ad Hoc Networks (VANETs)
Economic theories and models
Traffic control and management
Advanced Control Systems Optimization
Mobile Agent-Based Network Management
Game Theory and Applications
Evolutionary Algorithms and Applications
Optimization and Search Problems
Network Traffic and Congestion Control
Anomaly Detection Techniques and Applications
Metaheuristic Optimization Algorithms Research
Advanced Text Analysis Techniques
Neural Networks and Reservoir Computing
Traffic Prediction and Management Techniques
AI and Big Data Applications

Beihang University
2023-2025

State Grid Corporation of China (China)
2024

Ministry of Education of the People's Republic of China
2023-2024

Ji Hua Laboratory
2023

Nanjing University of Posts and Telecommunications
2023

Peng Cheng Laboratory
2023

Large-Scale Group Opinion Evolution With Coexistence of Influential Individuals and Strongly Organized Groups Based on Mean Field Games

OPENALEX - Publications

Lu Ren Yuxin Jin Wang Yao Xiao Zhang Guohui Jiao

10.1109/tnse.2025.3546295 article EN IEEE Transactions on Network Science and Engineering 2025-01-01

Hierarchical Cooperation in LQ Multi-Population Mean Field Game With Its Application to Opinion Evolution

OPENALEX - Publications

Lu Ren Yuxin Jin Zijia Niu Wang Yao Xiao Zhang

10.1109/tnse.2024.3418832 article EN IEEE Transactions on Network Science and Engineering 2024-09-01

Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations

OPENALEX - Publications

Guojian Wang Faguo Wu Xiao Zhang Tianyuan Chen

The sparsity of reward feedback remains a challenging problem in online deep reinforcement learning (DRL). Previous approaches have utilized temporal credit assignment (CA) to achieve impressive results multiple hard tasks. However, many CA methods relied on complex architectures or introduced sensitive hyperparameters estimate the impact state-action pairs. Meanwhile, premise feasibility is obtain trajectories with sparse rewards, which can be troublesome sparse-reward environments large...

10.48550/arxiv.2401.00162 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Integrated Task Assignment and Trajectory Planning for a Massive Number of Agents Based on Bilayer-Coupled Mean Field Games

OPENALEX - Publications

Zijia Niu Wang Yao Yuxin Jin S.K.Stephen Huang Xiao Zhang and 1 more

Aiming at the problem of integrated task assignment and trajectory planning a massive number agents in scenario with different priority nodes multiple static obstacles, this paper proposes general framework based on bilayer-coupled mean field games, which couples minimum cost an agent process to achieve reasonable, globally optimal, targeted adjustable result. In proposed framework, firstly, multi-population game is used plan optimal between each pair adjacent nodes, costs are calculated....

10.1109/tase.2024.3370619 article EN IEEE Transactions on Automation Science and Engineering 2024-01-01

Differential Pricing Strategies for Bandwidth Allocation With LFA Resilience: A Stackelberg Game Approach

OPENALEX - Publications

Lijia Xie Shuai Meng Wang Yao Xiao Zhang

Link flooding attacks (LFAs) have always been a security concern as the impact of volumetric on transit links are increasingly severe. Capacity expansion, while being effective in combating LFAs, involves considerable deployment costs. Therefore, how to efficiently manage link resource among spatio-temporal dynamic customers remains challenge for Internet service providers (ISPs). In this paper, we study differential pricing strategy bandwidth allocation with LFA resilience by leveraging...

10.1109/tifs.2023.3299181 article EN IEEE Transactions on Information Forensics and Security 2023-01-01

FP-WDDQN: An improved deep reinforcement learning algorithm for adaptive traffic signal control

OPENALEX - Publications

Xiao Zhang Xiaolong Xu

Current adaptive traffic signal control methods based on centralized deep reinforcement learning are not applicable in large-scale environment. The scalability problem is overcome by assigning global to each local RL agent through multi-intelligence learning, but the environment now becomes partially visible ami non-stationarity from perspective of due limited communication between agents. In this paper, we propose a multi-agent framework called Forgetful Priority Weighed Double Deep...

10.1109/icdmw60847.2023.00015 article EN 2022 IEEE International Conference on Data Mining Workshops (ICDMW) 2023-12-04

Learning Diverse Policies with Soft Self-Generated Guidance

OPENALEX - Publications

Guojian Wang Faguo Wu Xiao Zhang Jianxiang Liu

Reinforcement learning (RL) with sparse and deceptive rewards is challenging because non-zero are rarely obtained. Hence, the gradient calculated by agent can be stochastic without valid information. Recent studies that utilize memory buffers of previous experiences lead to a more efficient process. However, existing methods often require these successful may overly exploit them, which cause adopt suboptimal behaviors. This paper develops an approach uses diverse past trajectories for faster...

10.1155/2023/4705291 article EN cc-by International Journal of Intelligent Systems 2023-04-18

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

OPENALEX - Publications

Guojian Wang Faguo Wu Xiao Zhang Ning Guo Zhiming Zheng

10.1016/j.knosys.2023.111334 article EN Knowledge-Based Systems 2023-12-27

Rethink prevalent machine learning attack detection methods from a generalization perspective

OPENALEX - Publications

Mingjie Shi Xiao Zhang Chong Ruan Po Wu Chuanfu Zhang and 3 more

10.1109/icpeca60615.2024.10470969 article EN 2024-01-26

Trajectory-Oriented Policy Optimization with Sparse Rewards

OPENALEX - Publications

Guojian Wang Faguo Wu Xiao Zhang

10.1109/cipcv61763.2024.00023 article EN 2024-05-17

Social optimum of finite mean field games: existence and uniqueness of equilibrium solutions in the finite horizon and stationary solutions in the infinite horizon

OPENALEX - Publications

Zijia Niu S.K.Stephen Huang Lu Ren Wang Yao Xiao Zhang

In this paper, we consider the social optimal problem of discrete time finite state space mean field games (referred to as [1]). Unlike individual optimization their own cost function in competitive models, consider, individuals aim optimize by finding a fixed point distribution achieve equilibrium game. We provide sufficient condition for existence and uniqueness strategies used minimize cost. According definition optimum derived properties cost, conditions solutions under initial-terminal...

10.48550/arxiv.2408.04291 preprint EN arXiv (Cornell University) 2024-08-08

Model-free robust reinforcement learning via Polynomial Chaos

OPENALEX - Publications

Jianxiang Liu Faguo Wu Xiao Zhang

10.1016/j.knosys.2024.112783 article EN Knowledge-Based Systems 2024-11-01

A Global Optimal Task Allocation Model for Large-scale Agents Based on Mean Field Game

OPENALEX - Publications

S.K.Stephen Huang Wang Yao Zijia Niu Xiao Zhang

10.1109/cdc56724.2024.10886441 article EN 2024-12-16

Feature-Based Local Ensemble Framework for Multi-Agent Reinforcement Learning

OPENALEX - Publications

Xinyu Zhao Jianxiang Liu Faguo Wu Xiao Zhang

10.1109/iscsic64297.2024.00060 article EN 2024-09-06

Research on the Construction Scheme of University Smart Financial Innovation Practice Base under the Background of Big Data

OPENALEX - Publications

Xiao Zhang

After the rise of education reform in China, more and schools began to pay attention construction practice bases. As an indispensable auxiliary part each enterprise's development, finance naturally received high attention. Especially with advent big data era, demand for high-quality compound financial talents is growing rapidly. However, practical skills cultivated from traditional bases built by various universities are relatively weak, it difficult really fill required modern enterprises....

10.23977/aduhe.2023.050606 article EN Adult and Higher Education 2023-01-01

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

OPENALEX - Publications

Guojian Wang Faguo Wu Xiao Zhang Ning Guo Zhiming Zheng

Deep reinforcement learning (DRL) faces significant challenges in addressing the hard-exploration problems tasks with sparse or deceptive rewards and large state spaces. These severely limit practical application of DRL. Most previous exploration methods relied on complex architectures to estimate novelty introduced sensitive hyperparameters, resulting instability. To mitigate these issues, we propose an efficient adaptive trajectory-constrained strategy for The proposed method guides policy...

10.48550/arxiv.2312.16456 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

Coming Soon ...