NFDI4DS | UHH-SEMS - Publication Details

Shiguang Wu

ORCID: 0000-0001-9091-5236

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5103000550

Research Areas

Reinforcement Learning in Robotics
Distributed Control Multi-Agent Systems
Domain Adaptation and Few-Shot Learning
Robotic Path Planning Algorithms
Energy Efficient Wireless Sensor Networks
Robotics and Sensor-Based Localization
Neural Networks and Reservoir Computing
Modular Robots and Swarm Intelligence
UAV Applications and Optimization
Space Satellite Systems and Control
Evolutionary Game Theory and Cooperation
Opportunistic and Delay-Tolerant Networks
Supply Chain and Inventory Management
Dynamics and Control of Mechanical Systems
Robotic Mechanisms and Dynamics
Video Surveillance and Tracking Methods
Artificial Intelligence in Games
Evacuation and Crowd Dynamics
Robotic Locomotion and Control
Autonomous Vehicle Technology and Safety
Auction Theory and Applications
Multimodal Machine Learning Applications
Constraint Satisfaction and Optimization
Software-Defined Networks and 5G
Advanced Graph Neural Networks

Shandong Institute of Automation
2020-2024

Chinese Academy of Sciences
2020-2024

University of Chinese Academy of Sciences
2019-2023

Beijing Academy of Artificial Intelligence
2019-2023

Institute of Automation
2020-2022

Northeastern University
2021

Comtech Telecommunications (United States)
1989

Formation Control With Collision Avoidance Through Deep Reinforcement Learning Using Model-Guided Demonstration

OPENALEX - Publications

Zezhi Sui Zhiqiang Pu Jianqiang Yi Shiguang Wu

Generating collision-free, time-efficient paths in an uncertain dynamic environment poses huge challenges for the formation control with collision avoidance (FCCA) problem a leader-follower structure. In particular, followers have to take both maintenance and into account simultaneously. Unfortunately, most of existing works are simple combinations methods dealing two problems separately. this article, new method based on deep reinforcement learning (RL) is proposed solve FCCA. Especially,...

10.1109/tnnls.2020.3004893 article EN IEEE Transactions on Neural Networks and Learning Systems 2020-07-16

Attention Enhanced Reinforcement Learning for Multi agent Cooperation

OPENALEX - Publications

Zhiqiang Pu Huimu Wang Zhen Liu Jianqiang Yi Shiguang Wu

In this article, a novel method, called attention enhanced reinforcement learning (AERL), is proposed to address issues including complex interaction, limited communication range, and time-varying topology for multi agent cooperation. AERL includes network (CEN), graph spatiotemporal long short-term memory (GST-LSTM), parameters sharing multi-pseudo critic proximal policy optimization (PS-MPC-PPO). Specifically, CEN based on mechanism designed enlarge the agents' range deal with interaction...

10.1109/tnnls.2022.3146858 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-02-18

SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning

OPENALEX - Publications

Yuecheng Liu Dafeng Chi Shiguang Wu Zhanguang Zhang Yaochen Hu and 9 more

Spatial reasoning is an essential problem in embodied AI research. Efforts to enhance spatial abilities through supplementary data and fine-tuning have proven limited ineffective when addressing complex tasks, largely due their dependence on language-based outputs. While some approaches introduced a point-based action space mitigate this issue, they fall short managing more intricate tasks within environments. This deficiency arises from failure fully exploit the inherent thinking...

10.48550/arxiv.2501.10074 preprint EN arXiv (Cornell University) 2025-01-17

Deep-Reinforcement-Learning-Based Multitarget Coverage With Connectivity Guaranteed

OPENALEX - Publications

Shiguang Wu Zhiqiang Pu Tenghai Qiu Jianqiang Yi Tianle Zhang

Deriving a distributed, time-efficient, and connectivity-guaranteed coverage policy in multitarget environment poses huge challenges for multirobot team with limited communication. In particular, the robot needs to cover multiple targets while preserving connectivity. this article, novel deep-reinforcement-learning-based approach is proposed take both connectivity preservation into account simultaneously, which consists of four parts: hierarchical observation attention representation, an...

10.1109/tii.2022.3160629 article EN IEEE Transactions on Industrial Informatics 2022-03-19

Multi-Robot Cooperative Target Encirclement through Learning Distributed Transferable Policy

OPENALEX - Publications

Tianle Zhang Zhen Liu Shiguang Wu Zhiqiang Pu Jianqiang Yi

Making efficient motion decisions for a multi-robot system is challenging problem in target encirclement with collision avoidance. Specifically, each robot local communication has to consider cooperative and avoidance simultaneously. In this paper, distributed transferable policy network framework based on deep reinforcement learning proposed solve the of The able process information uncertain number robots obstacles, which desirable property systems. particular, graph attention mechanism...

10.1109/ijcnn48605.2020.9207248 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2020-07-01

Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

OPENALEX - Publications

Qingxu Fu Tenghai Qiu Jianqiang Yi Zhiqiang Pu Shiguang Wu

When dealing with a series of imminent issues, humans can naturally concentrate on subset these concerning issues by prioritizing them according to their contributions motivational indices, e.g., the probability winning game. This idea concentration offers insights into reinforcement learning sophisticated Large-scale Multi-Agent Systems (LMAS) participated hundreds agents. In such an LMAS, each agent receives long entity observations at step, which overwhelm existing aggregation networks as...

10.1609/aaai.v36i9.21165 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Multi-target Coverage with Connectivity Maintenance using Knowledge-incorporated Policy Framework

OPENALEX - Publications

Shiguang Wu Zhiqiang Pu Zhen Liu Tenghai Qiu Jianqiang Yi and 1 more

This paper considers a multi-target coverage problem where robot team aims to efficiently cover multi-targets while maintaining connectivity in distributed manner. A novel knowledge-incorporated policy framework is proposed derive distributed, efficient, and guaranteed policy. In particular, knowledge-guided network (KGPnet) designed, which consists of observation attention representation, interaction learning. Giving credit the KGPnet, can be applied different number targets. Moreover,...

10.1109/icra48506.2021.9562017 article EN 2021-05-30

Adaptive Flocking of Multi-Agent Systems with Uncertain Nonlinear Dynamics and Unknown Disturbances Using Neural Networks

OPENALEX - Publications

Shiguang Wu Zhiqiang Pu Jianqiang Yi Jinlin Sun Tianyi Xiong and 1 more

Collective behavior of multi-agent systems brings some new problems in control theory and application. Especially, flocking problem with uncertain nonlinear dynamics unknown external disturbances is a challenging problem. Some existing works assume that the intrinsic virtual leader same as those agents, which unreasonable impractical. To solve this issue, we consider an adaptive paper, where allowed to be different from agents. Firstly, approximate each agent, neural network used, whose...

10.1109/case48305.2020.9216754 article EN 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE) 2020-08-01

Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments

OPENALEX - Publications

Shiguang Wu Tenghai Qiu Zhiqiang Pu Jianqiang Yi

This paper proposes a collaborative policy framework via relational graph reasoning for multi-agent systems to accomplish adversarial tasks. A module consisting of an agent and opponent module, is designed enable each learn mixture state representation enhance the effectiveness policy. In particular, agent, infer different underlying influences from opponents generate agent-level representation. The creatively reason relations their surrounding objects including agents based on latent...

10.1109/iros51168.2021.9636636 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021-09-27

A Hybrid Formation Control Design for Multi-Robot System with Obstacle Avoidance

OPENALEX - Publications

Shiguang Wu Zezhi Sui Jianqiang Yi Zhiqiang Pu

The formation control of multi-robot systems in complex environments is a challenging problem, which needs to handle velocity estimation and obstacle avoidance problems. However, few researches considered these problems simultaneously the literature. In this paper, leader-follower hybrid design for system proposed, solves problem. First, based on architecture, law designed. Second, considering difficulty obtaining whole accurate information, estimator leader designed, Lyapunov function used...

10.23919/chicc.2019.8866217 article EN 2019-07-01

Multi-Robot Coverage Path Planning based on Deep Reinforcement Learning

OPENALEX - Publications

Xiaolin Zhou Xiaojie Liu Xingwei Wang Shiguang Wu Mingyang Sun

The multi-robot coverage path planning (CPP) is the design of optimal motion sequence robots, which can make robots execute task covering all positions work area except obstacles. In this article, communication capability system applied, and a CPP mechanism proposed to control perform tasks in an unknown environment. mechanism, algorithm based on deep reinforcement learning proposed, generate next action for real-time according current state robots. addition, obstacle avoidance scheme...

10.1109/cse53436.2021.00015 article EN 2021-10-01

ConML: A Universal Meta-Learning Framework with Task-Level Contrastive Learning

OPENALEX - Publications

Shiguang Wu Yaqing Wang Yatao Bian Quanming Yao

Meta-learning enables learning systems to adapt quickly new tasks, similar humans. To emulate this human-like rapid and enhance alignment discrimination abilities, we propose ConML, a universal meta-learning framework that can be applied various algorithms without relying on specific model architectures nor target models. The core of ConML is task-level contrastive learning, which extends from the representation space in unsupervised meta-learning. By leveraging task identity as an...

10.48550/arxiv.2410.05975 preprint EN arXiv (Cornell University) 2024-10-08

Cognition-Oriented Multiagent Reinforcement Learning

OPENALEX - Publications

Tenghai Qiu Shiguang Wu Zhen Liu Zhiqiang Pu Jianqiang Yi and 2 more

Inspired by psychological insights into individual behavior, we propose a novel cognition-oriented multiagent reinforcement learning (CORL) framework. CORL equips agents with two distinct types of cognition-situational and self-cognition-derived from local observations. To enhance the informativeness precision these cognition types, introduce information-theoretical regularizers: one to align situational global state other self-cognition each agent's identity for improved role...

10.1109/tnnls.2024.3502176 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-12-04

Multi-robot Dynamic Task Allocation Based on Improved Auction Algorithm

OPENALEX - Publications

Shiguang Wu Xiaojie Liu Xingwei Wang Xiaolin Zhou Mingyang Sun

Multi-robot task detection and execution is an important direction that people have considered recently. We the dynamic issues related to tasks of multiple robots. This paper presents improved auction algorithm solve this problem combines it with by optimizing capacity utilization load balancing robot. By designing a new cost function, ability robot are added algorithm, matching difficulty assignment execution. In addition, we also add sequence adjustment mechanism avoid redundant loss due...

10.1109/cacre52464.2021.9501305 article EN 2021 6th International Conference on Automation, Control and Robotics Engineering (CACRE) 2021-07-01

Intrinsic Reward with Peer Incentives for Cooperative Multi-Agent Reinforcement Learning

OPENALEX - Publications

Tianle Zhang Zhen Liu Shiguang Wu Zhiqiang Pu Jianqiang Yi

In this paper, we propose a novel Intrinsic Reward method with Peer Incentives (IRPI) to promote the inter-agent direct interactions and implicitly address credit assignment problem in cooperative multi-agent reinforcement learning (MARL). The IRPI can build mutual incentives between agents by using their causal effect, realize advanced cooperation. Specifically, new intrinsic reward mechanism is conducted, which equips each agent ability other effect them. Moreover, built through neural...

10.1109/ijcnn55064.2022.9892092 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

Multiexperience-Assisted Efficient Multiagent Reinforcement Learning

OPENALEX - Publications

Tianle Zhang Zhen Liu Jianqiang Yi Shiguang Wu Zhiqiang Pu and 1 more

Recently, multiagent reinforcement learning (MARL) has shown great potential for cooperative policies in systems (MASs). However, a noticeable drawback of current MARL is the low sample efficiency, which causes huge amount interactions with environment. Such greatly hinders real-world application MARL. Fortunately, effectively incorporating experience knowledge can assist to quickly find effective solutions, significantly alleviate drawback. In this article, novel multiexperience-assisted...

10.1109/tnnls.2023.3264275 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-04-10

Modal-aware Bias Constrained Contrastive Learning for Multimodal Recommendation

OPENALEX - Publications

Wei Yang Zhengru Fang Tianle Zhang Shiguang Wu Chi Lu

Multimodal recommendation system has been widely used in short video platform, e-commerce platform and news media. data contains information such as product image text, which is often auxiliary signal to improve the effect of significantly. In order alleviate problems sparsity noise, some researchers construct augmentation use self-supervised learning help model training. These methods have achieved certain results. However, most work based on random ways, masking perturbation. This method...

10.1145/3581783.3612568 article EN 2023-10-26

Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems

OPENALEX - Publications

Yiqun Chen Wei Yang Tianle Zhang Shiguang Wu Hongxing Chang

In ball sports, such as basketball, the coach can guide players to better offend and defend from a holistic perspective win game. Inspired by scenarios, we introduce coach-like concept into decision-making process of cooperative multi-agent systems. We propose new framework Commander-Soldiers Reinforcement Learning (CSRL), for Multi-Agent Specifically, virtual role, Commander, which obtain encode global information every T steps send encoded guidance Soldiers (real agents). Furthermore,...

10.1109/ijcnn55064.2022.9892794 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

Multi-Agent Local Information Reconstruction for Situational Cognition

OPENALEX - Publications

Shiguang Wu Zhiqiang Pu Tianle Zhang Tenghai Qiu Jianqiang Yi

Learning an effective strategy is challenging for agents in partially observable environment, where the can only observe a part of environment information and make decisions based on local information. Hence, how to effectively utilize achieve efficient cooperation among particularly important. The establish understanding themselves their surrounding historical observation However, lacking global or limited, resulting low performance some complex tasks. To solve problem, situational...

10.1109/ijcnn55064.2022.9892770 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

Intelligent Routing Mechanism in Mobile Robot Networks

OPENALEX - Publications

Mingyang Sun Xingwei Wang Xiaojie Liu Shiguang Wu Xiaolin Zhou and 1 more

A mobile robot network is a system composed of multiple robots with simple functions through communication and collaboration. To achieve coordination cooperation between robots, designing an efficient mechanism key issue. However, due to the characteristics autonomous movement harsh environment in which perform tasks, facing difficulties such as exhaustion node power, damage, link failure. In order solve these problems, we design intelligent routing for networks based on capabilities aims...

10.1109/icicsp54369.2021.9611922 article EN 2021-09-24

A Multi-agent Reinforcement Learning Routing Protocol in Mobile Robot Network

OPENALEX - Publications

Mingyang Sun Xingwei Wang Xiaojie Liu Shiguang Wu Xiaolin Zhou and 1 more

Robots are now essential in unreachable, repeated, and dangerous real-world applications where they take place of human beings. One the important capabilities a multi-robot system is that it should be able to form an autonomous robot network transmit information. However, due limited communication capability single robot, highly variable environment robots work, mobility robots, difficult for them exchange information with each other need. In this article, we propose novel routing protocol...

10.1109/icicsp54369.2021.9611881 article EN 2021-09-24

Coming Soon ...