Shiguang Wu

ORCID: 0000-0001-9091-5236
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Reinforcement Learning in Robotics
  • Distributed Control Multi-Agent Systems
  • Domain Adaptation and Few-Shot Learning
  • Robotic Path Planning Algorithms
  • Energy Efficient Wireless Sensor Networks
  • Robotics and Sensor-Based Localization
  • Neural Networks and Reservoir Computing
  • Modular Robots and Swarm Intelligence
  • UAV Applications and Optimization
  • Space Satellite Systems and Control
  • Evolutionary Game Theory and Cooperation
  • Opportunistic and Delay-Tolerant Networks
  • Supply Chain and Inventory Management
  • Dynamics and Control of Mechanical Systems
  • Robotic Mechanisms and Dynamics
  • Video Surveillance and Tracking Methods
  • Artificial Intelligence in Games
  • Evacuation and Crowd Dynamics
  • Robotic Locomotion and Control
  • Autonomous Vehicle Technology and Safety
  • Auction Theory and Applications
  • Multimodal Machine Learning Applications
  • Constraint Satisfaction and Optimization
  • Software-Defined Networks and 5G
  • Advanced Graph Neural Networks

Shandong Institute of Automation
2020-2024

Chinese Academy of Sciences
2020-2024

University of Chinese Academy of Sciences
2019-2023

Beijing Academy of Artificial Intelligence
2019-2023

Institute of Automation
2020-2022

Northeastern University
2021

Comtech Telecommunications (United States)
1989

Generating collision-free, time-efficient paths in an uncertain dynamic environment poses huge challenges for the formation control with collision avoidance (FCCA) problem a leader-follower structure. In particular, followers have to take both maintenance and into account simultaneously. Unfortunately, most of existing works are simple combinations methods dealing two problems separately. this article, new method based on deep reinforcement learning (RL) is proposed solve FCCA. Especially,...

10.1109/tnnls.2020.3004893 article EN IEEE Transactions on Neural Networks and Learning Systems 2020-07-16

In this article, a novel method, called attention enhanced reinforcement learning (AERL), is proposed to address issues including complex interaction, limited communication range, and time-varying topology for multi agent cooperation. AERL includes network (CEN), graph spatiotemporal long short-term memory (GST-LSTM), parameters sharing multi-pseudo critic proximal policy optimization (PS-MPC-PPO). Specifically, CEN based on mechanism designed enlarge the agents' range deal with interaction...

10.1109/tnnls.2022.3146858 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-02-18

Spatial reasoning is an essential problem in embodied AI research. Efforts to enhance spatial abilities through supplementary data and fine-tuning have proven limited ineffective when addressing complex tasks, largely due their dependence on language-based outputs. While some approaches introduced a point-based action space mitigate this issue, they fall short managing more intricate tasks within environments. This deficiency arises from failure fully exploit the inherent thinking...

10.48550/arxiv.2501.10074 preprint EN arXiv (Cornell University) 2025-01-17

Deriving a distributed, time-efficient, and connectivity-guaranteed coverage policy in multitarget environment poses huge challenges for multirobot team with limited communication. In particular, the robot needs to cover multiple targets while preserving connectivity. this article, novel deep-reinforcement-learning-based approach is proposed take both connectivity preservation into account simultaneously, which consists of four parts: hierarchical observation attention representation, an...

10.1109/tii.2022.3160629 article EN IEEE Transactions on Industrial Informatics 2022-03-19

Making efficient motion decisions for a multi-robot system is challenging problem in target encirclement with collision avoidance. Specifically, each robot local communication has to consider cooperative and avoidance simultaneously. In this paper, distributed transferable policy network framework based on deep reinforcement learning proposed solve the of The able process information uncertain number robots obstacles, which desirable property systems. particular, graph attention mechanism...

10.1109/ijcnn48605.2020.9207248 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2020-07-01

When dealing with a series of imminent issues, humans can naturally concentrate on subset these concerning issues by prioritizing them according to their contributions motivational indices, e.g., the probability winning game. This idea concentration offers insights into reinforcement learning sophisticated Large-scale Multi-Agent Systems (LMAS) participated hundreds agents. In such an LMAS, each agent receives long entity observations at step, which overwhelm existing aggregation networks as...

10.1609/aaai.v36i9.21165 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

This paper considers a multi-target coverage problem where robot team aims to efficiently cover multi-targets while maintaining connectivity in distributed manner. A novel knowledge-incorporated policy framework is proposed derive distributed, efficient, and guaranteed policy. In particular, knowledge-guided network (KGPnet) designed, which consists of observation attention representation, interaction learning. Giving credit the KGPnet, can be applied different number targets. Moreover,...

10.1109/icra48506.2021.9562017 article EN 2021-05-30

Collective behavior of multi-agent systems brings some new problems in control theory and application. Especially, flocking problem with uncertain nonlinear dynamics unknown external disturbances is a challenging problem. Some existing works assume that the intrinsic virtual leader same as those agents, which unreasonable impractical. To solve this issue, we consider an adaptive paper, where allowed to be different from agents. Firstly, approximate each agent, neural network used, whose...

10.1109/case48305.2020.9216754 article EN 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE) 2020-08-01

This paper proposes a collaborative policy framework via relational graph reasoning for multi-agent systems to accomplish adversarial tasks. A module consisting of an agent and opponent module, is designed enable each learn mixture state representation enhance the effectiveness policy. In particular, agent, infer different underlying influences from opponents generate agent-level representation. The creatively reason relations their surrounding objects including agents based on latent...

10.1109/iros51168.2021.9636636 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021-09-27

The formation control of multi-robot systems in complex environments is a challenging problem, which needs to handle velocity estimation and obstacle avoidance problems. However, few researches considered these problems simultaneously the literature. In this paper, leader-follower hybrid design for system proposed, solves problem. First, based on architecture, law designed. Second, considering difficulty obtaining whole accurate information, estimator leader designed, Lyapunov function used...

10.23919/chicc.2019.8866217 article EN 2019-07-01

The multi-robot coverage path planning (CPP) is the design of optimal motion sequence robots, which can make robots execute task covering all positions work area except obstacles. In this article, communication capability system applied, and a CPP mechanism proposed to control perform tasks in an unknown environment. mechanism, algorithm based on deep reinforcement learning proposed, generate next action for real-time according current state robots. addition, obstacle avoidance scheme...

10.1109/cse53436.2021.00015 article EN 2021-10-01

Meta-learning enables learning systems to adapt quickly new tasks, similar humans. To emulate this human-like rapid and enhance alignment discrimination abilities, we propose ConML, a universal meta-learning framework that can be applied various algorithms without relying on specific model architectures nor target models. The core of ConML is task-level contrastive learning, which extends from the representation space in unsupervised meta-learning. By leveraging task identity as an...

10.48550/arxiv.2410.05975 preprint EN arXiv (Cornell University) 2024-10-08

Inspired by psychological insights into individual behavior, we propose a novel cognition-oriented multiagent reinforcement learning (CORL) framework. CORL equips agents with two distinct types of cognition-situational and self-cognition-derived from local observations. To enhance the informativeness precision these cognition types, introduce information-theoretical regularizers: one to align situational global state other self-cognition each agent's identity for improved role...

10.1109/tnnls.2024.3502176 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-12-04

Multi-robot task detection and execution is an important direction that people have considered recently. We the dynamic issues related to tasks of multiple robots. This paper presents improved auction algorithm solve this problem combines it with by optimizing capacity utilization load balancing robot. By designing a new cost function, ability robot are added algorithm, matching difficulty assignment execution. In addition, we also add sequence adjustment mechanism avoid redundant loss due...

10.1109/cacre52464.2021.9501305 article EN 2021 6th International Conference on Automation, Control and Robotics Engineering (CACRE) 2021-07-01

In this paper, we propose a novel Intrinsic Reward method with Peer Incentives (IRPI) to promote the inter-agent direct interactions and implicitly address credit assignment problem in cooperative multi-agent reinforcement learning (MARL). The IRPI can build mutual incentives between agents by using their causal effect, realize advanced cooperation. Specifically, new intrinsic reward mechanism is conducted, which equips each agent ability other effect them. Moreover, built through neural...

10.1109/ijcnn55064.2022.9892092 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

Recently, multiagent reinforcement learning (MARL) has shown great potential for cooperative policies in systems (MASs). However, a noticeable drawback of current MARL is the low sample efficiency, which causes huge amount interactions with environment. Such greatly hinders real-world application MARL. Fortunately, effectively incorporating experience knowledge can assist to quickly find effective solutions, significantly alleviate drawback. In this article, novel multiexperience-assisted...

10.1109/tnnls.2023.3264275 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-04-10

Multimodal recommendation system has been widely used in short video platform, e-commerce platform and news media. data contains information such as product image text, which is often auxiliary signal to improve the effect of significantly. In order alleviate problems sparsity noise, some researchers construct augmentation use self-supervised learning help model training. These methods have achieved certain results. However, most work based on random ways, masking perturbation. This method...

10.1145/3581783.3612568 article EN 2023-10-26

In ball sports, such as basketball, the coach can guide players to better offend and defend from a holistic perspective win game. Inspired by scenarios, we introduce coach-like concept into decision-making process of cooperative multi-agent systems. We propose new framework Commander-Soldiers Reinforcement Learning (CSRL), for Multi-Agent Specifically, virtual role, Commander, which obtain encode global information every T steps send encoded guidance Soldiers (real agents). Furthermore,...

10.1109/ijcnn55064.2022.9892794 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

Learning an effective strategy is challenging for agents in partially observable environment, where the can only observe a part of environment information and make decisions based on local information. Hence, how to effectively utilize achieve efficient cooperation among particularly important. The establish understanding themselves their surrounding historical observation However, lacking global or limited, resulting low performance some complex tasks. To solve problem, situational...

10.1109/ijcnn55064.2022.9892770 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2022-07-18

A mobile robot network is a system composed of multiple robots with simple functions through communication and collaboration. To achieve coordination cooperation between robots, designing an efficient mechanism key issue. However, due to the characteristics autonomous movement harsh environment in which perform tasks, facing difficulties such as exhaustion node power, damage, link failure. In order solve these problems, we design intelligent routing for networks based on capabilities aims...

10.1109/icicsp54369.2021.9611922 article EN 2021-09-24

Robots are now essential in unreachable, repeated, and dangerous real-world applications where they take place of human beings. One the important capabilities a multi-robot system is that it should be able to form an autonomous robot network transmit information. However, due limited communication capability single robot, highly variable environment robots work, mobility robots, difficult for them exchange information with each other need. In this article, we propose novel routing protocol...

10.1109/icicsp54369.2021.9611881 article EN 2021-09-24
Coming Soon ...