NFDI4DS | UHH-SEMS - Publication Details

Haiyin Piao

ORCID: 0000-0002-8519-4750

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5004540162

Research Areas

Guidance and Control Systems
Reinforcement Learning in Robotics
Robotic Path Planning Algorithms
Military Defense Systems Analysis
Autonomous Vehicle Technology and Safety
Advanced Neural Network Applications
Optimization and Search Problems
Aerospace and Aviation Technology
Anomaly Detection Techniques and Applications
Artificial Intelligence in Games
Age of Information Optimization
Advanced Bandit Algorithms Research
UAV Applications and Optimization
Stochastic Gradient Optimization Techniques
Adaptive Control of Nonlinear Systems
Human Pose and Action Recognition
Image Processing Techniques and Applications
Robotics and Sensor-Based Localization
Adaptive Dynamic Programming Control
Adversarial Robustness in Machine Learning
Metaheuristic Optimization Algorithms Research
Data Stream Mining Techniques
Advanced Vision and Imaging
Supply Chain and Inventory Management
Machine Learning in Bioinformatics

Jilin University
2024-2025

Northwestern Polytechnical University
2020-2024

Shenyang Aerospace University
2017-2024

Shenyang Institute of Engineering
2023

Dalian University of Technology
2021

Multi-agent hierarchical policy gradient for Air Combat Tactics emergence via self-play

OPENALEX - Publications

Zhixiao Sun Haiyin Piao Zhen Yang Yiyang Zhao Guang Zhan and 6 more

10.1016/j.engappai.2020.104112 article EN Engineering Applications of Artificial Intelligence 2020-12-07

A Two-Stage Attentive Network for Single Image Super-Resolution

OPENALEX - Publications

Jiqing Zhang Chengjiang Long Yuxin Wang Haiyin Piao Haiyang Mei and 2 more

Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and contribute remarkable progress. However, most of the existing CNNs-based SISR methods do not adequately explore contextual information feature extraction stage pay little attention to final high-resolution (HR) reconstruction step, hence hindering desired SR performance. To address above two issues, this paper, we propose a two-stage attentive network (TSAN) for accurate...

10.1109/tcsvt.2021.3071191 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-04-05

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

OPENALEX - Publications

Chengchao Bai Yan Peng Haiyin Piao Wei Pan Jifeng Guo

This article explores deep reinforcement learning (DRL) for the flocking control of unmanned aerial vehicle (UAV) swarms. The policy is trained using a centralized-learning-decentralized-execution (CTDE) paradigm, where centralized critic network augmented with additional information about entire UAV swarm utilized to improve efficiency. Instead inter-UAV collision avoidance capabilities, repulsion function encoded as an inner-UAV "instinct." In addition, UAVs can obtain states other through...

10.1109/tcyb.2023.3246985 article EN IEEE Transactions on Cybernetics 2023-03-08

Online hierarchical recognition method for target tactical intention in beyond-visual-range air combat

OPENALEX - Publications

Zhen Yang Zhixiao Sun Haiyin Piao Jichuan Huang Deyun Zhou and 1 more

Online accurate recognition of target tactical intention in beyond-visual-range (BVR) air combat is an important basis for deep situational awareness and autonomous decision-making, which can create pre-emptive opportunities the fighter to gain superiority. The existing methods solve this problem have some defects such as dependence on empirical knowledge, difficulty interpreting results, inability meet requirements actual combat. So online hierarchical method BVR based cascaded support...

10.1016/j.dt.2022.02.001 article EN cc-by-nc-nd Defence Technology 2022-02-09

Camouflaged Object Segmentation with Omni Perception

OPENALEX - Publications

Haiyang Mei Ke Xu Yunduo Zhou Yang Wang Haiyin Piao and 2 more

10.1007/s11263-023-01838-2 article EN International Journal of Computer Vision 2023-07-12

Evasive Maneuver Strategy for UCAV in Beyond-Visual-Range Air Combat Based on Hierarchical Multi-Objective Evolutionary Algorithm

OPENALEX - Publications

Zhen Yang Deyun Zhou Haiyin Piao Kai Zhang Weiren Kong and 1 more

This study deals with the autonomous evasive maneuver strategy of unmanned combat air vehicle (UCAV), which is threatened by a high-performance beyond-visual-range (BVR) air-to-air missile (AAM). Considering tactical demands achieving self-conflicting objectives in actual combat, including higher miss distance, less energy consumption and longer guidance support time, problem BVR defined reformulated into multi-objective optimization problem. Effective maneuvers UCAV used different evasion...

10.1109/access.2020.2978883 article EN cc-by IEEE Access 2020-01-01

Combining sequence and network information to enhance protein–protein interaction prediction

OPENALEX - Publications

Leilei Liu Xianglei Zhu Yi Ma Haiyin Piao Yaodong Yang and 4 more

Abstract Background Protein–protein interactions (PPIs) are of great importance in cellular systems organisms, since they the basis structure and function many essential processes related to that. Most proteins perform their functions by interacting with other proteins, so predicting PPIs accurately is crucial for understanding cell physiology. Results Recently, graph convolutional networks (GCNs) have been proposed capture information generate representations nodes graph. In our paper, we...

10.1186/s12859-020-03896-6 article EN cc-by BMC Bioinformatics 2020-12-01

Beyond-Visual-Range Air Combat Tactics Auto-Generation by Reinforcement Learning

OPENALEX - Publications

Haiyin Piao Zhixiao Sun Guanglei Meng Hechang Chen Bohao Qu and 4 more

For quite a long time, effective Beyond-Visual-Range (BVR) air combat tactics can only be discovered by human pilots in the actual process. However, due to lack of opportunities, making new innovation was generally considered difficult. To address this challenge, we first introduced solely end-to-end Reinforcement Learning (RL) approach for training competitive agents with adversarial self-play from scratch high fidelity simulation environment during training. Furthermore, Key Air Combat...

10.1109/ijcnn48605.2020.9207088 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2020-07-01

Monocular Camera-Based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning

OPENALEX - Publications

Jianchuan Ding Lingping Gao Wenxi Liu Haiyin Piao Jia Pan and 3 more

Deep reinforcement learning has achieved great success in laser-based collision avoidance works because the laser can sense accurate depth information without too much redundant data, which maintain robustness of algorithm when it is migrated from simulation environment to real world. However, high-cost devices are not only difficult deploy for a large scale robots but also demonstrate unsatisfactory towards complex obstacles, including irregular e.g., tables, chairs, and shelves, as well...

10.1109/tcsvt.2022.3203974 article EN IEEE Transactions on Circuits and Systems for Video Technology 2022-09-05

Complex relationship graph abstraction for autonomous air combat collaboration: A learning and expert knowledge hybrid approach

OPENALEX - Publications

Haiyin Piao Yue Han Hechang Chen Xuanqi Peng Songyuan Fan and 5 more

10.1016/j.eswa.2022.119285 article EN Expert Systems with Applications 2022-11-19

Cooperative Multiagent Learning and Exploration With Min–Max Intrinsic Motivation

OPENALEX - Publications

Yaqing Hou Jiarui Kang Haiyin Piao Yifeng Zeng Yew-Soon Ong and 2 more

In the field of multiagent reinforcement learning (MARL), ability to effectively explore unknown environments and collect information experiences that are most beneficial for policy represents a critical research area. However, existing work often encounters difficulties in addressing uncertainties caused by state changes inconsistencies between agents' local observations global information, which presents significant challenges coordinated exploration among multiple agents. To address this...

10.1109/tcyb.2025.3557694 article EN IEEE Transactions on Cybernetics 2025-01-01

Generalizable Causal Reinforcement Learning for Out-of-Distribution Environments

OPENALEX - Publications

Sili Huang Jifeng Hu Hechang Chen Peng Cui Haiyin Piao and 2 more

10.1109/tii.2025.3556029 article EN IEEE Transactions on Industrial Informatics 2025-01-01

Multi-agent air combat with two-stage graph-attention communication

OPENALEX - Publications

Zhixiao Sun Huahua Wu Yandong Shi Xiangchao Yu Yifan Gao and 4 more

10.1007/s00521-023-08784-7 article EN Neural Computing and Applications 2023-07-06

The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure

OPENALEX - Publications

Xing Chen Dongcui Diao Hechang Chen Hengshuai Yao Haiyin Piao and 5 more

The popular Proximal Policy Optimization (PPO) algorithm approximates the solution in a clipped policy space. Does there exist better policies outside of this space? By using novel surrogate objective that employs sigmoid function (which provides an interesting way exploration), we found answer is "YES", and are fact located very far from We show PPO insufficient "off-policyness", according to off-policy metric called DEON. Our explores much larger space than PPO, it maximizes Conservative...

10.1609/aaai.v37i6.25864 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Nondominated Maneuver Strategy Set With Tactical Requirements for a Fighter Against Missiles in a Dogfight

OPENALEX - Publications

Zhen Yang Deyun Zhou Weiren Kong Haiyin Piao Kai Zhang and 1 more

Dogfight is often a continuous and multi-round process with missile attacks. If the fighter only considers security when evading incoming missile, it will easily lose superiority in subsequent air combat. Therefore, necessary to maintain as much tactical possible while ensuring successful evasion. The amalgamative requirements of achieving multiple evasive objectives dogfight are taken into account this paper. A method generating nondominated maneuver strategy set for missiles proposed....

10.1109/access.2020.3004864 article EN cc-by IEEE Access 2020-01-01

Cooperative Multiple Task Assignment Problem With Target Precedence Constraints Using a Waitable Path Coordination and Modified Genetic Algorithm

OPENALEX - Publications

Yiyang Zhao Deyun Zhou Haiyin Piao Zhen Yang Rui Hou and 2 more

Task assignment is a critical technology for heterogeneous unmanned aerial vehicle (UAV) applications. Target precedence has typically been ignored in previous studies, such that it possible to obtain task solution with an unreasonable target execution order. For this reason, cooperative multiple problem constraints (CMTAPTPC) model proposed paper, which considers not only kinematic, resource, and of the UAV, but also achieve more realistic scenarios. In addition, graph method improved...

10.1109/access.2021.3063263 article EN cc-by IEEE Access 2021-01-01

Behavior Reasoning for Opponent Agents in Multi-Agent Learning Systems

OPENALEX - Publications

Yaqing Hou Mingyang Sun Wenxuan Zhu Yifeng Zeng Haiyin Piao and 2 more

One important component of developing autonomous agents lies in the accurate prediction their opponents' behaviors when interact with others an uncertain environment. Most recent study focuses on first constructing predictive types (or models) opponents, considering various properties interest, and subsequently using these models to predict accordingly. However, as possible type space can be rather large, it is time-consuming, sometimes even infeasible, actual opponents all candidate types....

10.1109/tetci.2022.3147011 article EN IEEE Transactions on Emerging Topics in Computational Intelligence 2022-02-11

A Vision-based Irregular Obstacle Avoidance Framework via Deep Reinforcement Learning

OPENALEX - Publications

Lingping Gao Jianchuan Ding Wenxi Liu Haiyin Piao Yuxin Wang and 2 more

Deep reinforcement learning has achieved great success in laser-based collision avoidance work because the laser can sense accurate depth information without too much redundant data, which maintain robustness of algorithm when it is migrated from simulation environment to real world. However, high-cost devices are not only difficult apply on a large scale but also have poor irregular objects, e.g., tables, chairs, shelves, etc. In this paper, we propose vision-based framework solve...

10.1109/iros51168.2021.9636512 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021-09-27

Spatiotemporal Relationship Cognitive Learning for Multirobot Air Combat

OPENALEX - Publications

Haiyin Piao Yue Han Shaoming He Chao Yu Songyuan Fan and 3 more

Relationship cognition is crucial to learning-based Multi-Robot Systems (MRSs). As an advanced application of MRSs for fierce confrontation, the relationships among autonomous air combat robots inherently present complex time-varying characteristics, which makes relationship even more difficult. However, previous studies have only focused on spatial cooperative relationships, thus ignoring potential impact temporal dynamics long-term behaviors. To tackle this drawback, we propose a novel...

10.1109/tcds.2023.3250819 article EN IEEE Transactions on Cognitive and Developmental Systems 2023-03-01

Tube-based robust reinforcement learning for autonomous maneuver decision for UCAVs

OPENALEX - Publications

Lixin Wang Sizhuang Zheng Haiyin Piao Changqian LU Ting Yue and 1 more

Reinforcement Learning (RL) algorithms enhance intelligence of air combat Autonomous Maneuver Decision (AMD) policy, but they may underperform in target environments with disturbances. To the robustness AMD strategy learned by RL, this study proposes a Tube-based Robust RL (TRRL) method. First, introduces tube to describe reachable trajectories under disturbances, formulates method for calculating tubes based on sum-of-squares programming, and TRRL algorithm that enhances utilizing size as...

10.1016/j.cja.2024.03.025 article EN cc-by-nc-nd Chinese Journal of Aeronautics 2024-03-20

Coordinated Proximal Policy Optimization

OPENALEX - Publications

Zifan Wu Chao Yu Deheng Ye Junge Zhang Haiyin Piao and 1 more

We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original (PPO) to multi-agent setting. The key idea lies in coordinated adaptation of step size during policy update process among multiple agents. prove monotonicity improvement when optimizing a theoretically-grounded joint objective, and derive simplified optimization objective based on set approximations. then interpret such CoPPO can achieve dynamic credit assignment agents, thereby alleviating...

10.48550/arxiv.2111.04051 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Three-Dimensional Bearing-Only Helical Homing Guidance

OPENALEX - Publications

Wang Ya-dong Ziyi Wu Haiyin Piao Shaoming He

This paper presents a three-dimensional (3D) homing guidance law against stationary target by using only bearing or angle measurement. The unobservable conditions of the 3D bearing-only relative kinematics are derived and weak observability under classical proportional navigation (PNG) is revealed. An analytical observability-enhancement considering energy consumption interception accuracy then designed based on optimal control theory in plane. To further improve system observability, we...

10.1109/taes.2024.3380585 article EN IEEE Transactions on Aerospace and Electronic Systems 2024-03-26

Discovering Expert-Level Air Combat Knowledge via Deep Excitatory-Inhibitory Factorized Reinforcement Learning

OPENALEX - Publications

Haiyin Piao Shengqi Yang Hechang Chen Junnan Li Jin Yu and 5 more

Artificial Intelligence (AI) has achieved a wide range of successes in autonomous air combat decision-making recently. Previous research demonstrated that AI-enabled approaches could even acquire beyond human-level capabilities. However, there remains lack evidence regarding two major difficulties. First, the existing methods with fixed decision intervals are mostly devoted to solving what act but merely pay attention when act, which occasionally misses optimal opportunities. Second, method...

10.1145/3653979 article EN ACM Transactions on Intelligent Systems and Technology 2024-03-27

UAV Maneuvering Decision-Making Algorithm Based on Deep Reinforcement Learning Under the Guidance of Expert Experience

OPENALEX - Publications

Guang Zhan Kun Zhang Ke Li Haiyin Piao

Autonomous umanned aerial vehicle (UAV) manipulation is necessary for the defense department to execute tactical missions given by commanders in future unmanned battlefield. A large amount of research has been devoted improving autonomous decision-making ability UAV an interactive environment, where finding optimal maneuvering policy became one key issues enabling intelligence UAV. In this paper, we propose a algorithm air-delivery based on deep reinforcement learning under guidance expert...

10.23919/jsee.2024.000022 article EN Journal of Systems Engineering and Electronics 2024-04-23

Coming Soon ...