Qichao Zhang

ORCID: 0000-0001-9747-391X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Autonomous Vehicle Technology and Safety
  • Reinforcement Learning in Robotics
  • Adaptive Dynamic Programming Control
  • Traffic control and management
  • Advanced Neural Network Applications
  • Traffic and Road Safety
  • Adaptive Control of Nonlinear Systems
  • Mechanical Circulatory Support Devices
  • Viral Infections and Vectors
  • Job Satisfaction and Organizational Behavior
  • Hydrogen embrittlement and corrosion behaviors in metals
  • Traffic Prediction and Management Techniques
  • Frequency Control in Power Systems
  • Corrosion Behavior and Inhibition
  • Occupational Health and Safety Research
  • Remote Sensing and LiDAR Applications
  • Robotic Path Planning Algorithms
  • Infrastructure Maintenance and Monitoring
  • Risk and Safety Analysis
  • Vehicle License Plate Recognition
  • Nuclear Materials and Properties
  • Advanced Vision and Imaging
  • Distributed Control Multi-Agent Systems
  • Video Surveillance and Tracking Methods
  • Neural Networks Stability and Synchronization

University of Chinese Academy of Sciences
2016-2024

Institute of Oceanology
2016-2024

Chinese Academy of Sciences
2015-2024

Zhengzhou University of Light Industry
2024

Institute of Automation
2017-2024

State Grid Corporation of China (China)
2024

Beijing Information Science & Technology University
2021-2024

Beijing Academy of Artificial Intelligence
2018-2024

Southwest Jiaotong University
2024

Peng Cheng Laboratory
2022-2023

In this paper, an approximate online equilibrium solution is developed for N -player nonzero-sum (NZS) game systems with completely unknown dynamics. First, a model identifier based on three-layer neural network (NN) established to reconstruct the NZS games systems. Moreover, weight vector updated experience replay technique which can relax traditional persistence of excitation condition simplified recorded data. Then, single-network adaptive dynamic programming (ADP) algorithm proposed each...

10.1109/tcyb.2015.2488680 article EN IEEE Transactions on Cybernetics 2015-10-26

In this paper, the H <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">∞</sub> optimal control problem for a class of continuous-time nonlinear systems is investigated using event-triggered method. First, formulated as two-player zero-sum (ZS) differential game. Then, an adaptive triggering condition derived ZS game with policy and time-triggered disturbance policy. The controller updated only when not satisfied. Therefore, communication between...

10.1109/tsmc.2016.2531680 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2016-03-28

In this paper, the infinite-horizon robust optimal control problem for a class of continuous-time uncertain nonlinear systems is investigated by using data-based adaptive critic designs. The neural network identification scheme combined with traditional technique, in order to design under environment. First, controller original system specified cost function established adding feedback gain nominal system. Then, identifier employed reconstruct unknown dynamics stability analysis. Hence,...

10.1109/tsmc.2015.2492941 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2015-10-30

In this paper, the robust control problem for a class of continuous-time nonlinear system with unmatched uncertainties is investigated using an event-based method. First, transformed into corresponding optimal augmented and appropriate cost function. Under mechanism, we prove that solution can asymptotically stabilize uncertain adaptive triggering condition. That is, designed controller to original system. Note updated only when condition satisfied, which save communication resources between...

10.1109/tnnls.2016.2614002 article EN IEEE Transactions on Neural Networks and Learning Systems 2016-10-17

This paper investigates the automatic exploration problem under unknown environment, which is key point of applying robotic system to some social tasks. The solution this via stacking decision rules impossible cover various environments and sensor properties. Learning-based control methods are adaptive for these scenarios. However, damaged by low learning efficiency awkward transferability from simulation reality. In paper, we construct a general framework decomposing process into decision,...

10.1109/tnnls.2019.2927869 article EN IEEE Transactions on Neural Networks and Learning Systems 2019-08-06

t his paper investigates the vision- based autonomous driving with deep learning and reinforcement methods.Different from end-to-end method, our method breaks vision-based lateral control system down into a perception module module.The which is on multi-task neural network first takes driver-view image as its input predicts track features.The then makes decision these features.In order to improve data efficiency, we propose visual TORCS (VTORCS), environment open racing car simulator...

10.1109/mci.2019.2901089 article EN IEEE Computational Intelligence Magazine 2019-04-12

Autonomous driving decision-making is a great challenge due to the complexity and uncertainty of traffic environment. Combined with rule-based constraints, Deep Q-Network (DQN) based method applied for autonomous lane change task in this study. Through combination high-level lateral low-level trajectory modification, safe efficient behavior can be achieved. With setting our state representation reward function, trained agent able take appropriate actions real-world-like simulator. The...

10.1109/ijcnn.2019.8852110 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2019-07-01

The Practical Byzantine Fault Tolerance algorithm (PBFT)has been highly applied in consortium blockchain systems, however, this kind of consensus can hardly identify and remove faulty nodes time, also vulnerable to many attacks against the primary node PBFT. equality members' discourse rights is inapplicable some real scenarios where dominating members are likely have a larger voting process. To address these problems, paper presents Reputation-based (RBFT)algorithm that incorporates...

10.1109/padsw.2018.8644933 article EN 2018-12-01

This paper is concerned about the nonlinear optimization problem of nonzero-sum (NZS) games with unknown drift dynamics. The data-based integral reinforcement learning (IRL) method proposed to approximate Nash equilibrium NZS iteratively. Furthermore, we prove that IRL equivalent model-based policy iteration algorithm, which guarantees convergence method. For implementation purpose, a single-critic neural network structure for given. To enhance application capability method, design updating...

10.1109/tcyb.2018.2830820 article EN IEEE Transactions on Cybernetics 2018-05-16

Self-supervised depth estimation draws a lot of attention recently as it can promote the 3D sensing capa-bilities self-driving vehicles. However, intrinsically relies upon photometric consistency assumption, which hardly holds during nighttime. Although various supervised night-time image enhancement methods have been proposed, their generalization performance in challenging driving scenarios is not satisfactory. To this end, we propose first method that jointly learns nighttime enhancer and...

10.1109/icra48891.2023.10160708 article EN 2023-05-29

Named data networking (NDN) enables fast and efficient content dissemination in mission-critical unmanned aerial vehicle ad hoc networks (UAANETs); however, its in-network caching mechanism brings a new security challenge: poisoning. Poisoned can contaminate the cache on routers isolate valid from network, leading to performance degradation or denial of service. To mitigate such attacks enhance network-layer trust NDN-based UAANETs, this article proposes novel systematic framework that...

10.1109/mcom.2019.1800722 article EN IEEE Communications Magazine 2019-06-01

Stable carbocations such as tritylium ions have been widely explored organic Lewis acid catalysts and reagents in synthesis. However, achieving asymmetric carbocation catalysis remains elusive ever since they were first identified over one century ago. The challenges mainly come from their limited compatibility, scarcity of chiral carbocations, well the extremely low barrier to racemization carbenium ions. We reported here a latent concept for catalysis. In this strategy, trityl phosphate is...

10.1021/jacs.5b11085 article EN Journal of the American Chemical Society 2015-11-23

As job insecurity becomes increasingly common, seeking its palliatives has become a hot topic for scholars, especially high-speed railway drivers who are vital the development of China's railway. Researches have demonstrated that, organizational support is valuable psychosocial resource that can alleviate individual negative behavior, yet buffering effect between and safety performance attracted little attention. In this study, in field was identified as supervisory coworker safety. Using...

10.1016/j.ssci.2019.04.017 article EN cc-by-nc-nd Safety Science 2019-04-25

Sum of squares (SOS) polynomials have provided a computationally tractable way to deal with inequality constraints appearing in many control problems. It can also act as an approximator the framework adaptive dynamic programming. In this paper, approximate solution optimal polynomial nonlinear systems is proposed. Under given attenuation coefficient, Hamilton-Jacobi-Isaacs equation relaxed optimization problem set inequalities. After applying policy iteration technique and constraining...

10.1109/tcyb.2016.2643687 article EN IEEE Transactions on Cybernetics 2017-01-10

Block copolymer polymersomes offer considerable access for applications in a variety of fields; however, the traditional cosolvent self-assembly method can only produce at low solids content (typically <1%). Recently, an situ growth method, termed polymerization-induced (PISA), has been developed to allow preparation high (10–50%). Synthesis and block copolymers occur simultaneously PISA, therefore, morphological evolution occurs throughout polymerization. It is highly desirable provide...

10.1021/acs.macromol.0c01624 article EN Macromolecules 2020-10-05

Deep reinforcement learning (DRL), combining the perception capability of deep (DL) and decision-making (RL) [1], has been widely investigated for autonomous driving tasks. In this letter, we would like to discuss impact different types state input on performance DRL-based lane change decision-making.

10.1109/jas.2021.1004395 article EN IEEE/CAA Journal of Automatica Sinica 2021-12-28

Generalizing policies to unseen scenarios remains a critical challenge in visual reinforcement learning, where agents often overfit the specific observations of training environment. In environments, distracting pixels may lead extract representations containing task-irrelevant information. As result, deviate from optimal behaviors learned during training, thereby hindering generalization.To address this issue, we propose Salience-Invariant Consistent Policy Learning (SCPL) algorithm, an...

10.48550/arxiv.2502.08336 preprint EN arXiv (Cornell University) 2025-02-12
Coming Soon ...