- Thermal Radiation and Cooling Technologies
- Transition Metal Oxide Nanomaterials
- Aerodynamics and Fluid Dynamics Research
- Assembly Line Balancing Optimization
- Pigment Synthesis and Properties
- Adaptive Dynamic Programming Control
- Advanced Manufacturing and Logistics Optimization
- Gas Sensing Nanomaterials and Sensors
- Scheduling and Optimization Algorithms
- Military Defense Systems Analysis
- Aeroelasticity and Vibration Control
- Multilevel Inverters and Converters
- Agricultural risk and resilience
- 3D IC and TSV technologies
- Urban Heat Island Mitigation
- Wind Turbine Control Systems
- Wireless Networks and Protocols
- Thin-Film Transistor Technologies
- Reinforcement Learning in Robotics
- Low-power high-performance VLSI design
- Remote Sensing and LiDAR Applications
- Catalysis and Oxidation Reactions
- Mechanical Circulatory Support Devices
- Cognitive Radio Networks and Spectrum Sensing
- Ga2O3 and related materials
Guangdong University of Technology
2023-2024
Ministry of Education of the People's Republic of China
2024
Tianjin University
2024
Qingdao Agricultural University
2021
Yanshan University
2015
Policy iteration (PI), an iterative method in reinforcement learning, has the merit of interactions with a little-known environment to learn decision law through policy evaluation and improvement. However, existing PI-based results for output-feedback (OPFB) continuous-time systems relied heavily on initial stabilizing full state-feedback (FSFB) policy. It thus raises question violating OPFB principle. This article addresses such establishes PI under We prove that off-policy Bellman equation...
Because unmanned combat platforms can enhance capabilities and expand areas, they minimize casualties play an important role in wars. With the increasing application of platforms, need for multi-domain coordinated operations land, sea, air has become prominent. The effectiveness traditional single-platform single-area model is extremely limited no longer meets needs warfare. Therefore, mode company gradually developed from a single platform to more flexible multi-platform cluster combat. How...
Many redundancy based hardened latch would become invalid when multiple nodes collect charge concurrently. To mitigate multi-node collection, a novel on the combinational of layout and circuit design is proposed. The number sensitive node pairs reduced by design. And adjusting placement transistors in layout, distance between increased. SPICE simulation results illustrate that proposed significantly reduce pairs. performance mitigating multinode collection improved.
With the neutral-point-clamped (NPC) converter widely applied to direct-drive wind energy generation system, wheather guarantee dc-link neutral-point potential (NPP) balance has a crucial influence on system.Research gridside NPC three-level converter, firstly, this paper established NPP fluctuation mathematical model, analyzed mechanism of AC-DC fluctuations, deduced relationship between error calculation and system operation mode (rectifier-mode or inverter-mode), control model.Secondly,...
This paper presents data-driven output regulation designs for unknown discrete-time systems with the prescribed convergence rate. After integrating rate specified by designer, tracking problem is composed of learning two components: a feedback control gain and solutions to regulator equations. The RL algorithm can learn from system measurements auxiliary without knowing its full dynamics. Then equations associated are solved as feedforward leveraging data rather than dynamic model. proposed...
The integration of radiative cooling and thermochromic technology methods, as a passive-regulating environmentally friendly thermal management strategy, is highly desirable in reshaping the global energy landscape. Despite numerous efforts, most designs for are focused on improving modulation performance non-transmissive devices, often neglecting transmissive high emissivity devices. Herein, we propose transmittance thermochromic(T-PRCT) smart window based phase change material, capable...
In this paper, we investigate a learning-based framework that addresses the point-to-point vehicular navigation problem through robust output regulation approach and test it Ultra-Wide Band (UWB) communication noise data. Our focus is on control policy design ensuring stability of closed-loop system for vehicle dynamics. It seen our proposed solution, which relies reinforcement learning, data-driven does not require accurate knowledge motion model. Specifically, explore optimal tracking...