An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Zero-sum game
DOI: 10.1049/cth2.12610 Publication Date: 2023-12-25T04:45:07Z
ABSTRACT
Abstract To reduce the learning time and space occupation, this study presents a novel model‐free algorithm for obtaining Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on integral reinforcement method, new HJ equation that can quickly cooperatively determine strategies all players is proposed. By leveraging neural network approximation gradient descent simultaneous adaptive tuning laws are provided both critic actor weights. These facilitate estimation optimal value function policy without requiring knowledge or identification system's dynamics. The closed‐loop system stability convergence weights guaranteed through Lyapunov analysis. Additionally, enhanced to number auxiliary NNs used in critic. simulation results two‐player game validate effectiveness proposed algorithm.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (43)
CITATIONS (0)