NFDI4DS | UHH-SEMS - Publication Details

Solving Time-Continuous Stochastic Optimal Control Problems: Algorithm Design and Convergence Analysis of Actor-Critic Flow

Optimization and Control (math.OC) G.1.6; G.1.8 G.1.6 G.1.8 FOS: Mathematics 93E20 (Primary), 49L12 49M25 (secondary) Mathematics - Optimization and Control

DOI: 10.48550/arxiv.2402.17208 Publication Date: 2024-01-01

Abstract Supplemental Material References Cited by

AUTHORS (2)

Zhou, Mo

Lu, Jianfeng

ABSTRACT

We propose an actor-critic framework to solve the time-continuous stochastic optimal control problem. A least square temporal difference method is applied to compute the value function for the critic. The policy gradient method is implemented as policy improvement for the actor. Our key contribution lies in establishing a linear rate of convergence for our proposed actor-critic flow. Theoretical findings are further validated through numerical examples, showing the efficacy of our approach in practical applications.<br/>arXiv admin note: text overlap with arXiv:2302.05816<br/>

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

Solving Time-Continuous Stochastic Optimal Control Problems: Algorithm Design and Convergence Analysis of Actor-Critic Flow

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....