Seminar Information
In this presentation we present an iterative forward-backward algorithm which aims to solve discrete time stochastic optimal control problems. It is inspired by both max-plus numerical methods introduced by McEneaney and the Stochastic Dual Dynamic Programming (SDDP) algorithm of Pereira and Pinto.
At each time step, our algorithm builds two sequences of approximations of the (Bellman) value function respectively as a max-plus linear combinations of basic functions and as a min-plus linear combinations of other basic functions.