Header menu link for other important links
X
A constrained optimization perspective on actor–critic algorithms and application to network routing
, H.L. Prasad, Bhatnagar Shalabh,
Published in Elsevier BV
2016
Volume: 92
   
Pages: 46 - 51
Abstract

We propose a novel actor–critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution of a certain non-linear optimization problem. We also discuss an extension to incorporate function approximation and demonstrate the practicality of our algorithms on a network routing application.

About the journal
JournalData powered by TypesetSystems & Control Letters
PublisherData powered by TypesetElsevier BV
Open AccessNo