Header menu link for other important links
X
Reinforcement learning with average cost for adaptive control of traffic lights at intersections
, Bhatnagar S.
Published in IEEE
2011
Pages: 1640 - 1645
Abstract
We propose for the first time two reinforcement learning algorithms with function approximation for average cost adaptive control of traffic lights. One of these algorithms is a version of Q-learning with function approximation while the other is a policy gradient actor-critic algorithm that incorporates multi-timescale stochastic approximation. We show performance comparisons on various network settings of these algorithms with a range of fixed timing algorithms, as well as a Q-learning algorithm with full state representation that we also implement. We observe that whereas (as expected) on a two-junction corridor, the full state representation algorithm shows the best results, this algorithm is not implementable on larger road networks. The algorithm PG-AC-TLC that we propose is seen to show the best overall performance. © 2011 IEEE.
About the journal
JournalData powered by TypesetIEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
PublisherData powered by TypesetIEEE
Open AccessNo