Reinforcement learning with average cost for adaptive control of traffic lights at intersections

L. A. Prashanth; Bhatnagar S.

doi:10.1109/ITSC.2011.6082823

Profiles Research Units Publications

Other

Reinforcement learning with average cost for adaptive control of traffic lights at intersections

, Bhatnagar S.

Published in IEEE

2011

DOI: 10.1109/ITSC.2011.6082823

Pages: 1640 - 1645

Abstract

We propose for the first time two reinforcement learning algorithms with function approximation for average cost adaptive control of traffic lights. One of these algorithms is a version of Q-learning with function approximation while the other is a policy gradient actor-critic algorithm that incorporates multi-timescale stochastic approximation. We show performance comparisons on various network settings of these algorithms with a range of fixed timing algorithms, as well as a Q-learning algorithm with full state representation that we also implement. We observe that whereas (as expected) on a two-junction corridor, the full state representation algorithm shows the best results, this algorithm is not implementable on larger road networks. The algorithm PG-AC-TLC that we propose is seen to show the best overall performance. © 2011 IEEE.

Topics: Approximation algorithm (64)%, Weighted Majority Algorithm (63)%, Q-learning (58)%, Reinforcement learning (56)% and Function approximation (55)%

View more info for "Reinforcement learning with average cost for adaptive control of traffic lights at intersections"

About the journal

Journal	Data powered by TypesetIEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
Publisher	Data powered by TypesetIEEE
Open Access	No

Authors (1)

L. A. Prashanth
- Department of Computer Science and Engineering

ABOUT IIT MADRAS

R & D

RANKINGS & ACHIEVEMENTS

QUICK FIND