A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region
Urban traffic control systems (UTCSs) are deployed to a great number of urban cities despite lacking feedback when adjusting the traffic signals. The development of reinforcement learning (RL) makes it possible to apply feedback to UTCS, and great efforts have been made on RL-based traffic control s...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2022-01-01
|
Series: | Journal of Advanced Transportation |
Online Access: | http://dx.doi.org/10.1155/2022/5681234 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832561960880701440 |
---|---|
author | Lingyu Zheng Bing Wu |
author_facet | Lingyu Zheng Bing Wu |
author_sort | Lingyu Zheng |
collection | DOAJ |
description | Urban traffic control systems (UTCSs) are deployed to a great number of urban cities despite lacking feedback when adjusting the traffic signals. The development of reinforcement learning (RL) makes it possible to apply feedback to UTCS, and great efforts have been made on RL-based traffic control strategies. However, those studies are regardless of the traffic flow theory of the network and the road users’ perspectives on the performance of traffic. This study proposes a multiagent reinforcement learning (MARL) based traffic control strategy, in which each intersection in a macroscopic fundamental diagram (MFD) region was controlled by one agent using the level of services (LOS) and MFD-based parameters as rewards. The proposed MARL strategy was evaluated by simulation in a 3×3 grid network compared with pretimed, actuated, and MFD-based traffic control strategies. The evaluation results showed that, at different demand levels, the proposed MARL strategy outperforms the other three traffic control strategies in terms of average intersection queue length and average intersection waiting time to a different extent. Results also showed that the proposed MARL dissipated the congestion faster than the other three control strategies. Results of the Friedman test indicated that the differences in performances between the proposed MARL and other strategies were statistically significant regardless of the demand level. The MFD in the testbed network controlled by the proposed MARL was different from that controlled by the pretimed strategy, especially the MFD scatter plot. It provides insights on considering the traffic flow theory of the network when applying MARL to traffic control strategies. |
format | Article |
id | doaj-art-1814dc724efd41d9a8e2090eb772615c |
institution | Kabale University |
issn | 2042-3195 |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Journal of Advanced Transportation |
spelling | doaj-art-1814dc724efd41d9a8e2090eb772615c2025-02-03T01:23:38ZengWileyJournal of Advanced Transportation2042-31952022-01-01202210.1155/2022/5681234A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram RegionLingyu Zheng0Bing Wu1College of Transport and CommunicationsKey Laboratory of Road and Traffic Engineering of Ministry of EducationUrban traffic control systems (UTCSs) are deployed to a great number of urban cities despite lacking feedback when adjusting the traffic signals. The development of reinforcement learning (RL) makes it possible to apply feedback to UTCS, and great efforts have been made on RL-based traffic control strategies. However, those studies are regardless of the traffic flow theory of the network and the road users’ perspectives on the performance of traffic. This study proposes a multiagent reinforcement learning (MARL) based traffic control strategy, in which each intersection in a macroscopic fundamental diagram (MFD) region was controlled by one agent using the level of services (LOS) and MFD-based parameters as rewards. The proposed MARL strategy was evaluated by simulation in a 3×3 grid network compared with pretimed, actuated, and MFD-based traffic control strategies. The evaluation results showed that, at different demand levels, the proposed MARL strategy outperforms the other three traffic control strategies in terms of average intersection queue length and average intersection waiting time to a different extent. Results also showed that the proposed MARL dissipated the congestion faster than the other three control strategies. Results of the Friedman test indicated that the differences in performances between the proposed MARL and other strategies were statistically significant regardless of the demand level. The MFD in the testbed network controlled by the proposed MARL was different from that controlled by the pretimed strategy, especially the MFD scatter plot. It provides insights on considering the traffic flow theory of the network when applying MARL to traffic control strategies.http://dx.doi.org/10.1155/2022/5681234 |
spellingShingle | Lingyu Zheng Bing Wu A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region Journal of Advanced Transportation |
title | A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region |
title_full | A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region |
title_fullStr | A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region |
title_full_unstemmed | A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region |
title_short | A Reinforcement Learning Based Traffic Control Strategy in a Macroscopic Fundamental Diagram Region |
title_sort | reinforcement learning based traffic control strategy in a macroscopic fundamental diagram region |
url | http://dx.doi.org/10.1155/2022/5681234 |
work_keys_str_mv | AT lingyuzheng areinforcementlearningbasedtrafficcontrolstrategyinamacroscopicfundamentaldiagramregion AT bingwu areinforcementlearningbasedtrafficcontrolstrategyinamacroscopicfundamentaldiagramregion AT lingyuzheng reinforcementlearningbasedtrafficcontrolstrategyinamacroscopicfundamentaldiagramregion AT bingwu reinforcementlearningbasedtrafficcontrolstrategyinamacroscopicfundamentaldiagramregion |