<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Tarek Amine, Haddad</style></author><author><style face="normal" font="default" size="100%">Djalal HEDJAZI</style></author><author><style face="normal" font="default" size="100%">Sofiane Aouag</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">A deep reinforcement learning-based cooperative approach for multi-intersection traffic signal control</style></title><secondary-title><style face="normal" font="default" size="100%">Engineering Applications of Artificial Intelligence</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2022</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://doi-org.sndl1.arn.dz/10.1016/j.engappai.2022.105019</style></url></web-urls></urls><volume><style face="normal" font="default" size="100%">114 (2022)</style></volume><pages><style face="normal" font="default" size="100%">105019</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Recently, Adaptive Traffic Signal Control (&lt;em&gt;ATSC&lt;/em&gt;&lt;span&gt;) in the multi-intersection system is considered as one of the most critical issues in &lt;a class=&quot;topic-link&quot; href=&quot;https://www-sciencedirect-com.sndl1.arn.dz/topics/engineering/intelligent-transportation-system&quot; title=&quot;Learn more about Intelligent Transportation Systems from ScienceDirect's AI-generated Topic Pages&quot;&gt;Intelligent Transportation Systems&lt;/a&gt; (&lt;/span&gt;&lt;em&gt;ITS&lt;/em&gt;). Among the proposed &lt;em&gt;AI&lt;/em&gt;&lt;span&gt;-based approaches, &lt;a class=&quot;topic-link&quot; href=&quot;https://www-sciencedirect-com.sndl1.arn.dz/topics/computer-science/deep-reinforcement-learning&quot; title=&quot;Learn more about Deep Reinforcement Learning from ScienceDirect's AI-generated Topic Pages&quot;&gt;Deep Reinforcement Learning&lt;/a&gt; (&lt;/span&gt;&lt;em&gt;DRL&lt;/em&gt;) has been largely applied while showing better performances. This paper proposes a new &lt;em&gt;DRL&lt;/em&gt;-based cooperative approach for controlling multiple intersections. The problem is modelled as a Multi-Agent Reinforcement Learning (&lt;em&gt;MARL&lt;/em&gt;&lt;span&gt;) system, while each agent is trained to select the best action to control an intersection by obtaining information about its local lanes state. The cooperation aspect is manifested in this approach by considering the effect of the state, action and reward of neighbour agents in the process of policy learning. An &lt;a class=&quot;topic-link&quot; href=&quot;https://www-sciencedirect-com.sndl1.arn.dz/topics/computer-science/intersection-controller&quot; title=&quot;Learn more about intersection controller from ScienceDirect's AI-generated Topic Pages&quot;&gt;intersection controller&lt;/a&gt; applies a Deep Q-Network (&lt;/span&gt;&lt;em&gt;DQN&lt;/em&gt;) method, while transferring state, action and reward received from their neighbour agents to its own loss function during the learning process. The experimental results under different scenarios shows that the proposed approach outperforms many state-of-the-art approaches in terms of three metrics: Average Waiting Time (&lt;em&gt;AWT&lt;/em&gt;&lt;span&gt;), &lt;a class=&quot;topic-link&quot; href=&quot;https://www-sciencedirect-com.sndl1.arn.dz/topics/computer-science/average-queue-length&quot; title=&quot;Learn more about Average Queue Length from ScienceDirect's AI-generated Topic Pages&quot;&gt;Average Queue Length&lt;/a&gt; (&lt;/span&gt;&lt;em&gt;AQL&lt;/em&gt;) and Average Emission CO&lt;sub&gt;2&lt;/sub&gt; (&lt;em&gt;AEC&lt;/em&gt;). In addition, the cooperation between the different trained &lt;em&gt;DRL&lt;/em&gt;-based controllers allows the system to continuously learn and improve its performance by interacting with the environment, particularly when the traffic is congested.</style></abstract></record></records></xml>