Modelling herd behaviour in traffic jams using Markov chains-based reinforcement learning.

Citation:

Heddar Y, Fourar Y-O, Djebabra M. Modelling herd behaviour in traffic jams using Markov chains-based reinforcement learning. International Journal of Simulation and Process Modelling [Internet]. 2026;23 (2) :119-133.

Abstract:

Traffic congestion remains a persistent problem that compromises road safety. This phenomenon is often amplified by driver behaviors, particularly those characterized by the herd effect. This study aims to model the emergence and dynamics of the herd effect in traffic jams and to simulate a strategy for mitigating this behavior among drivers. To achieve these objectives, reinforcement learning (RL) was employed within the frameworks of memoryless Markov chains and multi-phase Markov chains. The results demonstrate the effectiveness of Markov chains in accurately modeling the collective behavior of specific drivers. Likewise, the simulations illustrate RL’s capacity to regulate the herd effect and optimize individual decision-making during congestion. The findings suggest that traffic authorities may consider implementing RL-based strategies to mitigate herd behavior, improve traffic flow, and enhance road safety.

Publisher's Version

Last updated on 05/20/2026