Computing, Telecommunication and Control

Информатика, телекоммуникации и управление

2687-0517

10.18721/JCSTCS.18313

Development of a dual-loop method of intelligent traffic light control based on reinforcement learning and hourly distillation of phase strategies

Разработка двухконтурного метода интеллектуального светофорного регулирования на основе обучения с подкреплением и почасовой дистилляции фазовых стратегий

Sazanov

Arseniy

arseny.sazanov@gmail.com

35303230700

AAH-8784-2019

Vyacheslav

shkodyrev@imop.spbstu.ru

6603839750

Sergey M. Ustinov

Сергей

usm50@yandex.ru

Peter the Great St.Petersburg Polytechnic University

30 09 2025

18 3 144 153

With increasingly complex urban dynamics, as well as increasing demands for the sustainability of urban mobility and introduction of cognitive technologies into transport infrastructure, the paper proposes a dual-loop method for intelligent traffic light control based on reinforcement learning and phase strategy distillation procedures. The first level implements real-time control through an RL-agent, while the second one generates backup hourly plans based on statistics of its behavior. The method is based on a system-discrete model taking into account stochastic traffic parameters and permissible control constraints. The simulation conducted in SUMO for a real intersection demonstrates a significant reduction in average transport delay compared to classical control, confirming the efficiency, sustainability and scalability of the approach. The obtained results substantiate the possibility of practical implementation of the model within the framework of intelligent transport systems of large cities and for laying the engineering foundation for hybrid urban mobility management architectures.

reinforcement learning intelligent traffic light control dual-loop control architecture traffic light controller traffic management and control