<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="en">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">13</article-id>
      <article-id pub-id-type="doi">10.18721/JCSTCS.18313</article-id>
      <title-group>
        <article-title>Development of a dual-loop method of intelligent traffic light control based on reinforcement learning and hourly distillation of phase strategies</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Разработка двухконтурного метода интеллектуального светофорного регулирования на основе обучения с подкреплением и почасовой дистилляции фазовых стратегий</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Sazanov</surname>
            <given-names>Arseniy</given-names>
          </name>
          <email>arseny.sazanov@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="scopus">35303230700</contrib-id>
          <contrib-id contrib-id-type="researcherid">AAH-8784-2019</contrib-id>
          <name>
            <surname>Shkodyrev</surname>
            <given-names>Vyacheslav P.</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>shkodyrev@imop.spbstu.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="scopus">6603839750</contrib-id>
          <name>
            <surname>Ustinov</surname>
            <given-names>Sergey M.</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>usm50@yandex.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St. Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-09-30">
        <day>30</day>
        <month>09</month>
        <year>2025</year>
      </pub-date>
      <volume>18</volume>
      <issue>3</issue>
      <fpage>144</fpage>
      <lpage>153</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://infocom.spbstu.ru/userfiles/files/articles/2025/3/144-153.pdf"/>
      <abstract xml:lang="en">
        <p>As urban traffic dynamics grow more complex and demands rise both for sustainable urban mobility and for cognitive technologies in transport infrastructure, this paper proposes a dual-loop method for intelligent traffic light control based on reinforcement learning (RL) and distillation of phase strategies. The first loop performs real-time control through an RL agent, while the second generates backup hourly plans from statistics of the agent's behavior. The method rests on a system-discrete model that accounts for stochastic traffic parameters and permissible control constraints. A simulation of a real intersection in SUMO shows a significant reduction in average vehicle delay compared to classical control, supporting the efficiency, sustainability and scalability of the approach. The results substantiate the feasibility of deploying the model within the intelligent transport systems of large cities and lay an engineering foundation for hybrid urban mobility management architectures.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>reinforcement learning</kwd>
        <kwd>intelligent traffic light control</kwd>
        <kwd>dual-loop control architecture</kwd>
        <kwd>traffic light controller</kwd>
        <kwd>traffic management and control</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
