<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">6</article-id>
      <article-id pub-id-type="doi">10.18721/JCSTCS.14106</article-id>
      <title-group>
        <article-title>Reinforcement learning for industrial manufacturing control system</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Усиление обучения для системы управления промышленным производством</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Hanafi</surname>
            <given-names>M. Yassine</given-names>
          </name>
          <email>hanafi.med.yassine@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="scopus">35303230700</contrib-id>
          <contrib-id contrib-id-type="researcherid">AAH-8784-2019</contrib-id>
          <name>
            <surname>Vyacheslav</surname>
            <given-names>P.</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>shkodyrev@imop.spbstu.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St.Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2021-03-31">
        <day>31</day>
        <month>03</month>
        <year>2021</year>
      </pub-date>
      <volume>14</volume>
      <issue>1</issue>
      <fpage>60</fpage>
      <lpage>69</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://infocom.spbstu.ru/userfiles/files/articles/2021/1/60-69.pdf"/>
      <abstract xml:lang="en">
        <p>The problem posed is a very general case of optimal control of a dynamic, potentially stochastic, and partially observable system for which a model is not necessarily available. We analyze the disadvantages of classical approaches of the control theory and present a new modified numerical reinforcement learning rule of machine learning algorithm. Control theory is a field that has been studied for a very long time and which deals with the behavior of dynamic systems and how to influence it. Among the best-known examples are LQG (Linear Quadratic Gaussian) or PID (Proportional Integral Derivative) controllers. Most of the existing approaches presuppose (analytical) knowledge of the dynamic system, and one of the constraints is the need to be able to free oneself from a priori models. We focus on modified reinforcement learning approach to adaptive control policy as perspective area of control of complex dynamical system under uncertainty.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>reinforcement learning</kwd>
        <kwd>multi agent system</kwd>
        <kwd>oil manufacturing</kwd>
        <kwd>Bellman equations</kwd>
        <kwd>dynamic programming</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
