<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">1</article-id>
      <article-id pub-id-type="doi">10.18721/JCSTCS.16201</article-id>
      <title-group>
        <article-title>Flexible deep forest classifier with multi-head attention</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Гибкий классификатор на основе глубокого леса с использованием многомерного внимания</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Konstantinov</surname>
            <given-names>Andrei</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>andrue.konst@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0002-5637-1420</contrib-id>
          <contrib-id contrib-id-type="scopus">7004013271</contrib-id>
          <contrib-id contrib-id-type="researcherid">F-6480-2013</contrib-id>
          <name>
            <surname>Lev</surname>
            <given-names>V.</given-names>
          </name>
          <xref ref-type="aff" rid="aff2"/>
          <email>lev.utkin@mail.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0003-2275-1473</contrib-id>
          <name>
            <surname>Kirpichenko</surname>
            <given-names>Stanislav</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>kirpichenko.sr@gmail.com</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St. Petersburg Polytechnic University</aff>
      <aff id="aff2">Peter the Great St.Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2023-06-30">
        <day>30</day>
        <month>06</month>
        <year>2023</year>
      </pub-date>
      <volume>16</volume>
      <issue>2</issue>
      <fpage>7</fpage>
      <lpage>16</lpage>
      <abstract xml:lang="en">
        <p>A new modification of the deep forest (DF), called the attention-based deep forest (ABDF), for solving classification problems is proposed in the paper. The main idea behind the modification is to use the attention mechanism to aggregate predictions of the random forests at each level of the DF to enhance the classification performance of the DF. The attention mechanism is implemented by assigning the attention weights with trainable parameters to class probability vectors. The trainable parameters are determined by solving an optimization problem minimizing the loss function of predictions at each level of the DF. In order to reduce the number of random forests, the multi-head attention is incorporated into the DF. Numerical experiments with real data illustrate the ABDF and compare it with the original DF.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>machine learning</kwd>
        <kwd>classification</kwd>
        <kwd>random forest</kwd>
        <kwd>decision tree</kwd>
        <kwd>deep learning</kwd>
        <kwd>attention mechanism</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
