<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">4</article-id>
      <article-id pub-id-type="doi">10.18721/JCSTCS.15404</article-id>
      <title-group>
        <article-title>Research and comparative analysis of the effectiveness of software and hardware implementations of the operation of summing transposed matrices</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Исследование и сравнительный анализ эффективности программной и аппаратных реализаций операции суммирования транспонированных матриц</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Antonov</surname>
            <given-names>Alexander</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>antonov@eda-lab.ftk.spbstu.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Besedin</surname>
            <given-names>Denis</given-names>
          </name>
          <xref ref-type="aff" rid="aff2"/>
          <email>1310nero@mail.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Filippov</surname>
            <given-names>Aleksey</given-names>
          </name>
          <email>filippov@eda-server.ftk.spbstu.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St.Petersburg Polytechnic University</aff>
      <aff id="aff2">Peter the Great St. Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2022-12-30">
        <day>30</day>
        <month>12</month>
        <year>2022</year>
      </pub-date>
      <volume>15</volume>
      <issue>4</issue>
      <fpage>51</fpage>
      <lpage>63</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://infocom.spbstu.ru/userfiles/files/articles/2022/4/51-63.pdf"/>
      <abstract xml:lang="en">
        <p>The article is devoted to the study and comparative analysis of the software and hardware implementation of the operation of summing transposed matrices and its modified version – the operation of transposing the sum of matrices. A feature of the study is the use of high-level synthesis tools to obtain a hardware implementation. The relevance of the study is due to the widespread use of matrix operations for solving problems of various classes, the power asymptotic complexity of matrix calculations and the lack of data on the use of this toolkit in the tasks of creating hardware devices for matrix calculations. A step-by-step method of synthesis and optimization of a hardware device is proposed. A comparative study of software and hardware implementations of two computational tasks is carried out. It is shown that a large gain in the performance of hardware implementations is obtained by increasing the degree of parallelism of calculations. Additionally, conclusions are drawn about the inefficiency of attempts to achieve high clock frequencies, as well as about the increase in resources spent with increased speed due to parallelization.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>hardware implementation</kwd>
        <kwd>performance</kwd>
        <kwd>hardware costs</kwd>
        <kwd>FPGA</kwd>
        <kwd>parallel computing</kwd>
        <kwd>pipelining</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
