<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">2</article-id>
      <title-group>
        <article-title>Improving the precision of Bayesian classifier for text documents</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Повышение точности байесовского классификатора текстовых документов</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0001-6960-0942</contrib-id>
          <contrib-id contrib-id-type="scopus">22956402700</contrib-id>
          <contrib-id contrib-id-type="researcherid">K-3059-2012</contrib-id>
          <name>
            <surname>Peter</surname>
            <given-names>V.</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>petert@dcn.icc.spbstu.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St.Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2010-02-10">
        <day>10</day>
        <month>02</month>
        <year>2010</year>
      </pub-date>
      <issue>1</issue>
      <issue-id pub-id-type="publisher-id">93</issue-id>
      <fpage>12</fpage>
      <lpage>18</lpage>
      <abstract xml:lang="en">
        <p>The problem of automatic text document categorization is considered. It is shown that the immediate implementation of Bayesian classifier provides quite low categorization precision due to strong statistical dependence of feature vector elements. A solution to this problem is proposed, which is based on transforming the document feature vector into the one with higher dimension and weaker statistical dependency between its elements.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>automatic categorization</kwd>
        <kwd>Bayesian classifier</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
