<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">13</article-id>
      <title-group>
        <article-title>WEB-page structure and text monitoring with a robot of search-engine</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Динамическое отслеживание модулем информационно-поисковой системы изменений в структуре или тексте интернет-ресурса</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Ivankov</surname>
            <given-names>Alexey</given-names>
          </name>
          <email>a.vnkv1@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Eliseev</surname>
            <given-names>Dmitry</given-names>
          </name>
        </contrib>
      </contrib-group>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2010-06-10">
        <day>10</day>
        <month>06</month>
        <year>2010</year>
      </pub-date>
      <issue>3</issue>
      <issue-id pub-id-type="publisher-id">101</issue-id>
      <fpage>86</fpage>
      <lpage>92</lpage>
      <abstract xml:lang="en">
        <p>The algorithm for the dynamic monitoring of Web-page structure and text is developed. The algorithm is implemented as a robot of search-engine. Document structure changes are estimated as tree-edit distance. Vector model is in use to estimate the changes into the text. Semantics hierarchy to be obtained from the HTML source code is not an efficient tool for the case the structure is changed significantly.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>search-engine</kwd>
        <kwd>information retrieving</kwd>
        <kwd>semantic changes</kwd>
        <kwd>hierarchical structure of document</kwd>
        <kwd>HTML-grammar</kwd>
        <kwd>vector model of document</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
