<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="en">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computing, Telecommunication and Control</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Информатика, телекоммуникации и управление</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2687-0517</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">3</article-id>
      <title-group>
        <article-title>Voice activity detector based on pitch statistics for speaker recognition</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Алгоритм обнаружения речевой активности на основе статистик основного тона в задаче распознавания диктора</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Simonchik</surname>
            <given-names>Konstantin</given-names>
          </name>
          <email>simonchik@speechpro.com</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Galinina</surname>
            <given-names>Olga</given-names>
          </name>
          <email>olga.galinina@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Kapustin</surname>
            <given-names>Alexey</given-names>
          </name>
          <email>kapustinalex@yandex.ru</email>
        </contrib>
      </contrib-group>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2010-08-10">
        <day>10</day>
        <month>08</month>
        <year>2010</year>
      </pub-date>
      <issue>4</issue>
      <issue-id pub-id-type="publisher-id">103</issue-id>
      <fpage>18</fpage>
      <lpage>23</lpage>
      <abstract xml:lang="en">
        <p>A modification of the voice activity detection (VAD) based on pitch statistics is proposed for improving speaker recognition performance. Comparison of two approaches for the VAD choice for phonogram preprocessing is presented. The first approach is a modification of pitch statistics VAD, the second one represents the simple energy-based scheme. A quantitative measurement of verification system quality is also presented. The experimental results show that the proposed VAD essentially improves the performance of the speaker recognition systems.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>voice activity detector</kwd>
        <kwd>VAD</kwd>
        <kwd>speaker recognition</kwd>
        <kwd>pitch statistics</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
