<!DOCTYPE article PUBLIC '-//NLM//DTD Journal Publishing DTD v2.1 20050630//EN' 'http://uploads.ingentaconnect.com/docs/dtd/ingenta-journalpublishing.dtd'>
<article article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="aggregator">72010604</journal-id>
      <journal-title>Electronic Imaging</journal-title>
      <issn pub-type="ppub">2470-1173</issn><issn pub-type="epub"></issn>
      <publisher>
        <publisher-name>Society for Imaging Science and Technology</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.2352/ISSN.2470-1173.2017.14.HVEI-122</article-id>
      <article-id pub-id-type="sici">2470-1173(20170129)2017:14L.91;1-</article-id>
      <article-id pub-id-type="publisher-id">s11.phd</article-id>
      <article-id pub-id-type="other">/ist/ei/2017/00002017/00000014/art00011</article-id>
      <article-categories>
        <subj-group>
          <subject>Articles</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Balancing Type I Errors and Statistical Power in Video Quality Assessment</article-title>
      </title-group>
      <contrib-group>
        <contrib>
          <name>
            <surname>Brunnström</surname>
            <given-names>Kjell.</given-names>
          </name>
        </contrib>
        <contrib>
          <name>
            <surname>Barkowsky</surname>
            <given-names>Marcus</given-names>
          </name>
        </contrib>
      </contrib-group>
      <pub-date>
        <day>29</day>
        <month>01</month>
        <year>2017</year>
      </pub-date>
      <volume>2017</volume>
      <issue>14</issue>
      <fpage>91</fpage>
      <lpage>96</lpage>
      <permissions>
        <copyright-year>2017</copyright-year>
      </permissions>
      <abstract>
        <p>This paper analyzes how an experimenter can balance errors in subjective video quality tests between the statistical power of finding an effect if it is there and not claiming that an effect is there if the effect it is not there i.e. balancing Type I and Type II errors. The risk of
 committing Type I errors increases with the number of comparisons that are performed in statistical tests. We will show that when controlling for this and at the same time keeping the power of the experiment at a reasonably high level, it will require more test subjects than are normally used
 and recommended by international standardization bodies like the ITU. Examples will also be given for the influence of Type I error on the statistical significance of comparing objective metrics by correlation.</p>
      </abstract>
      <kwd-group>
        <kwd>TYPE I ERROR</kwd>
        <kwd>TYPE II ERROR</kwd>
        <kwd>POWER</kwd>
        <kwd>VIDEO QUALIY</kwd>
        <kwd>SUBJECTIVE EXPERIMENTS</kwd>
        <kwd>OBJECTIVE</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
