Back to articles
Articles
Volume: 32 | Article ID: art00002
Image
Improved Temporal Pooling for Perceptual Video Quality Assessment Using VMAF
  DOI :  10.2352/ISSN.2470-1173.2020.11.HVEI-068  Published OnlineJanuary 2020
Abstract

The Video Multimethod Assessment Fusion (VMAF) method, proposed by Netflix, offers an automated estimation of perceptual video quality for each frame of a video sequence. Then, the arithmetic mean of the per-frame quality measurements is taken by default, in order to obtain an estimate of the overall Quality of Experience (QoE) of the video sequence. In this paper, we validate the hypothesis that the arithmetic mean conceals the bad quality frames, leading to an overestimation of the provided quality. We also show that the Minkowski mean (appropriately parametrized) approximates well the subjectively measured QoE, providing superior Spearman Rank Correlation Coefficient (SRCC), Pearson Correlation Coefficient (PCC), and Root-Mean-Square-Error (RMSE) scores.

Subject Areas :
Views 30
Downloads 4
 articleview.views 30
 articleview.downloads 4
  Cite this article 

Sophia Batsi, Lisimachos P. Kondi, "Improved Temporal Pooling for Perceptual Video Quality Assessment Using VMAFin Proc. IS&T Int’l. Symp. on Electronic Imaging: Human Vision and Electronic Imaging,  2020,  pp 68-1 - 68-6,  https://doi.org/10.2352/ISSN.2470-1173.2020.11.HVEI-068

 Copy citation
  Copyright statement 
Copyright © Society for Imaging Science and Technology 2020
72010604
Electronic Imaging
2470-1173
Society for Imaging Science and Technology
7003 Kilworth Lane, Springfield, VA 22151 USA