Back to articles
Volume: 28 | Article ID: art00006
Machine Learning-based Early Termination in Prediction Block Decomposition for VP9
  DOI :  10.2352/ISSN.2470-1173.2016.2.VIPC-236  Published OnlineFebruary 2016

VP9 is an open-source video codec released by Google. It introduces superblocks (SBs) of size 64 × 64, and uses a recursive decomposition scheme to break them all the way down to 4 × 4 blocks. This provides a large efficiency gain for VP9. However, it also brings large computational complexity when encoding because of the rate distortion (RD) optimization on prediction blocks. This paper proposes a method that can early terminate the block partitioning process based on the information of the current block. We first model the early termination decision as a binary classification problem. Second, to solve this classification problem, a weighted linear Support Vector Machine (SVM) is trained whose weights are determined by the RD cost increase caused by misclassification. Finally, we model the parameter selection of the SVM as an optimization problem, which can enable us to control the trade-off between time saving and RD cost increase. Experimental results on standard HD data shows that the proposed method can reduce the complexity of partitioning prediction blocks while maintaining comparable coding performance - The Bjøntegaard delta bit rate is ∼1.2% for ∼30% encoding time reduction.

Subject Areas :
Views 80
Downloads 3
 articleview.views 80
 articleview.downloads 3
  Cite this article 

Xintong Han, Yunqing Wang, Yaowu Xu, Jim Bankoski, "Machine Learning-based Early Termination in Prediction Block Decomposition for VP9in Proc. IS&T Int’l. Symp. on Electronic Imaging: Visual Information Processing and Communication VII,  2016,

 Copy citation
  Copyright statement 
Copyright © Society for Imaging Science and Technology 2016
Electronic Imaging
Society for Imaging Science and Technology
7003 Kilworth Lane, Springfield, VA 22151 USA