IS&T | Library

Pixel Decimation of RD-Cost Functions in the HEVC Encoder

35 0

Ahmed M Hamza, Abdelrahman Abdelazim, Djamel Ait-Boudaoud

Pages 1 - 5, February 2016, © Society for Imaging Science and Technology 2016

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-239

Volume 28

Issue 2

We present and analyse schemes for the improvement of computational complexity in the current HEVC (High Efficiecy Video Coding) standard, by a subsampling of the block-matching distortion cost functions used in the encoding process. HEVC improves on prior standards considerably in coding (compression) efficiency, with a large set-back in time complexity for inter and intra prediction processes and mode decisions. We alleviate this by reducing the number of calculations per decision in all modes of prediction, through pixel decimation in the SAD and SSE distortion cost functions. Experimentation with different patterns shows significant encoding time reduction with these schemes, used in tandem with built-in Fast Encoding optimizations in the HEVC reference implementation.

Digital Library: EI

Published Online: February 2016

Guided Filter Demosaicking For Fourier Spectral Filter Array

42 5

Jie Jia, Chuan Ni, Andrew Sarangan, Keigo Hirakawa

Pages 1 - 5, February 2016, © Society for Imaging Science and Technology 2016

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-228

Volume 28

Issue 2

We recently introduced a spectral filter array design for single-shot multispectral imaging that is based on Fourier transform spectroscopy. In this article, we investigate feasibility of guided filter demosaicking for our SFA design.

Digital Library: EI

Published Online: February 2016

VPx Error Resilient Video Coding Using Duplicated Prediction Information

37 1

Neeraj Gadgil, Edward J Delp

Pages 1 - 6, February 2016, © Society for Imaging Science and Technology 2016

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-234

Volume 28

Issue 2

Most real-time video applications typically demand low end-to-end latency and faithful reconstruction of the video sequence. Many popular video coding standards (e.g. VP8, VP9, H.264 and HEVC) aim at achieving high compression efficiencies by exploiting spatial and temporal redundancies. This makes the encoded bitstream vulnerable to errors. Thus, applications especially on mobile phones, tablet PCs and other portable devices that use WiFi or 3G/4G/LTE networks typically suffer from low quality of service typically characterized by frequent delays, jitter, frozen picture, partial/no picture and total loss of connection. Similar scenarios are also often observed while watching live streaming accompanied by service interruptions and a blank screen. Our approach is to investigate error resilient coding control for the VPx encoder to make the bitstream more error resilient for streaming applications under lossy channel conditions. In this paper, we describe an error resilient coding system that uses duplication of frame prediction information. Our “error resilience packet” consists of this prediction information of several frames, that can be used for error concealment in the case of packet loss.

Digital Library: EI

Published Online: February 2016

Machine Learning-based Early Termination in Prediction Block Decomposition for VP9

148 6

Xintong Han, Yunqing Wang, Yaowu Xu, Jim Bankoski

Pages 1 - 6, February 2016, © Society for Imaging Science and Technology 2016

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-236

Volume 28

Issue 2

VP9 is an open-source video codec released by Google. It introduces superblocks (SBs) of size 64 × 64, and uses a recursive decomposition scheme to break them all the way down to 4 × 4 blocks. This provides a large efficiency gain for VP9. However, it also brings large computational complexity when encoding because of the rate distortion (RD) optimization on prediction blocks. This paper proposes a method that can early terminate the block partitioning process based on the information of the current block. We first model the early termination decision as a binary classification problem. Second, to solve this classification problem, a weighted linear Support Vector Machine (SVM) is trained whose weights are determined by the RD cost increase caused by misclassification. Finally, we model the parameter selection of the SVM as an optimization problem, which can enable us to control the trade-off between time saving and RD cost increase. Experimental results on standard HD data shows that the proposed method can reduce the complexity of partitioning prediction blocks while maintaining comparable coding performance - The Bjøntegaard delta bit rate is ∼1.2% for ∼30% encoding time reduction.

Digital Library: EI

Published Online: February 2016

A Sample Adaptive Offset Early Termination Method for HEVC Parallel Encoding

32 0

Younhee Kim, Jinwuk Seok, Myeong-Seok Gi, Huiyong Kim, Jin Soo Choi

Pages 1 - 6, February 2016, © Society for Imaging Science and Technology 2016

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-238

Volume 28

Issue 2

We propose a slice-level SAO on-off control method that can be applied in the parallel HEVC encoding scheme. To be applied in the parallel encoding scheme, our method does not use any information from the previous encoded frames. Our method uses the GOP level and slice quantization parameter, which are given before starting the current frame encoding. Our experimental results shows that our method can control SAO on-off in the slice level with very small amount of loss than the method that is hardly employed in the parallel encoding scheme.

Digital Library: EI

Published Online: February 2016

Fingerprint Liveness Detection Using Ensemble of Local Image Quality Assessments

15 0

Wonjun Kim, Sungjoo Suh, Youngsung Kim, Changkyu Choi

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-242

Volume 28

Issue 2

Detecting spoofing compared to a live trait is a critical problem in the biometric authentication. In this paper, we present a novel method to detect fake fingerprint attacks based on the ensemble of image quality assessments (IQAs). The key idea of the proposed method is to combine quality scores obtained from multiple local regions, which are input into the linear SVM classifier to determine whether the given fingerprint is fake or not. One important advantage of the proposed method is that, in contrast to previous approaches, it accurately identifies fake fingerprints even with small partial distortions. Moreover, the proposed method does not require any additional device. Experimental results on the mobile device show that the proposed method is effective for fingerprint liveness detection in real-world scenarios.

Digital Library: EI

Published Online: February 2016

Optimizing color information processing inside an SVM network

32 2

J Pasquet, G Subsol, M Derras, M Chaumont

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-243

Volume 28

Issue 2

Today, with the higher computing power of CPUs and GPUs, many different neural network architectures have been proposed for object detection in images. However, these networks are often not optimized to process color information. In this paper, we propose a new method based on an SVM network, that efficiently extracts this color information. We describe different network architectures and compare them with several color models (CIELAB, HSV, RGB...). The results obtained on real data show that our network is more efficient and robust than a single SVM network, with an average precision gain ranging from 1.5% to 6% with respect to the complexity of the test image database. We have optimized the network architecture in order to gain information from color data, thus increasing the average precision by up to 10%.

Digital Library: EI

Published Online: February 2016

Motion deblurring for depth-varying scenes

51 0

Ruiwen Zhen, Robert Stevenson

DOI

10.2352/ISSN.2470-1173.2016.2.VIPC-033

Volume 28

Issue 2