Virtual backgrounds have become an increasingly important feature of online video conferencing due to the popularity of remote work in recent years. To enable a virtual background, a segmentation mask of the participant must be extracted from the real-time video input. Most previous work has focused on image-based methods for portrait segmentation. However, portrait video segmentation poses additional challenges, including complicated backgrounds, body motion, and inter-frame consistency. In this paper, we utilize temporal guidance to improve video segmentation and propose several methods to address these challenges, including a prior mask, optical flow, and visual memory. We leverage an existing portrait segmentation model, PortraitNet, to incorporate our temporal guidance methods. Experimental results show that our methods achieve improved segmentation performance on portrait videos with minimal latency.
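To make the idea of temporal guidance concrete, the sketch below shows one plausible way a prior mask and optical flow could be wired around a segmentation backbone: the previous frame's predicted mask is (optionally) warped with optical flow and concatenated to the RGB frame as a fourth input channel. The `TemporalGuidedSegmenter` wrapper, the `warp_mask` helper, the 4-channel backbone, and the use of backward flow are illustrative assumptions for this sketch, not the authors' implementation.

```python
# Minimal sketch of prior-mask + optical-flow temporal guidance (assumed design,
# not the paper's code). The backbone is assumed to accept 4 input channels:
# RGB plus the warped prior mask.

import torch
import torch.nn as nn
import torch.nn.functional as F


class TemporalGuidedSegmenter(nn.Module):
    """Wraps a backbone (e.g. a PortraitNet-style encoder-decoder) that takes
    RGB + prior-mask input."""

    def __init__(self, backbone: nn.Module):
        super().__init__()
        self.backbone = backbone

    @staticmethod
    def warp_mask(prior_mask: torch.Tensor, flow: torch.Tensor) -> torch.Tensor:
        """Warp the previous mask (N,1,H,W) to the current frame using backward
        optical flow (N,2,H,W) that maps frame-t pixels to their frame-(t-1)
        source locations."""
        n, _, h, w = prior_mask.shape
        ys, xs = torch.meshgrid(
            torch.arange(h, device=flow.device),
            torch.arange(w, device=flow.device),
            indexing="ij",
        )
        # Displace the sampling grid by the flow and normalize to [-1, 1].
        grid_x = (xs.float() + flow[:, 0]) / (w - 1) * 2 - 1
        grid_y = (ys.float() + flow[:, 1]) / (h - 1) * 2 - 1
        grid = torch.stack((grid_x, grid_y), dim=-1)  # (N, H, W, 2)
        return F.grid_sample(prior_mask, grid, align_corners=True)

    def forward(self, frame, prior_mask=None, flow=None):
        if prior_mask is None:
            # First frame: use an empty prior so the input shape stays constant.
            prior_mask = torch.zeros_like(frame[:, :1])
        elif flow is not None:
            prior_mask = self.warp_mask(prior_mask, flow)
        x = torch.cat([frame, prior_mask], dim=1)  # (N, 4, H, W)
        return self.backbone(x)
```

Under these assumptions, each new prediction would be thresholded and fed back as the prior mask for the next frame, so the per-frame overhead of the guidance is limited to one channel concatenation and, when flow is used, one warp.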
Weichen Xu, Yezhi Shen, Qian Lin, Jan P. Allebach, and Fengqing Zhu, "Efficient real-time portrait video segmentation with temporal guidance," in Electronic Imaging, 2022, pp. 263-1 - 263-7, https://doi.org/10.2352/EI.2022.34.8.IMAGE-263