Back to articles
Proceedings Paper
Volume: 37 | Article ID: IMAGE-273
Image
Frontal View Synthesis for Immersive Video Conferencing using Dual-camera Capture and Frame Interpolation
  DOI :  10.2352/EI.2025.37.8.IMAGE-273  Published OnlineFebruary 2025
Abstract
Abstract

In this paper, we propose a new solution for synthesizing frontal human images in video conferencing, aimed at enhancing immersive communication. Traditional methods such as center staging, gaze correction, and background replacement improve the user experience, but they do not fully address the issue of off-center camera placement. We introduce a system that utilizes two arbitrary cameras positioned on the top bezel of a display monitor to capture left and right images of the video participant. A facial landmark detection algorithm identifies key points on the participant’s face, from which we estimate the head pose. A segmentation model is employed to remove the background, isolating the user. The core component of our method is a video frame interpolation technique that synthesizes a realistic frontal view of the participant by leveraging the two captured angles. This method not only enhances visual alignment between users but also maintains natural facial expressions and gaze direction, resulting in a more engaging and life-like video conferencing experience.

Subject Areas :
Views 3
Downloads 0
 articleview.views 3
 articleview.downloads 0
  Cite this article 

Yezhi Shen, Md Adnan Faisal Hossain, Weichen Xu, Qian Lin, Fengqing Zhu, "Frontal View Synthesis for Immersive Video Conferencing using Dual-camera Capture and Frame Interpolationin Electronic Imaging,  2025,  pp 273-1 - 273-8,  https://doi.org/10.2352/EI.2025.37.8.IMAGE-273

 Copy citation
  Copyright statement 
Copyright © 2025, Society for Imaging Science and Technology
ei
Electronic Imaging
2470-1173
2470-1173
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA