We propose a post-processing framework based on multiview interactive video segmentation for correcting 2D-plus-depth video footage. The suggested approach uses user-made scribbles to guide the multi-view segmentation process, which is based on an efficient cost-volume filtering algorithm. We extend the 2D algorithm to 3D and propose several improvements that increase precision and recall while also decreasing the need for user input. Our semi-automatic approach is supported by an interactive visualization tool that integrates both 2D and 3D views of the footage, allowing the user to explore novel views coherently and grasp a better understanding of the underlying data. We integrate our post-processing framework into a workflow for generating dynamic meshes from footage recorded by multiple stereo cameras, demonstrating the applicability of the technique.