Semi-automatic stereo extraction from video footage

Moshe Guttmann*, Lior Wolf, Daniel Cohen-Or

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review


We present a semi-automatic system that converts conventional video shots to stereoscopic video pairs. The system requires just a few user-scribbles in a sparse set of frames. The system combines a diffusion scheme, which takes into account the local saliency and the local motion at each video location, coupled with a classification scheme that assigns depth to image patches. The system tolerates both scene motion and camera motion. In typical shots, containing hundreds of frames, even in the face of significant motion, it is enough to mark scribbles on the first and last frames of the shot. Once marked, plausible stereo results are obtained in a matter of seconds, leading to a scalable video conversion system. Finally, we validate our results with ground truth stereo video.

Original languageEnglish
Title of host publication2009 IEEE 12th International Conference on Computer Vision, ICCV 2009
Number of pages7
StatePublished - 2009
Event12th International Conference on Computer Vision, ICCV 2009 - Kyoto, Japan
Duration: 29 Sep 20092 Oct 2009

Publication series

NameProceedings of the IEEE International Conference on Computer Vision


Conference12th International Conference on Computer Vision, ICCV 2009


Dive into the research topics of 'Semi-automatic stereo extraction from video footage'. Together they form a unique fingerprint.

Cite this