June 1, 2010

Single image depth estimation from predicted semantic labels

Key Points

Key points are not available for this paper at this time.

Abstract

We consider the problem of estimating the depth of each pixel in a scene from a single monocular image. Unlike traditional approaches, which attempt to map from appearance features to depth directly, we first perform a semantic segmentation of the scene and use the semantic labels to guide the 3D reconstruction. This approach provides several advantages: By knowing the semantic class of a pixel or region, depth and geometry constraints can be easily enforced (e.g., “sky” is far away and “ground” is horizontal). In addition, depth can be more readily predicted by measuring the difference in appearance with respect to a given semantic class. For example, a tree will have more uniform appearance in the distance than it does close up. Finally, the incorporation of semantic features allows us to achieve state-of-the-art results with a significantly simpler model than previous works.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Beyang Liu

Stanford University

Stephen Jay Gould

Brigham Young University

Daphne Koller

Mount Sinai Hospital

Actions

Institutions

Stanford University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Single image depth estimation from predicted semantic labels

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study