Krishna Reddy, Mahesh Kumar

Resource type

Thesis

Thesis type

(Thesis) M.Sc.

Date created

2022-12-16

Authors/Contributors

Author: Krishna Reddy, Mahesh Kumar

Abstract

Training a single network for high resolution and geometrically consistent monocular depth estimation is challenging due to varying scene complexities in the real world. To address this, we present a dual depth estimation setup to decompose the estimations into ordinal and metric depth. The goal of ordinal depth estimation is to leverage novel ordinal losses with relaxed geometric constraints to model local and global ordinal relations for capturing better high-frequency depth details and scene structure. However, ordinal depth inherently lacks geometric structure, and to resolve this, we introduce a metric depth estimation method to enforce geometric constraints on the prior ordinal depth estimations. The estimated scale-invariant metric depth achieves high resolution and is geometrically consistent in generating meaningful 3D point cloud representation for scene reconstruction. We demonstrate the effectiveness of our ordinal and metric networks by performing zero-shot and in-the-wild depth evaluations with state-of-the-art depth estimation networks.

Extent

68 pages.

Keywords

Identifier

etd22277

Copyright statement

Copyright is held by the author(s).

Permissions

This thesis may be printed or downloaded for non-commercial research and scholarly purposes.

Supervisor or Senior Supervisor

Thesis advisor: Aksoy, Yağız

Language

English

Member of collection

Computing Science Theses

Download file	Size
etd22277.pdf	39.91 MB

Metric monocular reconstruction through ordinal depth

Keywords

Views & downloads - as of June 2023