NFDI4DS | UHH-SEMS - Publication Details

Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN

Monocular Leverage (statistics)

DOI: 10.48550/arxiv.2308.08333 Publication Date: 2023-01-01

Abstract Supplemental Material References Cited by

AUTHORS (3)

Jiawei Yao

Tong Wu

Xiaofeng Zhang

ABSTRACT

Monocular depth estimation is an ongoing challenge in computer vision. Recent progress with Transformer models has demonstrated notable advantages over conventional CNNs this area. However, there's still a gap understanding how these prioritize different regions 2D images and affect performance. To explore the differences between Transformers CNNs, we employ sparse pixel approach to contrastively analyze distinctions two. Our findings suggest that while excel handling global context intricate textures, they lag behind preserving gradient continuity. further enhance performance of monocular estimation, propose Depth Gradient Refinement (DGR) module refines through high-order differentiation, feature fusion, recalibration. Additionally, leverage optimal transport theory, treating maps as spatial probability distributions, distance loss function optimize our model. Experimental results demonstrate integrated plug-and-play proposed without increasing complexity computational costs on both outdoor KITTI indoor NYU-Depth-v2 datasets. This research not only offers fresh insights into but also paves way for novel methodologies.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products OPENALEX - Publications

PlumX Metrics

Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....