Self-Supervised Monocular Depth Estimation for Dynamic Targets

Jing Zhao, Liquan Dong*, Haojie Liu, Chengwei Lv, Rujia Zhang, Lingqin Kong, Ming Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

Monocular depth estimation is a classic problem in computer vision, with wide use in 3D scene reconstruction and augmented reality. This paper presents a self-supervised monocular depth estimation model for dynamic targets, designed to address the weaknesses of current unsupervised monocular depth estimation in scenes with moving objects. To improve depth accuracy in dynamic regions, a cross-attention mechanism strengthens the fusion of local dynamic-region features, which refines the overall depth structure, expands the receptive field, and enhances the representation of target-region features. In addition, a depth prior, termed pseudo-depth, is generated. Blurred edges on dynamic targets are addressed by matching the surface normals of the predicted depth and the pseudo-depth, and by constraining the relative normal angles of the two depth maps to be consistent around the edges of dynamic regions. Two channel attention modules are also designed to fuse semantic information across scales and improve overall modeling. Experiments on the KITTI and DDAD datasets show that the proposed method outperforms mainstream monocular depth estimation methods, particularly in dynamic regions. Compared with the baseline network, the absolute relative error and squared relative error are reduced by up to 30% and 70%, respectively.
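The abstract reports its gains in absolute relative error and squared relative error. As a minimal sketch, the block below uses the definitions commonly applied in monocular depth benchmarks (mean of |d - d*| / d* and mean of (d - d*)^2 / d* over pixels with valid ground truth); the paper's exact evaluation protocol (depth caps, crops, median scaling) is not given here and is not reproduced. The function names `depth_errors` and `relative_reduction` are hypothetical, and depth maps are flattened to plain lists for brevity.

```python
from typing import Sequence, Tuple


def depth_errors(pred: Sequence[float], gt: Sequence[float]) -> Tuple[float, float]:
    """Return (abs_rel, sq_rel) over pixels whose ground-truth depth is positive.

    Assumption: a ground-truth value <= 0 marks an invalid pixel (e.g. no LiDAR return).
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must have the same length")
    # Keep only pixels with valid ground truth.
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth pixels")
    n = len(pairs)
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    sq_rel = sum((p - g) ** 2 / g for p, g in pairs) / n
    return abs_rel, sq_rel


def relative_reduction(baseline: float, ours: float) -> float:
    """Fractional error reduction relative to a baseline (0.3 means 30% lower)."""
    if baseline <= 0:
        raise ValueError("baseline error must be positive")
    return (baseline - ours) / baseline


if __name__ == "__main__":
    # Toy values for illustration only; these are not results from the paper.
    abs_rel, sq_rel = depth_errors([11.0, 18.0, 5.0], [10.0, 20.0, 0.0])
    print(abs_rel, sq_rel)
```

A statement such as "reduced by up to 30%" corresponds to `relative_reduction` returning about 0.3 for the best case among the reported comparisons.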

Original language: English
Title of host publication: Tenth Symposium on Novel Optoelectronic Detection Technology and Applications
Editor: Chen Ping
Publisher: SPIE
ISBN (electronic): 9781510688148
Publication status: Published - 2025
Event: 10th Symposium on Novel Optoelectronic Detection Technology and Applications - Taiyuan, China
Duration: 1 Nov 2024 to 3 Nov 2024

Publication series

Name: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 13511
ISSN (print): 0277-786X
ISSN (electronic): 1996-756X

Conference

Conference: 10th Symposium on Novel Optoelectronic Detection Technology and Applications
Country/Territory: China
City: Taiyuan
Period: 1/11/24 to 3/11/24
