Autonomous driving paper index

Self-Supervised Monocular Depth Estimation Based on High-Order Spatial Interactions

2024-02-15 · IEEE Sensors Journal

autonomous drivingdepth estimationmonocular depthmonocular camerakittiperception

One-line summary

Inspired by HorNet, in this article, we propose a novel self-supervised monocular depth estimation framework based on high-order spatial interactions, referred to as the Hor-Depth.

Engineering notes

This strategy applies varying constraints to the model at different training stages, effectively reducing training fluctuations and mitigating outliers that significantly deviate from the predicted values. The proposed approach demonstrates exceptional performance in self-supervised monocular depth estimation, surpassing certain stereo supervised or monocular supervised methods, as evidenced by its impressive results on the KITTI Eigen split benchmark.

Chinese explanation / 中文解读

中文解读待补充：本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Depth estimation plays a pivotal role in various applications, including autonomous driving and robot navigation. In contrast to depth estimation using multiple images, such as stereo depth perception, inferring depth relations from a monocular camera is notably more challenging yet highly valuable. Traditionally, convolutional neural networks (CNNs) with residual structures have been extensively employed for this task, but they inherently constrain the model’s feature extraction capabilities. Inspired by HorNet, in this article, we propose a novel self-supervised monocular depth estimation framework based on high-order spatial interactions, referred to as the Hor-Depth. Furthermore, the Hor-Depth improves feature fusion efficiency in the depth network decoder by incorporating the attentional feature fusion (AFF) module based on first-order spatial interaction, leading to more refined predicted disparity maps. To address issues of loss fluctuations and training instability, we introduce a progressive scale-weight adjustment strategy-based loss function. This strategy applies varying constraints to the model at different training stages, effectively reducing training fluctuations and mitigating outliers that significantly deviate from the predicted values. The proposed approach demonstrates exceptional performance in self-supervised monocular depth estimation, surpassing certain stereo supervised or monocular supervised methods, as evidenced by its impressive results on the KITTI Eigen split benchmark.

5.0Engineering value

8.0Research novelty

5.0Business relevance

Links and sources

Official / arXiv page

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.