Autonomous driving paper index

Multimodal Diffusion-Based Depth Estimation Framework with Wi-Fi and Vision

2026-06-17 · ACM Transactions on Sensor Networks

autonomous drivingdepth estimationmonocular depthlidardeploymentprediction

One-line summary

To overcome these challenges, we propose WiViD, a novel diffusion-based depth estimation framework that integrates commercial Wi-Fi and visual data.

Engineering notes

By leveraging the iterative refinement capabilities of diffusion models, WiViD achieves high-precision depth predictions. Real-world experiments demonstrate that WiViD significantly outperforms state-of-the-art monocular depth estimation methods, reducing Absolute Relative Error (ARE) and Square Relative Error (SRE) by 33.3% and 9.5%, respectively, highlighting its superior accuracy and robustness.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Depth estimation is essential for applications such as autonomous driving, robotic navigation, and augmented reality. While LiDAR-based and mmWave-based approaches offer high accuracy, they come with substantial deployment costs, whereas monocular vision methods often struggle with precision. To overcome these challenges, we propose WiViD, a novel diffusion-based depth estimation framework that integrates commercial Wi-Fi and visual data. By leveraging the iterative refinement capabilities of diffusion models, WiViD achieves high-precision depth predictions. WiViD features two key encoding modules: the Complex-Valued CSI Encoder (CCE), which extracts rich spatio-temporal features from Wi-Fi signals, and the Residual Image Encoder (RIE), which processes visual data. Additionally, we introduce the Multi-Modal Alignment Bidirectional Cross-Attention (MABA) mechanism to enhance cross-modal feature integration. Real-world experiments demonstrate that WiViD significantly outperforms state-of-the-art monocular depth estimation methods, reducing Absolute Relative Error (ARE) and Square Relative Error (SRE) by 33.3% and 9.5%, respectively, highlighting its superior accuracy and robustness.

5.5Engineering value
8.0Research novelty
6.5Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment