Autonomous driving paper index

FreqBEV-V2I: Frequency-Domain BEV-Enhanced Vehicle-to-Infrastructure Cooperative 3D Detection

2026-02-01 · IEEE Transactions on Intelligent Transportation Systems (Print)

autonomous driving · autonomous vehicle · BEV · 3D object detection · 3D detection · object detection · perception

One-line summary

The paper explores a frequency-domain BEV representation to address asynchronous temporal misalignment and heterogeneous feature collaboration in V2I cooperation, and proposes the FreqBEV-V2I framework, which incorporates FreqBEVFlow and FreqBEVFusion blocks.

Engineering notes

Vehicle-to-infrastructure (V2I) cooperation lets an autonomous vehicle improve its perception by using information from infrastructure. On the real-world DAIR-V2X dataset, FreqBEV-V2I reports 61.59% mAP@3D (IoU=0.5) under ideal V2I communication, 16.77% above the individual no-fusion baseline and 5.78% above existing state-of-the-art methods, and 61.20% at 200 ms latency. The code is available at https://github.com/DeepPhysicVision/FreqBEV-V2I.
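The abstract quotes FreqBEV-V2I's scores and its margins over two baselines, but not the baseline scores themselves. The short calculation below backs them out. It is only a sketch: it assumes the quoted margins are absolute percentage points rather than relative improvements, which the abstract does not state, and the variable names are made up for illustration.

```python
# Figures quoted in the abstract (mAP@3D at IoU=0.5 on DAIR-V2X).
ideal_map = 61.59            # FreqBEV-V2I, ideal V2I communication
latency_200ms_map = 61.20    # FreqBEV-V2I, 200 ms latency
gain_over_no_fusion = 16.77  # margin over the individual no-fusion baseline
gain_over_prior_sota = 5.78  # margin over existing state-of-the-art methods

# ASSUMPTION: the margins are absolute percentage points, not relative gains.
implied_no_fusion = round(ideal_map - gain_over_no_fusion, 2)
implied_prior_sota = round(ideal_map - gain_over_prior_sota, 2)

# Accuracy lost when moving from ideal communication to 200 ms latency.
drop_at_200ms = round(ideal_map - latency_200ms_map, 2)

print(implied_no_fusion, implied_prior_sota, drop_at_200ms)
```

Under that assumption the implied baselines are roughly 44.82% (no fusion) and 55.81% (prior state of the art), and the reported cost of 200 ms latency is 0.39 points.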

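Chinese explanation

Chinese explanation pending: this site prioritizes adding Chinese explanations for high-value papers on end-to-end autonomous driving, BEV perception, 3D object detection, trajectory prediction, path planning, and LiDAR perception.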

Original abstract

Accurate vehicle-to-infrastructure (V2I) cooperation can significantly enhance the perception performance of autonomous vehicles by leveraging information from infrastructure. However, existing cooperation methods based on spatial bird’s-eye-view (BEV) representation struggle with asynchronous temporal misalignment and heterogeneous feature collaboration, leading to 3D detection performance degradation and compromised safety. In this paper, we explore a frequency domain BEV representation to address these challenges and propose the FreqBEV-V2I framework that incorporates FreqBEVFlow and FreqBEVFusion blocks. In FreqBEVFlow, we design global filter spatial differential matching and wavelet-enhanced Fourier channel refinement networks to capture global motion variations via self-supervised learning, which effectively addresses transmission asynchronous latency. Meanwhile, FreqBEVFusion integrates features from vehicle and infrastructure with a frequency adaptive convolution network for V2I heterogeneous feature collaboration. Experimental results on the real-world DAIR-V2X dataset demonstrate that FreqBEV-V2I significantly outperforms current state-of-the-art methods, achieving superior 3D object detection performance and robustness across various latency conditions. Specifically, under ideal V2I communication conditions, FreqBEV-V2I achieves 61.59% mAP@3D (IoU=0.5), surpassing individual no-fusion and existing state-of-the-art methods by 16.77% and 5.78%, respectively. Even in latency-aware scenarios, FreqBEV-V2I maintains high accuracy with an mAP@3D (IoU=0.5) of 61.20% at 200 ms latency, significantly outperforming other methods. The code is available at https://github.com/DeepPhysicVision/FreqBEV-V2I.
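To make the idea of a "global filter" on a frequency-domain BEV representation more concrete, here is a minimal NumPy sketch of the generic pattern: transform a BEV feature map with a 2D FFT, multiply each frequency bin by a weight, and transform back. This is not the authors' FreqBEVFlow or FreqBEVFusion code. The function name `global_filter`, the tensor shapes, and the fixed weights are all assumptions for illustration; in a trained network the weights would be learned.

```python
import numpy as np


def global_filter(bev, weight):
    """Filter a BEV feature map in the frequency domain (illustrative only).

    bev:    real array of shape (C, H, W), one BEV grid per channel
    weight: complex array of shape (C, H, W // 2 + 1), one gain per rFFT bin
    """
    # 2D real FFT over the spatial axes; every bin depends on the whole grid.
    spec = np.fft.rfft2(bev, axes=(-2, -1), norm="ortho")
    # Element-wise multiplication in frequency equals a global circular
    # convolution in space.
    spec = spec * weight
    # Back to the spatial BEV grid at the original resolution.
    return np.fft.irfft2(spec, s=bev.shape[-2:], axes=(-2, -1), norm="ortho")


rng = np.random.default_rng(0)
bev = rng.standard_normal((4, 8, 8))  # hypothetical C=4, H=W=8 feature map

# All-ones weights leave the features unchanged.
identity = np.ones((4, 8, 5), dtype=np.complex128)
same = global_filter(bev, identity)

# Keeping only the zero-frequency bin collapses each channel to its mean.
dc_only = np.zeros((4, 8, 5), dtype=np.complex128)
dc_only[:, 0, 0] = 1.0
smoothed = global_filter(bev, dc_only)
```

Because each frequency bin summarizes the entire grid, a single multiplication gives every output cell a global receptive field, which is one plausible reason a frequency-domain representation suits capturing global motion between delayed and current BEV frames.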

Engineering value: 7.0

Research novelty: 8.0

Business relevance: 5.5

Links and sources

Official / arXiv page
