Autonomous driving paper index
Boosting Vehicle-to-Vehicle Collaborative Perception in Bird's-Eye View by Attentive Feature Fusion and Robust Pose Correction
One-line summary
To address this, we propose a novel framework named Effective and Robust Collaborative Perception (ERCP), which is designed to enhance feature fusion with strong robustness against CAV pose errors.
Engineering notes
Extensive evaluations on the OPV2V and V2V4Real datasets demonstrate that ERCP achieves state-of-the-art performance in the collaborative vehicle detection task in BEV.
Chinese explanation / 中文解读
中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。
Original abstract
Collaborative perception enables Connected Autonomous Vehicles (CAVs) to share sensory data, and therefore presents a promising path towards long-range robust environmental understanding by overcoming individual perception limitations such as occlusions. The core challenge of collaborative perception lies in the precise spatial alignment of shared features and their effective fusion. To address this, we propose a novel framework named Effective and Robust Collaborative Perception (ERCP), which is designed to enhance feature fusion with strong robustness against CAV pose errors. Specifically, the robustness is improved by a two-stage coarse-to-fine Pose Correction Module (PCM) that performs feature spatial alignment, and is further boosted by the proposed perturbative training mechanism. Furthermore, a Multi-scale Cross-attention Fusion Module (MCFM) effectively aggregates aligned Bird's-Eye View (BEV) features from multiple CAVs, and leverages cross-attention among different scales to create a comprehensive representation for downstream perception tasks. Extensive evaluations on the OPV2V and V2V4Real datasets demonstrate that ERCP achieves state-of-the-art performance in the collaborative vehicle detection task in BEV.
Links and sources
Need this topic turned into a technical roadmap?
Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.
Request B2B research
Comments