Autonomous driving paper index

SCNet3D: Rethinking the Feature Extraction Process of Pillar-Based 3D Object Detection

2025-01-01 · IEEE transactions on intelligent transportation systems (Print)

BEV Perception · 3D Object Detection · LiDAR Perception

autonomous driving · bev · 3d object detection · 3d detection · object detection · lidar · point cloud · kitti

One-line summary

SCNet3D is a pillar-based 3D object detection method that addresses feature enhancement, information preservation, and small-object detection at both the feature level and the data level.

Engineering notes

A STMod-Convolution Network (SCNet) extracts and fuses features from BEV pseudo images through two channels, one for basic features and one for advanced features. The authors report superior performance and robustness in extensive experiments: on the KITTI test split, moderate-difficulty 3D detection AP is 82.35% for Car, 44.64% for Pedestrian, and 67.55% for Cyclist.

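Chinese explanation

Chinese explanation pending: this site prioritizes adding Chinese explanations for high-value papers on end-to-end autonomous driving, BEV perception, 3D object detection, trajectory prediction, path planning, and LiDAR perception.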

Original abstract

LiDAR-based 3D object detection is essential for autonomous driving. In order to extract information from sparse and unordered point cloud data, pillar-based methods make the data compact and orderly by converting point cloud into pseudo images. However, these methods suffer from limited feature extraction capabilities, and tend to lose key information during the conversion, leading to inferior detection accuracy than voxel-based or point-based methods especially for small objects. In this paper, we propose SCNet3D, a novel pillar-based method that tackles the challenges of feature enhancement, information preservation, and small target detection from the perspectives of features and data. We first introduce a Feature Enhancement Module (FEM), which uses the attention mechanism to weight features in three dimensions, and enhances 3D features from local to global layer by layer. Then, a STMod-Convolution Network (SCNet) is designed, which achieves sufficient feature extraction and fusion of BEV pseudo images through two channels, one for basic feature and one for advanced feature. Moreover, a Shape and Distance Aware Data Augmentation (SDAA) approach is proposed to add more samples to the point cloud while maintaining the original shape and distance of the samples during the training process. Extensive experiments demonstrate that our SCNet3D has superior performance and excellent robustness. Remarkably, SCNet3D achieves the AP of 82.35% in the moderate Car category, 44.64% in the moderate Pedestrian category and 67.55% in the moderate Cyclist category on the KITTI test split in 3D detection benchmark, outperforming many state-of-the-art 3D detectors.
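The pillar-to-pseudo-image conversion that the abstract describes can be illustrated with a minimal sketch. This is not SCNet3D's encoder: it replaces the learned pillar features (and the paper's FEM) with three hand-picked statistics per pillar, and the function name `pillarize`, the default grid ranges, and the 0.16 m pillar size are assumptions borrowed from common KITTI pillar setups, not values stated in the source.

```python
import numpy as np


def pillarize(points, x_range=(0.0, 69.12), y_range=(-39.68, 39.68), pillar_size=0.16):
    """Scatter an (N, 4) point cloud [x, y, z, intensity] into a BEV pseudo image.

    Returns a float32 array of shape (3, H, W) holding, per pillar:
    max height, mean intensity, and point count.
    All defaults are illustrative assumptions, not values from the paper.
    """
    points = np.asarray(points, dtype=np.float32)
    width = int(round((x_range[1] - x_range[0]) / pillar_size))
    height = int(round((y_range[1] - y_range[0]) / pillar_size))

    # Keep only the points that fall inside the BEV grid.
    inside = (
        (points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1])
        & (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1])
    )
    pts = points[inside]

    # Map each point to the (row, col) index of its pillar.
    col = np.floor((pts[:, 0] - x_range[0]) / pillar_size).astype(np.int64)
    row = np.floor((pts[:, 1] - y_range[0]) / pillar_size).astype(np.int64)
    col = np.clip(col, 0, width - 1)   # guard against float rounding at the edge
    row = np.clip(row, 0, height - 1)

    count = np.zeros((height, width), dtype=np.float32)
    intensity_sum = np.zeros((height, width), dtype=np.float32)
    max_z = np.full((height, width), -np.inf, dtype=np.float32)

    # Unbuffered scatter operations handle several points landing in one pillar.
    np.add.at(count, (row, col), 1.0)
    np.add.at(intensity_sum, (row, col), pts[:, 3])
    np.maximum.at(max_z, (row, col), pts[:, 2])

    occupied = count > 0
    mean_intensity = np.zeros((height, width), dtype=np.float32)
    mean_intensity[occupied] = intensity_sum[occupied] / count[occupied]
    max_z[~occupied] = 0.0  # empty pillars become zeros, as in a sparse pseudo image

    return np.stack([max_z, mean_intensity, count])


if __name__ == "__main__":
    cloud = np.random.rand(1000, 4).astype(np.float32) * [69.12, 79.36, 3.0, 1.0]
    cloud[:, 1] -= 39.68  # shift y into the assumed range
    bev = pillarize(cloud)
    print(bev.shape)
```

The sketch also shows where information is lost: every point in a pillar collapses into a few numbers, which is the weakness for small objects that the abstract attributes to pillar-based methods.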

Engineering value: 5.0

Research novelty: 8.0

Business relevance: 5.0

Links and sources

Official / arXiv page

