Autonomous driving paper index

An Omnidirectional Visual Perception Method Based on Parameter-Free Projection of Key Points

2024-03-29 · 2024 5th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)

autonomous drivingbird's eye viewbevdepth estimationinstance segmentationnuscenesperception

One-line summary

To solve this problem, in this paper, we propose a non-parameterized key point projection method based on the attention mechanism, which significantly improves the computational speed of the algorithm.

Engineering notes

To solve this problem, in this paper, we propose a non-parameterized key point projection method based on the attention mechanism, which significantly improves the computational speed of the algorithm. Experiments show that the segmentation performance strength of our method outperforms that of similar methods and meets the real-time requirements.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

In recent years, omnidirectional vision BEV (Bird's Eye View) perception has become an important research direction in the field of autonomous driving vision. The correct perception of instance targets around the vehicle is a key technology to ensure that the vehicle can drive safely in complex traffic scenes. The difficulty of this technology is to efficiently convert the 2D features of the camera view into BEV features. Existing methods based on depth estimation suffer from the problem of inaccurate depth estimation or the need to create a large number of BEV depth labels. The model size of the Transformer-based method is large, which makes it difficult to use in practical applications. Both methods have the problem of a large number of invalid calculations, which seriously affects the real-time performance of the algorithm. To solve this problem, in this paper, we propose a non-parameterized key point projection method based on the attention mechanism, which significantly improves the computational speed of the algorithm. Our IOU for instance segmentation on the NuScenes is 44.4, and the algorithm runs at 65 FPS. Experiments show that the segmentation performance strength of our method outperforms that of similar methods and meets the real-time requirements.

5.5Engineering value
8.0Research novelty
5.0Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment