Autonomous driving paper index

Enhancing Autonomous Driving: A Low-Cost Monocular End-to-End Framework With Multi-Task Integration and Temporal Fusion

2024-10-01 · IEEE Transactions on Intelligent Vehicles

End-to-End Autonomous Driving BEV Perception Path Planning Autonomous Driving Simulation

autonomous driving systemautonomous drivingbird's eye viewbevend-to-endmotion planningcarlaperceptionplanning

One-line summary

In particular, to improve the performance of the vision-based model, our model considers three key aspects.

Engineering notes

The performance of our proposed model is evaluated using the longest-06 and Town05-long benchmarks within the CARLA-0.9.10.1 simulator. The experiment results demonstrate that our proposed model performs state-of-the-art results on the longest-06 benchmark and promising results on the Town05-long benchmark even compared to these multi-modal sensor based models.

Chinese explanation / 中文解读

中文解读待补充：本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

End to end autonomous driving system has rapidly progressed and garnered significant attention. Recently, multi-modal fusion methods have surpassed vision-based methods in terms of performance. However, the multi-modal methods are both costly and time-confusion. This paper aims to propose a low-cost monocular end-to-end multi-task autonomous driving framework to address these challenges. In particular, to improve the performance of the vision-based model, our model considers three key aspects. First, to enhance the level of detail and visual representation of the surrounding environment, depth embeddings and semantic embeddings are utilized to assist in the perception of the Bird's Eye View(BEV) space. In addition, the motion queries are separated from the BEV queries and the novel transformer-based motion decoder is proposed to generate the output waypoints and target speed for motion planning. Subsequently, to optimize the utilization of the historic features, we introduced a streaming-based processing temporal module to fuse current and historic features. The performance of our proposed model is evaluated using the longest-06 and Town05-long benchmarks within the CARLA-0.9.10.1 simulator. The experiment results demonstrate that our proposed model performs state-of-the-art results on the longest-06 benchmark and promising results on the Town05-long benchmark even compared to these multi-modal sensor based models.

5.5Engineering value

8.0Research novelty

5.0Business relevance

Links and sources

Official / arXiv page

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.