Autonomous driving paper index

HybridTE2: Hybrid Transformer-based End-to-End Learning for Autonomous Driving

2024-05-12 · Industrial Cyber-Physical Systems

autonomous drivingend-to-endprediction

One-line summary

Accurate navigation prediction is paramount for autonomous driving, offering the potential to enhance safety and efficiency by mitigating accidents caused by human error.

Engineering notes

The experimental results demonstrate that the proposed HybridTE2 architecture significantly outperforms the state-of-the-art CNN-based Nvidia’s PilotNet architecture and other baseline architectures.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Accurate navigation prediction is paramount for autonomous driving, offering the potential to enhance safety and efficiency by mitigating accidents caused by human error. This task necessitates the capture of long-term dependencies and complex correlations, especially in challenging driving scenarios. While Convolutional Neural Networks (CNNs) have historically been employed to predict navigation commands, delivering good accuracy, they often struggle in capturing long-term dependencies. In contrast, attention-based models like Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks exhibit proficiency in capturing long-term dependencies. This research is driven by a fundamental question: Which types of architectural designs are best suited to effectively capture these long-term dependencies in the context of precise navigation prediction. To tackle the question, a hybrid Transformer-based end-to-end architecture, HybridTE2, was designed for predicting steering angles and speed in autonomous driving scenarios. The proposed architecture integrates the strengths of CNNs and Transformer Encoder, leveraging CNNs to efficiently extract lowlevel features while simultaneously harnessing the Transformer’s ability to capture long-term dependencies. To empirically validate the proposed architecture, our evaluation has been conducted using the real-world large-scale dataset DBNet for driving behavior learning. The experimental results demonstrate that the proposed HybridTE2 architecture significantly outperforms the state-of-the-art CNN-based Nvidia’s PilotNet architecture and other baseline architectures.

6.0Engineering value
8.0Research novelty
5.5Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment