Autonomous driving paper index

DTVAD:End-to-End Autonomous Vehicle Planning Using Reinfocement Learning

2025-07-28 · Cybersecurity and Cyberforensics Conference

end-to-end autonomous drivingautonomous drivingautonomous vehicleend-to-endreinforcement learningsim2realnuscenesperceptionplanning

One-line summary

In this paper, we propose DTVAD, an RL-based end-to-end autonomous driving framework that directly leverages real-world datasets, specifically the nuScenes dataset, and employs a decision transformer as the planning module.

Engineering notes

Key topics: end-to-end autonomous driving, autonomous driving, autonomous vehicle, end-to-end, reinforcement learning, sim2real, nuscenes, perception, planning. See the paper for implementation details and experimental results.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Reinforcement learning (RL) based end-to-end autonomous driving faces significant challenges due to the sim2real gap, primarily because simulation environments are often insufficiently realistic. However, collecting data from the real world is highly risky, as RL requires exploration of potentially dangerous behaviors to learn which situations should be avoided. Additionally, the loss function of RL may hinder the optimization of the perception module. In this paper, we propose DTVAD, an RL-based end-to-end autonomous driving framework that directly leverages real-world datasets, specifically the nuScenes dataset, and employs a decision transformer as the planning module. Notably, we randomly sample 4,096 trajectories as the action space and utilize the Fréchet distance to measure the similarity between trajectories. For each sample in the nuScenes dataset, we select one of the top 70 most similar trajectories as the action for the corresponding state. The discrepancy between the selected action and the recorded trajectory may introduce potential collisions, enabling the agent to learn which behaviors should be avoided. And we use Decision Transformer(DT) as our planning head. We evaluate the effectiveness of DTVAD on the nuScenes dataset through open-loop test.

6.5Engineering value
7.0Research novelty
5.5Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment