Autonomous driving paper index

Research on Generalization of Autonomous Driving Behavior Policies Based on Policy Constraints

2025-05-09 · 2025 IEEE 7th International Conference on Communications, Information System and Computer Engineering (CISCE)

autonomous driving systemautonomous drivinglidarpoint cloudsensor fusioncarlaperception

One-line summary

To address these issues, this paper proposes a framework that incorporates adversarial networks to constrain the encoder's output on top of Proximal Policy Optimization (PPO), combined with sensor fusion(ASF-PPO).

Engineering notes

Experimental results show that, compared to Soft Actor-Critic (SAC) and Deep Deterministic Policy Gradient (DDPG), our method achieves significant advantages in both training speed and performance, while also demonstrating excellent generalization on previously unseen road scenarios.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

The development of autonomous driving holds significant potential to enhance traffic efficiency and improve societal well-being. However, current autonomous driving systems heavily rely on high-dimensional raw perception data, such as images and LiDAR point clouds, within their decision-making models. This reliance often leads to overfitting to specific training environments, resulting in poor transferability and weak generalization capabilities in new driving scenarios. To address these issues, this paper proposes a framework that incorporates adversarial networks to constrain the encoder's output on top of Proximal Policy Optimization (PPO), combined with sensor fusion(ASF-PPO). This enables the agent to learn key generalized features, enhancing its generalization ability. The proposed method not only enhances the agent's generalization capability in unseen environments but also accelerates the convergence of the training process. We validate the effectiveness of our approach in the CARLA simulation environment. Experimental results show that, compared to Soft Actor-Critic (SAC) and Deep Deterministic Policy Gradient (DDPG), our method achieves significant advantages in both training speed and performance, while also demonstrating excellent generalization on previously unseen road scenarios.

5.0Engineering value
7.0Research novelty
5.0Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment