Autonomous driving paper index

Enhancing deep reinforcement learning with expert demonstrations for mobile robot navigation in unstructured off-road environments

2026-07-01 · Research Online (Edith Cowan University)

autonomous drivingend-to-endreinforcement learningimitation learningon-roadcontrol

One-line summary

Advancements in the field of Deep Learning has ushered in a boom in autonomous navigation research.

Engineering notes

Modern supervised learning methods have been used to achieve state-of-the-art results in urban navigation benchmarks but struggle to generalise to out-of-distribution data.

Chinese explanation / 中文解读

中文解读待补充：本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Advancements in the field of Deep Learning has ushered in a boom in autonomous navigation research. Most of the work being conducted in this space, however, has focused on on-road urban navigation scenarios, with unstructured outdoor terrain navigation receiving much more limited attention. Given the wide range of applications that exist for legged and wheeled ground robots in off-road environments in areas such as agriculture, mining and disaster recovery, there is a growing need for research work to improve the navigational capabilities of mobile robots deployed in these challenging environments. A promising candidate for application to these navigation challenges in the unstructured off-road domain is Deep Reinforcement Learning (DRL), which has shown significant success in achieving human-level expertise in a wide range of control problems. For problems with higher dimensionality however, DRL suffers from sample-inefficiency and requires large amounts of compute resources for convergence. Modern supervised learning methods have been used to achieve state-of-the-art results in urban navigation benchmarks but struggle to generalise to out-of-distribution data. While expert policies for urban on-road navigation are readily available in both synthetic and real-world forms, significant challenges exist in the case of unstructured outdoor environments due to the lack of availability of the large amounts of annotated navigational data needed for agent learning. Based on an in-depth study of published literature for DRL, Imitation Learning (IL) and synthetic and real-world sensor data generation, this research investigates a novel method for combining DRL and IL techniques to achieve robust autonomous navigation in unstructured outdoor terrain. Validation of the proposed approach has been carried out in simulated environments containing prototype-phase designs consisting of both simplified terrain and obstacle features and more realistic designs containing high-fidelity photogrammetry assets representing Western Australian environments. The 3D environment models are implemented using the NVIDIA Isaac Sim simulation platform, which supports realistic physics-based autonomous robot training workflows. The developed 3D environment models are used for generating navigational expert policies, which are subsequently used to improve the sample efficiency of off-policy Reinforcement Learning (RL) algorithms. The expert trajectories are incorporated into the early learning phase of the RL algorithms using the proposed framework. The performance of the proposed approach is tested over four different 3D environment models and two different RL algorithms. The results of the experiments indicate that the proposed approach can achieve up to four times more sample-efficient learning with the use of expert trajectories in the replay buffer for off-policy RL algorithms. The work presented in this thesis can be used to develop more robust and sample efficient end-to-end navigational policies for off-road and unstructured terrain. Application of this approach will lead to the advancement of autonomous navigation capabilities in the agriculture, mining, disaster-recovery and transportation industries.

6.0Engineering value

8.0Research novelty

5.5Business relevance

Links and sources

PDF from original source

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.