Autonomous driving paper index

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

2025-05-21 · IEEE International Conference on Computer Vision · arXiv: 2505.15304

autonomous drivingself-drivingimitation learningdeploymentcontrol

One-line summary

To address this, we propose SaliencyAware Quantized Imitation Learning (SQIL), which combines quantization-aware training with a selective lossweighting strategy for mission-critical states.

Engineering notes

We validate SQIL's generalization capability across extensive simulation benchmarks with environment variations, real-world tasks, and cross-domain tasks (self-driving, physics simulation), consistently recovering full-precision performance. Notably, a 4-bit weightquantized VLA model for robotic manipulation achieves up to $2.5 \times$ speedup and $2.5 \times$ energy savings on an edge GPU with minimal accuracy loss.

Chinese explanation / 中文解读

中文解读待补充：本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Deep neural network (DNN)-based policy models, such as vision-language-action (VLA) models, excel at automating complex decision-making from multi-modal inputs. However, scaling these models greatly increases computational overhead, complicating deployment in resourceconstrained settings like robot manipulation and autonomous driving. To address this, we propose SaliencyAware Quantized Imitation Learning (SQIL), which combines quantization-aware training with a selective lossweighting strategy for mission-critical states. By identifying these states via saliency scores and emphasizing them in the training loss, SQIL preserves decision fidelity under low-bit precision. We validate SQIL's generalization capability across extensive simulation benchmarks with environment variations, real-world tasks, and cross-domain tasks (self-driving, physics simulation), consistently recovering full-precision performance. Notably, a 4-bit weightquantized VLA model for robotic manipulation achieves up to $2.5 \times$ speedup and $2.5 \times$ energy savings on an edge GPU with minimal accuracy loss. These results underline SQIL 's potential for efficiently deploying large IL-based policy models on resource-limited devices.

5.5Engineering value

7.0Research novelty

6.5Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.