Autonomous driving paper index

IR-WSANet: A lightweight network with frequency-spatial joint optimization for infrared small target detection

2026-06-06 · Scientific Reports

autonomous driving

One-line summary

This paper proposes IR-WSANet, a lightweight network based on an improved YOLOv10, which enhances the detection performance of infrared small targets through a frequency-spatial joint optimization strategy.

Engineering notes

The results show that IR-WSANet significantly improves the detection performance of infrared small targets in complex scenes through the combination of frequency domain filtering enhancement and space-channel dual attention mechanism.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Infrared Small Target Detection (IRSTD) holds significant application value in military operations, early warning and surveillance, aerospace, and other fields. However, traditional detection methods face challenges related to small target pixel sizes, strong background noise, and sparse infrared image features, resulting in insufficient accuracy and robustness. This paper proposes IR-WSANet, a lightweight network based on an improved YOLOv10, which enhances the detection performance of infrared small targets through a frequency-spatial joint optimization strategy. Firstly, the discrete wavelet transform convolution (DWaveletConv) is introduced into the backbone network, and the fusion of high-frequency details and low-frequency semantics is enhanced by multi-band feature decomposition to suppress noise interference. Secondly, we designed a cooperative module (POS-SHSA) that integrates POSConvEmbedding with a partial channel single-head self-attention mechanism (SHSA), which combines local spatial features and global context information to improve the positioning accuracy of small targets. Experiments verify the effectiveness of the model on SIDD and HIT-UAV datasets: the mAP of IR-WSANet on SIDD-City, SIDD-Mountain and HIT-UAV datasets reaches 97.2%, 82.6% and 82.8%, respectively, which is 2.8% to 14.1% higher than the baseline YOLOv10, and the highest F1 score was improved to 14.8%, while maintaining low computing cost (27.9 GFLOPs) and real-time performance (42.8 FPS). The results show that IR-WSANet significantly improves the detection performance of infrared small targets in complex scenes through the combination of frequency domain filtering enhancement and space-channel dual attention mechanism.

5.0Engineering value
7.0Research novelty
5.0Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment