Object detection in varying traffic scenes presents significant challenges in real-world applications. Thermal imagery is widely acknowledged to complement RGB-based detection, especially under suboptimal lighting conditions. However, harnessing the combined potential of RGB and thermal images remains a formidable task. We address this by adaptively fusing information from both modalities under illumination guidance. Specifically, we propose illumination-guided crossmodal attention transformer fusion (ICATF), a novel object detection framework that effectively integrates features from RGB and thermal data. An illumination-guided module adapts features to the current lighting conditions, steering the learning process toward the most informative fusion of the two modalities. We further incorporate frequency-domain convolutions within the network's backbone to capture spectral context and extract richer features. Finally, the complementary modality features are fused for multispectral pedestrian detection through illumination-guided feature weights and a transformer fusion architecture. Experimental results on the multispectral detection datasets FLIR-aligned, LLVIP, and KAIST show that our method achieves state-of-the-art performance.
Ruilin Xie, Sheng Jiang, Yiming Bie, Miaolei Xia, "Illumination-guided with Crossmodal Transformer Fusion for RGB-T Object Detection," Journal of Imaging Science and Technology, 2025, pp. 1-11, https://doi.org/10.2352/J.ImagingSci.Technol.2025.69.2.020505
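To make the fusion idea in the abstract concrete, the following is a minimal PyTorch sketch, not the authors' ICATF implementation: a small illumination gate predicts a weight from the RGB input, the weight balances the RGB and thermal feature streams, and a bidirectional cross-attention block fuses them. The module names (IlluminationGate, CrossModalFusion), layer sizes, and the use of nn.MultiheadAttention are illustrative assumptions; the frequency-domain backbone convolutions are not shown here.

```python
# Hedged sketch of illumination-guided, cross-attention RGB-T feature fusion.
# Not the paper's code; names and dimensions are illustrative assumptions.
import torch
import torch.nn as nn


class IlluminationGate(nn.Module):
    """Predicts an illumination score w in [0, 1] from the RGB image;
    w weights the RGB stream and (1 - w) weights the thermal stream."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),         # global context of the RGB image
            nn.Flatten(),
            nn.Linear(3, 16), nn.ReLU(),
            nn.Linear(16, 1), nn.Sigmoid(),  # ~1 for well-lit, ~0 for dark scenes
        )

    def forward(self, rgb_image):
        return self.net(rgb_image)           # shape (B, 1)


class CrossModalFusion(nn.Module):
    """Fuses illumination-weighted RGB/thermal feature maps with
    bidirectional cross-attention (one attention layer per direction)."""

    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.rgb_to_t = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.t_to_rgb = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.proj = nn.Linear(2 * dim, dim)

    def forward(self, f_rgb, f_t, w):
        # f_rgb, f_t: (B, C, H, W) feature maps from the two backbones.
        b, c, h, wd = f_rgb.shape
        rgb = f_rgb.flatten(2).transpose(1, 2)   # (B, HW, C) token sequences
        tir = f_t.flatten(2).transpose(1, 2)

        # Illumination-guided weighting: emphasize RGB under good lighting,
        # thermal under poor lighting.
        rgb = rgb * w.view(b, 1, 1)
        tir = tir * (1.0 - w).view(b, 1, 1)

        # Each modality queries the other modality's tokens.
        rgb_att, _ = self.rgb_to_t(rgb, tir, tir)
        tir_att, _ = self.t_to_rgb(tir, rgb, rgb)

        fused = self.proj(torch.cat([rgb_att, tir_att], dim=-1))
        return fused.transpose(1, 2).reshape(b, c, h, wd)


if __name__ == "__main__":
    rgb_img = torch.rand(2, 3, 256, 256)
    f_rgb = torch.rand(2, 256, 32, 32)   # stand-ins for backbone features
    f_t = torch.rand(2, 256, 32, 32)
    w = IlluminationGate()(rgb_img)
    fused = CrossModalFusion()(f_rgb, f_t, w)
    print(fused.shape)                   # torch.Size([2, 256, 32, 32])
```

In this sketch the fused map would feed a standard detection head; the key design choice mirrored from the abstract is that the illumination score gates how much each modality contributes before the transformer-style cross-attention combines them.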