Object detection, including single-frame detection in video, has advanced substantially in recent years, particularly with deep-learning-based approaches demonstrating strong performance. However, these detectors often struggle in practical scenarios such as the analysis of video frames captured by unmanned aerial vehicles, where objects are frequently small, vary widely in scale, are densely distributed, or suffer from motion blur. To address these challenges, we propose a new feature extraction network, the Attention-based Weighted Fusion Network. Our method incorporates a Self-Attention Residual Block to enhance feature extraction. To accurately locate and identify objects of interest, we introduce a Mixed Attention Module, which significantly improves detection accuracy. Additionally, we assign an adaptive learnable weight to each feature map so that feature maps of different resolutions contribute according to their importance during fusion. We evaluate our method on two datasets, PASCAL VOC and VisDrone2019. Experimental results demonstrate that the proposed method outperforms the baseline and other detectors: it achieves 87.1% mean average precision on the PASCAL VOC 2007 test set, surpassing the baseline by 3.1% AP50, and it exhibits lower false and missed detection rates than competing detectors.
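To make the fusion idea concrete, the sketch below shows one plausible PyTorch realization of adaptive learnable weights over multi-resolution feature maps. The abstract does not specify the paper's exact formulation, so the module name `WeightedFusion`, the ReLU-plus-normalization weighting scheme (a common "fast normalized fusion" choice), and the nearest-neighbor resizing are all assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WeightedFusion(nn.Module):
    """Fuses multi-resolution feature maps with adaptive learnable weights.

    A hypothetical sketch: each input map receives a scalar weight learned
    during training; weights are kept non-negative with ReLU and normalized
    to sum to one, so maps at different resolutions contribute according to
    their learned importance.
    """

    def __init__(self, num_inputs: int, eps: float = 1e-4):
        super().__init__()
        # One learnable scalar weight per incoming feature map.
        self.weights = nn.Parameter(torch.ones(num_inputs))
        self.eps = eps

    def forward(self, feats):
        # Resize every map to the spatial size of the first (highest-resolution) map.
        target = feats[0].shape[-2:]
        feats = [
            f if f.shape[-2:] == target
            else F.interpolate(f, size=target, mode="nearest")
            for f in feats
        ]
        # Non-negative, normalized weights emphasize the most useful scales.
        w = F.relu(self.weights)
        w = w / (w.sum() + self.eps)
        return sum(wi * fi for wi, fi in zip(w, feats))


# Example: fuse three pyramid levels with matching channel counts.
fusion = WeightedFusion(num_inputs=3)
p3 = torch.randn(1, 256, 64, 64)
p4 = torch.randn(1, 256, 32, 32)
p5 = torch.randn(1, 256, 16, 16)
fused = fusion([p3, p4, p5])  # -> shape (1, 256, 64, 64)
```

Because the weights are ordinary parameters, they are updated by backpropagation along with the rest of the network, letting the detector learn which resolutions matter most for small or densely distributed objects.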
Ruixing Yu, Chuyin Wang, and Yifei Tang, "Attention-based Weighted Fusion Network for Object Detection," Journal of Imaging Science and Technology, 2024, pp. 1–18, https://doi.org/10.2352/J.ImagingSci.Technol.2024.68.6.060508