Paper
19 February 2024 FF-DETR: feature-fusion transformers for end-to-end object detection
Jinhao Xu, Tao Ma, Junping Xu, Jianlin Zhang
Author Affiliations +
Proceedings Volume 13063, Fourth International Conference on Computer Vision and Data Mining (ICCVDM 2023); 130631C (2024) https://doi.org/10.1117/12.3021368
Event: Fourth International Conference on Computer Vision and Data Mining (ICCVDM 2023), 2023, Changchun, China
Abstract
Object detection is a challenging task in computer vision that involves predicting both the class and the location of objects in an image. Most existing methods rely on convolutional neural networks and hand-crafted modules, such as anchor boxes and non-maximum suppression. Recently, a novel end-to-end approach called DETR was proposed, which uses a transformer encoder-decoder structure to model object detection as a set prediction problem. However, DETR suffers from some limitations, such as poor performance on small objects and slow convergence speed. In this paper, we propose FF-DETR, a feature-fusion detection transformer that improves the performance and convergence speed of DETR-like models. FF-DETR introduces three feature fusion modules: (1) Contour Fusion FPN, which fuses multi-scale features using self-attention and deformable convolution; (2) Position-Content Query Fusion, which initializes the content query features by fusing the position query features and the encoder output features; and (3) Global Decoder Layer Fusion, which fuses the outputs of each decoder layer and updates the position query features iteratively. We conduct experiments on the COCO dataset and show that FF-DETR outperforms DETR and other variants in terms of accuracy and efficiency.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jinhao Xu, Tao Ma, Junping Xu, and Jianlin Zhang "FF-DETR: feature-fusion transformers for end-to-end object detection", Proc. SPIE 13063, Fourth International Conference on Computer Vision and Data Mining (ICCVDM 2023), 130631C (19 February 2024); https://doi.org/10.1117/12.3021368
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top