Paper
3 January 2025 DSSNet: a transformer-based network for dense scene text detection and recognition in complex environments
Jing Li, Huabing Zhou
Author Affiliations +
Proceedings Volume 13442, Fifth International Conference on Signal Processing and Computer Science (SPCS 2024); 134420M (2025) https://doi.org/10.1117/12.3053112
Event: Fifth International Conference on Signal Processing and Computer Science (SPCS 2024), 2024, Kaifeng, China
Abstract
This paper presents a novel approach for dense scene text detection called DSSNet (Dense Script Spotter Network). The network leverages ResNet and FPN for feature extraction, employing multi-scale feature fusion and Transformer-based feature processing to enhance text recognition across varying sizes. The method generates text instance shapes using Bézier central curves and performs text recognition by integrating positional query information. Experimental results on the DSTD1500 and ICDAR2015 datasets demonstrate that DSSNet outperforms existing methods in terms of text localization accuracy, recognition accuracy, and annotation flexibility.
(2025) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jing Li and Huabing Zhou "DSSNet: a transformer-based network for dense scene text detection and recognition in complex environments", Proc. SPIE 13442, Fifth International Conference on Signal Processing and Computer Science (SPCS 2024), 134420M (3 January 2025); https://doi.org/10.1117/12.3053112
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Detection and tracking algorithms

Transformers

Network architectures

Image segmentation

Visualization

Feature fusion

RELATED CONTENT

FE VIT a faster and extensible vision transformer based...
Proceedings of SPIE (October 18 2022)
Visual inertial SLAM system based on ELSED point and line...
Proceedings of SPIE (January 03 2025)
Hierarchical OCR for printed Tamil text
Proceedings of SPIE (March 15 2019)
Object recognition based on spatial active basis template
Proceedings of SPIE (December 02 2011)

Back to Top