Back to articles
Proceedings Paper
Volume: 36 | Article ID: IMAGE-241
Image
Self-Attention Enhanced Recognition: A Unified Model for Handwriting and Scene-text Recognition with Improved Inference
  DOI :  10.2352/EI.2024.36.8.IMAGE-241  Published OnlineJanuary 2024
Abstract
Abstract

In this paper, we introduce a unified handwriting and scene-text recognition model tailored to discern both printed and hand-written text images. Our primary contribution is the incorporation of the self-attention mechanism, a salient feature of the transformer architecture. This incorporation leads to two significant advantages: 1) A substantial improvement in the recognition accuracy for both scene-text and handwritten text, and 2) A notable decrease in inference time, addressing a prevalent challenge faced by modern recognizers that utilize sequence-based decoding with attention.

Subject Areas :
Views 130
Downloads 48
 articleview.views 130
 articleview.downloads 48
  Cite this article 

Gaurav Patel, Taewook Kim, Qian Lin, Jan P. Allebach, Qiang Qiu, "Self-Attention Enhanced Recognition: A Unified Model for Handwriting and Scene-text Recognition with Improved Inferencein Electronic Imaging,  2024,  pp 241-1 - 241-6,  https://doi.org/10.2352/EI.2024.36.8.IMAGE-241

 Copy citation
  Copyright statement 
Copyright © 2024, Society for Imaging Science and Technology 2024
ei
Electronic Imaging
2470-1173
2470-1173
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA