Generalizing Handwriting and Scene-Text Detection in Images

Taewook  Kim; Gaurav  Patel; Qian  Lin; Jan P. Allebach; Qiang  Qiu

doi:10.2352/EI.2024.36.8.IMAGE-242

Abstract

In this paper, we present a deep-learning approach that unifies handwriting and scene-text detection in images. Specifically, we adopt adversarial domain generalization to improve text detection across different domains and extend the conventional dice loss to provide extra training guidance. Furthermore, we build a new benchmark dataset that comprehensively captures various handwritten and scene text scenarios in images. Our extensive experimental results demonstrate the effectiveness of our approach in generalizing detection across both handwriting and scene text.

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA

10.2352/EI.2024.36.8.IMAGE-242

IMAGE-242

Proceedings Paper

Generalizing Handwriting and Scene-Text Detection in Images

KimTaewook

Purdue University, US

PatelGaurav

Purdue University, US

LinQian

HP Inc., US

AllebachJan P.

Purdue University, US

QiuQiang

Purdue University, US

Abstract

2112024

IMAGE

Imaging and Multimedia Analytics at the Edge 2024

242-1

242-6

2024

Computer VisionGeneralizationHandwritingsOptical Character Recognition (OCR)Text DetectionText Recognition

articleview.keywords