In this paper, we present a deep-learning approach that unifies handwriting and scene-text detection in images. Specifically, we adopt adversarial domain generalization to improve text detection across different domains and extend the conventional dice loss to provide extra training guidance. Furthermore, we build a new benchmark dataset that comprehensively captures various handwritten and scene text scenarios in images. Our extensive experimental results demonstrate the effectiveness of our approach in generalizing detection across both handwriting and scene text.
Taewook Kim, Gaurav Patel, Qian Lin, Jan P. Allebach, Qiang Qiu, "Generalizing Handwriting and Scene-Text Detection in Images" in Electronic Imaging, 2024, pp 242-1 - 242-6, https://doi.org/10.2352/EI.2024.36.8.IMAGE-242