Lightweight single pass numerical reading extraction for displays in the wild

Shanmukha  Yenneti; Yan-Ming  Chiou; Bob  Price

doi:10.2352/EI.2023.35.7.IMAGE-282

Abstract

Although considerable progress has been made in recognizing multi-character text from images, there are still cases where there is a lack of robust computationally-efficient methods that can execute on portable devices to read device displays in the wild. We specifically address the problem of parsing digits from 7 segment displays. Recognizing these displays is important for many tasks such as assisting users with tasks using augmented reality agents that need to verify actions or connecting legacy devices to the internet for process control using cheap cameras. Legacy techniques based on image processing operators and OCR are brittle whereas massive deep networks are too computationally expensive. We describe a computationally tractable VGG style backbone combined with a novel digit inference head that can be trained using a synthetic display generator with novel augmentations. We show the model trained on augmented synthetic data generalizes well to a corpus of real-world display images getting 97.8% single-frame accuracy and obtaining a throughput of 30 frames per second. We describe how the output can be further stabilized to improve accuracy through a kind of mode filtering.

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA

10.2352/EI.2023.35.7.IMAGE-282

IMAGE-282

Article

Lightweight single pass numerical reading extraction for displays in the wild

YennetiShanmukha

Palo Alto Research Center Incorporated, United States

ChiouYan-Ming

Palo Alto Research Center Incorporated, United States

PriceBob

Palo Alto Research Center Incorporated, United States

Abstract

1612023

IMAGE

Imaging and Multimedia Analytics at the Edge 2023

282--1

282-4

2023

digital displayseven segment displaymeter readingdigit extractiondigit inferencelegacy iot device interface

articleview.keywords