Back to articles
Article
Volume: 35 | Article ID: IMAGE-282
Image
Lightweight single pass numerical reading extraction for displays in the wild
  DOI :  10.2352/EI.2023.35.7.IMAGE-282  Published OnlineJanuary 2023
Abstract
Abstract

Although considerable progress has been made in recognizing multi-character text from images, there are still cases where there is a lack of robust computationally-efficient methods that can execute on portable devices to read device displays in the wild. We specifically address the problem of parsing digits from 7 segment displays. Recognizing these displays is important for many tasks such as assisting users with tasks using augmented reality agents that need to verify actions or connecting legacy devices to the internet for process control using cheap cameras. Legacy techniques based on image processing operators and OCR are brittle whereas massive deep networks are too computationally expensive. We describe a computationally tractable VGG style backbone combined with a novel digit inference head that can be trained using a synthetic display generator with novel augmentations. We show the model trained on augmented synthetic data generalizes well to a corpus of real-world display images getting 97.8% single-frame accuracy and obtaining a throughput of 30 frames per second. We describe how the output can be further stabilized to improve accuracy through a kind of mode filtering.

Subject Areas :
Views 58
Downloads 9
 articleview.views 58
 articleview.downloads 9
  Cite this article 

Shanmukha Yenneti, Yan-Ming Chiou, Bob Price, "Lightweight single pass numerical reading extraction for displays in the wildin Electronic Imaging,  2023,  pp 282--1 - 282-4,  https://doi.org/10.2352/EI.2023.35.7.IMAGE-282

 Copy citation
  Copyright statement 
Copyright © 2023, Society for Imaging Science and Technology 2023
ei
Electronic Imaging
2470-1173
2470-1173
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA