Volume: 30 | Article ID: art00014
Text/Figure Separation in Document Images Using Docstrum Descriptor and Two-Level Clustering
DOI: 10.2352/ISSN.2470-1173.2018.2.VIPC-253 | Published Online: January 2018
Abstract

We propose a novel algorithm for text/figure separation tailored to binary document images containing line drawings, block diagrams, charts, schemes, and other kinds of business graphics. Most approaches to this task rely either on a carefully designed visual descriptor that makes text and graphics regions easy to distinguish, or on supervised learning from a dataset of labeled text/figure regions. Such approaches often provide only moderate separation accuracy when applied to document images that contain a very diverse set of figure classes and lack a sufficiently representative labeled training dataset. In contrast, our method is well suited to a wide variety of figure classes and can operate in either semi-supervised or unsupervised mode. We achieve this by applying unsupervised learning algorithms to Docstrum descriptors extracted from regions of interest, followed by semi-supervised label propagation or unsupervised label inference. Another advantage of our method is its suitability for large-scale data processing, which is achieved through an efficient kernel-approximating feature mapping applied to the Docstrum descriptors and through two-level clustering: a fast mini-batch K-means algorithm is first applied to the large-scale data, and only the small number of resulting cluster centroids is subsequently processed by one of the more sophisticated clustering algorithms.
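
The two-level clustering idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes plain numeric vectors in place of Docstrum descriptors, omits the kernel-approximating feature mapping, and uses single-linkage agglomerative merging as a stand-in for the unspecified "more sophisticated" second-level algorithm. All function names and parameters (`mini_batch_kmeans`, `merge_centroids`, `two_level_cluster`, `batch_size`, `n_iter`) are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centers):
    """Index of the center closest to the given point."""
    return min(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))

def mini_batch_kmeans(data, k, batch_size=32, n_iter=100, seed=0):
    """Level 1: cheap mini-batch K-means over the full data set."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(data, k)]
    counts = [0] * k
    for _ in range(n_iter):
        batch = rng.sample(data, min(batch_size, len(data)))
        # Assign the whole batch first, then update the centers.
        assigned = [nearest(p, centers) for p in batch]
        for p, j in zip(batch, assigned):
            counts[j] += 1
            eta = 1.0 / counts[j]  # per-center learning rate
            centers[j] = [(1 - eta) * c + eta * x
                          for c, x in zip(centers[j], p)]
    return centers

def merge_centroids(centers, n_groups):
    """Level 2 (assumed stand-in): single-linkage agglomerative
    clustering applied only to the small set of centroids."""
    groups = [[i] for i in range(len(centers))]

    def linkage(g, h):
        return min(sq_dist(centers[i], centers[j]) for i in g for j in h)

    while len(groups) > n_groups:
        pairs = ((a, b) for a in range(len(groups))
                 for b in range(a + 1, len(groups)))
        a, b = min(pairs, key=lambda ab: linkage(groups[ab[0]], groups[ab[1]]))
        merged = groups.pop(b)  # b > a, so index a stays valid
        groups[a].extend(merged)

    labels = [0] * len(centers)
    for group_id, members in enumerate(groups):
        for i in members:
            labels[i] = group_id
    return labels

def two_level_cluster(data, n_centroids, n_groups, seed=0):
    """Label every point with the group of its nearest level-1 centroid."""
    centers = mini_batch_kmeans(data, n_centroids, seed=seed)
    centroid_labels = merge_centroids(centers, n_groups)
    return [centroid_labels[nearest(p, centers)] for p in data]

# Toy usage: two synthetic 2-D blobs standing in for descriptor vectors.
rng = random.Random(1)
blob_a = [[rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)] for _ in range(200)]
blob_b = [[rng.gauss(10.0, 0.5), rng.gauss(10.0, 0.5)] for _ in range(200)]
labels = two_level_cluster(blob_a + blob_b, n_centroids=8, n_groups=2)
print(len(set(labels)), "groups found")
```

The point of the design is that the quadratic-cost second stage only ever sees `n_centroids` vectors, while the full data set is touched only by the cheap mini-batch updates and one final nearest-centroid pass.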

  Cite this article 

Valery Anisimovskiy, Ilya Kurilin, Andrey Shcherbinin, Petr Pohl, "Text/Figure Separation in Document Images Using Docstrum Descriptor and Two-Level Clustering," in Proc. IS&T Int’l. Symp. on Electronic Imaging: Visual Information Processing and Communication IX, 2018, pp. 253-1 - 253-12, https://doi.org/10.2352/ISSN.2470-1173.2018.2.VIPC-253

  Copyright statement 
Copyright © Society for Imaging Science and Technology 2018
Electronic Imaging
2470-1173
Society for Imaging Science and Technology