Document Image Classification on the Basis of Layout Information

Sergey Zavalishin; Andrey Bout; Ilya Kurilin; Michail Rychagov

doi:10.2352/ISSN.2470-1173.2017.2.VIPC-412

Abstract

In this paper, we propose a document image classification framework based on layout information. Our method does not use OCR; hence, it is completely language independent. Still we are able to exploit text data by extracting text regions with a novel MSER-based approach. Our MSER formulation provides great robustness against text distortions in comparison to the existing one. We introduce two types of novel image descriptors supplemented with Fisher vectors, based on Bernoulli mixture model. Classifiers, based on aforementioned descriptors, are assembled into meta-classification system that is able to classify document in complex cases when individual classifier accuracy is poor. Our meta-classification system demonstrates low processing time comparable to a single classifier. We show that our method outperforms the existing ones by the means of classification accuracy for a wide range of documents of both well-known and machine-generated document datasets.

72010604

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

10.2352/ISSN.2470-1173.2017.2.VIPC-412

2470-1173(20170129)2017:2L.78;1-

s15.phd

/ist/ei/2017/00002017/00000002/art00015

Articles

Document Image Classification on the Basis of Layout Information

ZavalishinSergey

BoutAndrey

KurilinIlya

RychagovMichail

29012017

2017

DOCUMENT CLASSIFICATIONCLASSIFIER ENSEMBLINGFISHER VECTORSRUN LENGTH DESCRIPTORLOCAL BINARY PATTERNS

articleview.keywords