Volume: 28 | Article ID: art00009
Improving a deep convolutional neural network architecture for character recognition
DOI: 10.2352/ISSN.2470-1173.2016.17.DRR-060 | Published Online: February 2016

Deep architectures based on convolutional neural networks have obtained state-of-the-art results for several recognition tasks. These architectures rely on a cascade of convolutional layers and activation functions. Beyond the number of layers and the number of neurons in each layer, the choice of activation function, training optimization algorithm and regularization procedure is of great importance. In this work we start from a deep convolutional architecture and describe the effect of recent activation functions, optimization algorithms and regularization procedures when applied to the recognition of handwritten digits from the MNIST dataset. The network achieves a 0.38% error rate, matching and slightly improving on the best known performance of a single model trained without data augmentation at the time the experiments were performed.
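To put the reported error rate in concrete terms, the short sketch below converts between an error rate in percent and a count of misclassified digits. It assumes the evaluation used the standard MNIST test split of 10,000 images, which the abstract does not state explicitly; the function names are illustrative and do not come from the paper.

```python
# Assumption: the standard MNIST test split (10,000 images) was used for evaluation.
MNIST_TEST_SIZE = 10_000


def misclassified_count(error_rate_percent: float, n_samples: int = MNIST_TEST_SIZE) -> int:
    """Convert an error rate in percent into a whole number of misclassified samples."""
    return round(error_rate_percent / 100 * n_samples)


def error_rate_percent(n_errors: int, n_samples: int = MNIST_TEST_SIZE) -> float:
    """Convert a number of misclassified samples into an error rate in percent."""
    return 100 * n_errors / n_samples


if __name__ == "__main__":
    print(misclassified_count(0.38))  # errors implied by a 0.38% error rate
    print(error_rate_percent(38))     # error rate implied by 38 errors
```

Under that assumption, a 0.38% error rate corresponds to 38 misclassified digits out of 10,000, so a difference of 0.01 percentage points between two models amounts to a single test image.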

  Cite this article 

Bogdan-Ionuţ Cirstea, Laurence Likforman-Sulem, "Improving a deep convolutional neural network architecture for character recognition," in Proc. IS&T Int’l. Symp. on Electronic Imaging: Document Recognition and Retrieval XXIII, 2016.

  Copyright statement 
Copyright © Society for Imaging Science and Technology 2016
Electronic Imaging
Society for Imaging Science and Technology