Adaptive Bit Depth Control for Neural Network Quantization

Youngil  Seo; Dongpan  Lim; Jungguk  Lee; Seongwook  Song

doi:10.2352/EI.2024.36.7.ISS-285

Abstract

Recently, many deep learning applications have been used on the mobile platform. To deploy them in the mobile platform, the networks should be quantized. The quantization of computer vision networks has been studied well but there have been few studies for the quantization of image restoration networks. In previous study, we studied the effect of the quantization of activations and weight for deep learning network on image quality following previous study for weight quantization for deep learning network. In this paper, we made adaptive bit-depth control of input patch while maintaining the image quality similar to the floating point network to achieve more quantization bit reduction than previous work. Bit depth is controlled adaptive to the maximum pixel value of the input data block. It can preserve the linearity of the value in the block data so that the deep neural network doesn't need to be trained by the data distribution change. With proposed method we could achieve 5 percent reduction in hardware area and power consumption for our custom deep network hardware while maintaining the image quality in subejctive and objective measurment. It is very important achievement for mobile platform hardware.

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA

10.2352/EI.2024.36.7.ISS-285

ISS-285

Proceedings

Adaptive Bit Depth Control for Neural Network Quantization

SeoYoungil

Samsung Electronics Ltd, Republic of Korea

LimDongpan

Samsung Electronics Ltd, Republic of Korea

LeeJungguk

Samsung Electronics Ltd, Republic of Korea

SongSeongwook

Samsung Electronics Ltd, Republic of Korea

Abstract

21012024

ISS

Imaging Sensors and Systems 2024

285-1

285-6

2024

Convolutional Neural NetworkDeep learningDenoisingImage RestorationImaging sensorQuantization

articleview.keywords