The dynamic range of long-wave infrared (LWIR) cameras often exceeds 8 bits, so their images must be visualized with techniques such as histogram equalization. Many visualization methods do not account for noise, which must be handled in real situations. We propose a novel LWIR image visualization method based on gradient-domain processing, or gradient mapping. Processing in the gradient domain based on intensity and gradient power enables LWIR images to be visualized with simultaneous noise reduction. We evaluate the proposed method quantitatively and qualitatively and show its effectiveness.
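As a rough illustration of the gradient-mapping idea, the sketch below compresses a >8-bit LWIR frame in the gradient domain and reconstructs an 8-bit image by a Poisson solve. The abstract does not specify the mapping; the Fattal-style attenuation, the `alpha`/`beta`/`noise_floor` parameters, and the noise-gating rule here are all illustrative assumptions, not the authors' exact method.

```python
import numpy as np

def visualize_lwir(img16, alpha=0.1, beta=0.85, noise_floor=1e-3):
    # Log-compress the >8-bit intensities before gradient processing.
    img = np.log1p(img16.astype(np.float64))
    gx = np.diff(img, axis=1, append=img[:, -1:])   # forward differences
    gy = np.diff(img, axis=0, append=img[-1:, :])
    mag = np.sqrt(gx**2 + gy**2) + 1e-12
    # Fattal-style attenuation: compress strong gradients (beta < 1) ...
    scale = (alpha / mag) * (mag / alpha) ** beta
    # ... and zero out gradients below an assumed noise floor (noise reduction).
    scale[mag < noise_floor] = 0.0
    gx, gy = gx * scale, gy * scale
    # Recover an image whose gradients match (gx, gy): solve the
    # Poisson equation laplace(u) = div(g) in the Fourier domain.
    div = (np.diff(gx, axis=1, prepend=gx[:, :1])
           + np.diff(gy, axis=0, prepend=gy[:, :1]))
    h, w = img.shape
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    denom = 2.0 * (np.cos(2 * np.pi * fx) + np.cos(2 * np.pi * fy) - 2.0)
    denom[0, 0] = 1.0                                # fix the free DC term
    u = np.real(np.fft.ifft2(np.fft.fft2(div) / denom))
    u -= u.min()
    return (255.0 * u / (u.max() + 1e-12)).astype(np.uint8)
```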
Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most available datasets consist of standard-dynamic-range (SDR) images collected under standard and possibly uncontrolled viewing conditions. Popular pre-trained neural networks are likewise intended for SDR inputs, restricting their direct application to HDR content. On the other hand, training HDR models from scratch is challenging due to the limited HDR data available. In this work, we explore more effective approaches for training deep-learning-based models for image quality assessment (IQA) on HDR data. We leverage networks pre-trained on SDR data (the source domain) and re-target these models to HDR (the target domain) with additional fine-tuning and domain adaptation. We validate our methods on the available HDR IQA datasets, demonstrating that models trained with our combined recipe outperform previous baselines, converge much more quickly, and reliably generalize to HDR inputs.
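A minimal sketch of the re-targeting idea, assuming a ResNet-18 backbone and a toy power-law stand-in for a PU-style perceptual encoding (the actual paper's transform, backbone, and training recipe may differ):

```python
import torch
import torch.nn as nn
import torchvision.models as models

def pu_encode(hdr, a=1.0, b=0.5, g=0.2):
    """Map linear HDR luminance to an approximately perceptually uniform
    code so the SDR-trained backbone sees familiar statistics.
    (Illustrative power law, not the paper's exact transform.)"""
    return a * torch.clamp(hdr, min=1e-6) ** g + b

# Start from an SDR-pretrained network (source domain) ...
backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
backbone.fc = nn.Linear(backbone.fc.in_features, 1)  # quality-score head

# ... and fine-tune gently on the scarce HDR IQA data (target domain).
opt = torch.optim.Adam(backbone.parameters(), lr=1e-5)
loss_fn = nn.MSELoss()

def train_step(hdr_batch, mos_batch):
    opt.zero_grad()
    pred = backbone(pu_encode(hdr_batch)).squeeze(1)
    loss = loss_fn(pred, mos_batch)
    loss.backward()
    opt.step()
    return loss.item()
```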
It has been rigorously demonstrated that an end-to-end (E2E) differentiable formulation of a deep neural network can turn a complex recognition problem into a unified optimization task solvable by gradient descent. Although E2E network optimization yields powerful fitting ability, the joint optimization of layers is known to potentially create situations in which layers co-adapt to one another in complex ways that harm generalization ability. This work numerically evaluates, with careful hyperparameter tuning, the generalization ability of a particular non-E2E network optimization approach known as FOCA (Feature-extractor Optimization through Classifier Anonymization), which helps avoid such complex co-adaptation. In this report, we present intriguing empirical results in which the non-E2E-trained models consistently outperform the corresponding E2E-trained models on three image-classification datasets. We further show that E2E network fine-tuning, applied after feature-extractor optimization by FOCA and subsequent classifier optimization with the fixed feature extractor, indeed gives no improvement in test accuracy. The source code is available at https://github.com/DensoITLab/FOCA-v1.
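A heavily hedged sketch of the two-phase, non-E2E structure described above. How FOCA "anonymizes" classifiers is not detailed in the abstract; here we simply fit throwaway random linear heads for a few steps and update the extractor against them, which is an illustrative assumption only (the official repository linked above is the authoritative reference).

```python
import torch
import torch.nn as nn

def foca_phase(feature_extractor, loader, feat_dim, n_classes,
               n_heads=8, head_steps=5, device="cpu"):
    """Phase 1: optimize the feature extractor against 'anonymous'
    classifiers so it cannot co-adapt to any single head."""
    opt_f = torch.optim.Adam(feature_extractor.parameters(), lr=1e-4)
    ce = nn.CrossEntropyLoss()
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        loss = 0.0
        for _ in range(n_heads):
            head = nn.Linear(feat_dim, n_classes).to(device)
            opt_h = torch.optim.SGD(head.parameters(), lr=0.1)
            # Briefly fit a throwaway head on frozen features ...
            with torch.no_grad():
                z = feature_extractor(x)
            for _ in range(head_steps):
                opt_h.zero_grad()
                ce(head(z), y).backward()
                opt_h.step()
            # ... then accumulate extractor loss against this anonymous head.
            loss = loss + ce(head(feature_extractor(x)), y)
        opt_f.zero_grad()
        (loss / n_heads).backward()
        opt_f.step()

# Phase 2 (not shown): freeze the extractor and train a final classifier.
```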
The majority of internet traffic is video content, which drives the demand for video compression to deliver high-quality video at low target bitrates. This paper investigates the impact of adjusting the rate-distortion equation on compression performance. A constant of proportionality, k, is used to modify the Lagrange multiplier used in H.265 (HEVC). Direct optimisation methods are deployed to maximise the BD-Rate improvement for a particular clip, yielding up to 21% BD-Rate improvement for an individual clip. Furthermore, we use a more realistic corpus of material provided by YouTube. The results show that direct optimisation using BD-Rate as the objective function can lead to further bitrate savings that are not available with previous approaches.
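In other words, the encoder's cost J = D + k·λ·R is minimised with a per-clip scale k found by direct search. A minimal sketch of such a search, where `encode_and_measure` and `bd_rate` are hypothetical stand-ins for an HEVC encode pass and a BD-Rate computation (neither is specified by the paper):

```python
from scipy.optimize import minimize_scalar

def bd_rate_gain(k, clip, anchor_curve):
    # Hypothetical helpers: encode the clip with a scaled Lagrange
    # multiplier and score the RD curve against the unmodified anchor.
    rd_curve = encode_and_measure(clip, lagrange_scale=k)
    return bd_rate(anchor_curve, rd_curve)

def best_k(clip, anchor_curve):
    # Negate the gain because minimize_scalar minimises.
    res = minimize_scalar(lambda k: -bd_rate_gain(k, clip, anchor_curve),
                          bounds=(0.1, 10.0), method="bounded")
    return res.x
```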
We propose a novel deep-learning-based chemometric data analysis technique. We train an L2-regularized sparse autoencoder end-to-end to reduce the size of the feature vector, addressing the classic curse-of-dimensionality problem in chemometric data analysis. We introduce a novel technique for automatically selecting the number of nodes in the hidden layer of the autoencoder through Pareto optimization. A Gaussian process regressor is then applied to the reduced feature vector for regression. We evaluate our technique on orange juice and wine datasets and compare the results against three state-of-the-art methods. Quantitative results, reported as Normalized Mean Square Error (NMSE), show considerable improvement over the state of the art.
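A minimal sketch of the described pipeline: an L2-regularized sparse autoencoder compresses the spectra, then a Gaussian process regresses the target on the codes. Layer sizes, penalty weights, and the fixed bottleneck width here are illustrative assumptions (the paper selects the bottleneck via Pareto optimization, which is not reproduced here).

```python
import torch
import torch.nn as nn
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

class SparseAE(nn.Module):
    def __init__(self, d_in, d_code=20):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(d_in, 128), nn.ReLU(),
                                 nn.Linear(128, d_code))
        self.dec = nn.Sequential(nn.Linear(d_code, 128), nn.ReLU(),
                                 nn.Linear(128, d_in))

    def forward(self, x):
        z = self.enc(x)
        return self.dec(z), z

def fit(X, y, epochs=200, sparsity=1e-3, weight_decay=1e-4):
    X_t = torch.tensor(X, dtype=torch.float32)
    ae = SparseAE(X.shape[1])
    # weight_decay supplies the L2 regularization on the AE weights.
    opt = torch.optim.Adam(ae.parameters(), lr=1e-3,
                           weight_decay=weight_decay)
    mse = nn.MSELoss()
    for _ in range(epochs):
        opt.zero_grad()
        recon, z = ae(X_t)
        # Reconstruction loss plus an L1 sparsity penalty on the code.
        loss = mse(recon, X_t) + sparsity * z.abs().mean()
        loss.backward()
        opt.step()
    with torch.no_grad():
        codes = ae.enc(X_t).numpy()
    gpr = GaussianProcessRegressor(kernel=RBF()).fit(codes, y)
    return ae, gpr
```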
Automated driving requires fusing information from a multitude of sensors, such as cameras, radars, and lidars mounted around the car, to handle various driving scenarios, e.g., highway, parking, urban driving, and traffic jams. Fusion also enables better functional safety by handling challenging conditions such as adverse weather, time of day, and occlusion. This paper gives an overview of popular fusion techniques, namely the Kalman filter and its variants, e.g., the Extended Kalman filter and the Unscented Kalman filter, and proposes a choice of fusion technique and model parameters for a given sensor configuration. The second part of the paper focuses on an efficient solution for series production on an embedded platform, Texas Instruments' TDAx Automotive SoC. Performance is benchmarked separately for the "predict" and "update" phases and for different sensor modalities. For typical L3/L4 automated driving with multiple cameras, radars, and lidars, fusion can be supported in real time by a single DSP using the proposed techniques, enabling a cost-optimized solution.
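For reference, the two phases benchmarked separately above are the standard linear Kalman filter steps; the sketch below uses generic placeholder matrices (F, Q, H, R), not the paper's sensor models.

```python
import numpy as np

def kf_predict(x, P, F, Q):
    """Propagate state x and covariance P through the motion model F."""
    x = F @ x
    P = F @ P @ F.T + Q
    return x, P

def kf_update(x, P, z, H, R):
    """Correct the prediction with a sensor measurement z."""
    S = H @ P @ H.T + R                    # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)         # Kalman gain
    x = x + K @ (z - H @ x)
    P = (np.eye(len(x)) - K @ H) @ P
    return x, P
```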