IS&T | Library

Open Set Domain Adaptation for Image Classification With Multiple Unknown Labels Using Unsupervised Clustering in a Target Domain

Abstract

With the emergence of 200-megapixel QxQ Bayer pattern image sensors, remosaic technology, which rearranges color filter arrays (CFAs) into Bayer patterns, has become increasingly important. However, the limitations of the remosaic algorithm in the sensor often result in artifacts that degrade the details and textures of the images. In this paper, we propose a deep learning-based artifact correction method to enhance image quality within a mobile environment while minimizing shutter lag. We generated a training dataset using a high-performance remosaic algorithm and trained a lightweight U-Net-based network. The proposed network effectively removes these artifacts, thereby improving the overall image quality. Additionally, it takes only about 15 ms to process a 4000x3000 image on a Galaxy S22 Ultra, making it suitable for real-time applications.

Digital Library: EI

Published Online: February 2025

Proceedings

Data annotation
Domain adaptation
Image classification
Image quality
Noise robustness
Pseudo labeling
Unsupervised clustering

Daichi Nishihara, Yoshihiro Midoh, Youyang Ng, Osamu Yamane, Maasa Takahashi, Shuhei Iijima, Jun Shiomi, Goh Itoh, Noriyuki Miura

DOI

10.2352/EI.2024.36.15.COIMG-162

Volume 36

Issue 15

Another look at SSIM image quality metric

Abstract

Domain adaptation, which transfers an existing system with teacher labels (source domain) to another system without teacher labels (target domain), has garnered significant interest to reduce human annotations and build AI models efficiently. Open set domain adaptation considers unknown labels in the target domain that were not present in the source domain. Conventional methods treat unknown labels as a single entity, but this assumption may not hold true in real-world scenarios. To address this challenge, we propose open set domain adaptation for image classification with multiple unknown labels. Assuming that there exists a discrepancy in the feature space between the known labels in the source domain and the unknown labels in the target domain based on their type, we can leverage clustering to classify the types of unknown labels by considering the pixel-wise feature distances between samples in the target domain and the known labels in the source domain. This enables us to assign pseudo-labels to target samples based on the classification results obtained through unsupervised clustering with an unknown number of clusters. Experimental results show that the accuracy of domain adaptation is improved by re-training using these pseudo-labels in a closed set domain adaptation setting.

Digital Library: EI

Published Online: January 2024

Article

Image quality
SSIM
PSNR
SNR
Subband decomposition
CSF

Yuriy Reznik

DOI

10.2352/EI.2023.35.8.IQSP-305

Volume 35

Issue 8

Abstract

We review the design of the SSIM quality metric and offer an alternative model of SSIM computation, utilizing subband decomposition and identical distance measures in each subband. We show that this model performs very close to the original and offers many advantages from a methodological standpoint. It immediately brings several possible explanations of why SSIM is effective. It also suggests a simple strategy for band noise allocation optimizing SSIM scores. This strategy may aid the design of encoders or pre-processing filters for video coding. Finally, this model leads to more straightforward mathematical connections between SSIM, MSE, and SNR metrics, improving previously known results.
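For orientation, the metrics named in this abstract can be sketched in a few lines. The block below is a minimal illustration of the standard MSE, PSNR, and single-window (global) SSIM definitions, not the subband model the paper proposes. The function names, the `peak` parameter, and the constants `k1 = 0.01` and `k2 = 0.03` (the commonly used SSIM defaults) are assumptions of this sketch.

```python
import numpy as np


def mse(x, y):
    """Mean squared error between two images of equal shape."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    return float(np.mean((x - y) ** 2))


def psnr(x, y, peak=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(x, y)
    if err == 0.0:
        return float("inf")
    return float(10.0 * np.log10(peak ** 2 / err))


def global_ssim(x, y, peak=255.0, k1=0.01, k2=0.03):
    """SSIM computed over the whole image as a single window.

    Practical SSIM averages this quantity over local windows; the
    single-window form is used here only to keep the sketch short.
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    c1 = (k1 * peak) ** 2  # stabilizes the luminance term
    c2 = (k2 * peak) ** 2  # stabilizes the contrast/structure term
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = np.mean((x - mu_x) * (y - mu_y))
    num = (2.0 * mu_x * mu_y + c1) * (2.0 * cov_xy + c2)
    den = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return float(num / den)
```

A constant brightness offset changes only the luminance term here, so `global_ssim(a, a + 2)` stays close to, but below, 1 while the MSE is exactly 4.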

Digital Library: EI

Published Online: January 2023

Decision-making on image denoising expedience

Denoising efficiency
Denoising expedience
BM3D
Performance prediction
AWGN
Image quality
Neural network

Andrii Rubel, Oleksii Rubel, Vladimir Lukin, Karen Egiazarian

Pages 237-1 - 237-7, January 2021, © Society for Imaging Science and Technology 2021

DOI

10.2352/ISSN.2470-1173.2021.10.IPAS-237

Volume 33

Issue 10

Image denoising is a classical preprocessing stage used to enhance images. However, it is well known that in many practical cases image denoising methods produce images of inadequate visual quality, which makes applying denoising useless. It is therefore desirable to detect such cases in advance and decide how expedient image denoising (filtering) is. This paper analyzes this problem for the well-known BM3D denoiser. We propose a decision-making algorithm on image denoising expedience for images corrupted by additive white Gaussian noise (AWGN). We also propose an algorithm that predicts subjective visual quality scores for denoised images using a trained artificial neural network. It is shown that this prediction is fast and accurate.

Digital Library: EI

Published Online: January 2021

Predicting Single Observer’s Votes from Objective Measures using Neural Networks

Deep neural networks
Image quality
Human vision
Observers Behavior

Lohic Fotio Tiotsop, Tomas Mizdos, Miroslav Uhrina, Peter Pocta, Marcus Barkowsky, Enrico Masala

Pages 130-1 - 130-8, January 2020, © Society for Imaging Science and Technology 2020

DOI

10.2352/ISSN.2470-1173.2020.11.HVEI-130

Volume 32

Issue 11

Recent decades have witnessed an increasing number of works proposing objective measures for media quality assessment, i.e., estimating the mean opinion score (MOS) of human observers. In this contribution, we investigate the possibility of modeling and predicting a single observer's opinion scores rather than the MOS. More precisely, we attempt to approximate the choices of a single observer by designing a neural network (NN) that is expected to mimic that observer's behavior in terms of visual quality perception. Once such NNs (one for each observer) are trained, they can be regarded as "virtual observers": they take information about a sequence as input and output the score that the corresponding observer would have given after watching that sequence. This new approach makes it possible to automatically obtain different opinions on the perceived visual quality of a sequence under investigation, and thus to estimate not only the MOS but also other statistical indexes such as the standard deviation of the opinions. Extensive numerical experiments are performed to provide further insight into the suitability of the approach.
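The aggregation step described in this abstract, turning per-observer scores into a MOS and a standard deviation, can be illustrated as follows. This is a minimal sketch that assumes the scores are already available as an array with one row per (virtual) observer and one column per sequence; the function name and the use of the sample standard deviation (`ddof=1`) are choices of this sketch, not details taken from the paper.

```python
import numpy as np


def opinion_statistics(scores):
    """Return (MOS, standard deviation) per sequence.

    `scores` has shape (num_observers, num_sequences); each row holds the
    scores given (or predicted) by one observer. Requires at least two
    observers because the sample standard deviation is used.
    """
    scores = np.asarray(scores, dtype=np.float64)
    mos = scores.mean(axis=0)
    std = scores.std(axis=0, ddof=1)  # sample standard deviation (assumption)
    return mos, std
```

For example, three observers scoring two sequences as `[[5, 3], [3, 3], [4, 3]]` give a MOS of 4 and 3, with standard deviations of 1 and 0.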

Digital Library: EI

Published Online: January 2020

Region of interest extraction for image quality assessment

Region of interest
Image quality
Print quality

Runzhe Zhang, Eric Maggard, Yousun Bang, Minki Cho, Jan Allebach

DOI

10.2352/ISSN.2470-1173.2020.9.IQSP-321

Volume 32

Issue 9

Print quality (PQ) is of primary importance in the printing industry. Detecting and analyzing print defects is an effective way to improve print quality. Because different types of print defects appear in different regions of interest (ROIs) in the digital image of a scanned page, extracting the different ROIs helps to detect and analyze printer defects. This paper proposes a method to extract different ROIs based on the digital image object map [1], which includes three different labels: raster (images or pictures), vector (background and smooth gradient color areas), and symbol (symbols and text). Our ROI extraction method extracts four kinds of ROIs based on these three labeled objects, so we need to distinguish the background area and the smooth gradient color area (color vector) from other vector objects. The ROI extraction process consists of four parts, and each part extracts one kind of ROI. For the color vector and background ROI extraction part, we develop two approaches: one obtains the maximum-area rectangular ROI, and the other extracts the deepest rectangular ROI. With both approaches, we use a greedy algorithm to gather additional useful ROIs. In the final result of the ROI extraction process, we save only the top-left and bottom-right positions of each ROI. Finally, we design a Matlab GUI tool and label the ROI ground truth manually. We calculate the intersection over union (IoU) between the ROI extraction result and the manually labeled ROI ground truth to evaluate our ROI extraction algorithm, and check whether it is good enough to crop different ROIs from the image of the scanned page to detect and analyze print defects.
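The evaluation measure mentioned at the end of this abstract can be sketched for rectangular ROIs stored as top-left and bottom-right corners. This is a minimal illustration assuming each ROI is a tuple `(x0, y0, x1, y1)` with `x0 < x1` and `y0 < y1`; the tuple layout and the function name are assumptions of this sketch rather than the paper's data format.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned rectangles.

    Each box is (x0, y0, x1, y1): top-left corner, then bottom-right corner.
    Returns 0.0 when the boxes do not overlap.
    """
    ax0, ay0, ax1, ay1 = box_a
    bx0, by0, bx1, by1 = box_b

    # Width and height of the overlap, clamped at zero for disjoint boxes.
    inter_w = max(0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h

    area_a = (ax1 - ax0) * (ay1 - ay0)
    area_b = (bx1 - bx0) * (by1 - by0)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

Two 10x10 boxes offset by 5 pixels in both directions overlap in a 5x5 square, so `iou((0, 0, 10, 10), (5, 5, 15, 15))` is 25 / 175, or 1/7.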

Digital Library: EI

Published Online: January 2020

Relation Between Image Quality and Scan Resolution: Part I

Image quality
Resolution
Relation

Zhenhua Hu, Litao Hu, Peter Bauer, Todd Harris, Jan Allebach

DOI

10.2352/ISSN.2470-1173.2020.9.IQSP-322

Volume 32

Issue 9

Currently, a particular scan resolution has to be defined before a scanner starts working. Two problems arise from this process. First, no matter how different the contents of two pages are, they will be scanned at the same resolution. For example, after scanning, a blank page and a finely detailed drawing will have the same resolution. Second, within one scanned page, every part of the output has the same resolution, whatever its content. These problems waste memory used to store scanned images, so a method to decide the minimum acceptable scan resolution is needed. However, current image quality estimators are not suitable for estimating image quality at different resolutions. This paper proposes four features to assess image quality at different resolutions, namely 75, 100, 150, 200, and 300 dpi. The features are tile-SSIM mean, tile-SSIM standard deviation, horizontal transition density, and vertical transition density. Tests on images with different contents show that these features are promising for evaluating image quality across different scan resolutions.
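The abstract names horizontal and vertical transition density without defining them. One plausible reading, sketched below purely as an assumption, is to binarize the page and measure the fraction of neighboring pixel pairs whose values differ along rows (horizontal) and along columns (vertical). The threshold of 128, the function name, and the definition itself are hypothetical and may differ from the paper's features.

```python
import numpy as np


def transition_densities(gray, threshold=128):
    """Return (horizontal, vertical) transition densities of a grayscale page.

    Assumed definition: after thresholding, the density is the fraction of
    adjacent pixel pairs that differ, taken left-to-right for the horizontal
    value and top-to-bottom for the vertical value. The image must be at
    least 2x2 pixels.
    """
    binary = np.asarray(gray) >= threshold
    horizontal = np.mean(binary[:, 1:] != binary[:, :-1])
    vertical = np.mean(binary[1:, :] != binary[:-1, :])
    return float(horizontal), float(vertical)
```

Under this reading a blank page scores 0 in both directions, while a page of one-pixel-wide vertical stripes scores 1 horizontally and 0 vertically, which matches the intuition that detailed content needs a higher scan resolution than empty content.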

Digital Library: EI

Published Online: January 2020

A blind mesh visual quality assessment method based on convolutional neural network

3D mesh
Image quality
Convolutional neural network

Ilyass Abouelaziz, Aladine Chetouani, Mohammed El Hassouni, Hocine Cherifi

DOI

10.2352/ISSN.2470-1173.2018.18.3DIPM-423

Volume 30

Issue 18