Single image dehazing is important in intelligent vision systems. The dark channel prior (DCP) is invalid in bright areas, such as sky regions, and causes the recovered image to suffer severe distortion there. We therefore propose a novel dehazing method based on transmission map segmentation and prior knowledge. First, we divide the hazy input into bright and non-bright areas, estimate the transmission map via the DCP in the non-bright areas, and propose a transmission map compensation function to correct it in the bright areas. We then fuse the DCP and the bright channel prior (BCP) to accurately estimate the atmospheric light, and finally restore the clear image according to the physical model. Experiments show that our method solves the DCP distortion problem in bright regions well and is competitive with state-of-the-art methods.
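To make the pipeline concrete, below is a minimal NumPy sketch of the classic DCP dehazing baseline that the method extends. The bright-area mask, the compensation term, and the atmospheric-light estimate here are illustrative placeholders, not the paper's segmentation rule, compensation function, or DCP/BCP fusion.

```python
import numpy as np
from scipy.ndimage import minimum_filter

def dark_channel(img, patch=15):
    """Per-pixel minimum over RGB, then a local minimum filter (He et al.)."""
    return minimum_filter(img.min(axis=2), size=patch)

def dehaze(img, patch=15, omega=0.95, t0=0.1, bright_thresh=0.8):
    dark = dark_channel(img, patch)
    # Atmospheric light A: mean colour of the brightest 0.1% dark-channel pixels
    # (the paper instead fuses DCP and BCP for this estimate).
    flat = dark.ravel()
    idx = np.argsort(flat)[-max(1, flat.size // 1000):]
    A = img.reshape(-1, 3)[idx].mean(axis=0)
    # DCP transmission estimate, valid in non-bright regions.
    t = 1.0 - omega * dark_channel(img / A, patch)
    # Hypothetical compensation in bright regions, where the DCP
    # under-estimates transmission and over-corrects.
    bright = img.mean(axis=2) > bright_thresh
    t = np.where(bright, np.clip(t + 0.3 * (1.0 - t), 0.0, 1.0), t)
    # Invert the atmospheric scattering model I = J*t + A*(1 - t).
    t = np.clip(t, t0, 1.0)[..., None]
    return np.clip((img - A) / t + A, 0.0, 1.0)
```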
Model-based approaches to imaging, such as specialized image enhancements in astronomy, facilitate explanations of relationships between observed inputs and computed outputs. These models may be expressed with extended matrix-vector (EMV) algebra, especially when they involve only scalars, vectors, and matrices, and with n-mode or index notations, when they involve multidimensional arrays, also called numeric tensors or, simply, tensors. Although this paper features an example, inspired by exoplanet imaging, that employs tensors to reveal (inverse) 2D fast Fourier transforms in an image enhancement model, the work is actually about the tensor algebra and software, or tensor frameworks, available for model-based imaging. The paper proposes a Ricci-notation tensor (RT) framework, comprising a dual-variant index notation, with Einstein summation convention, and codesigned object-oriented software, called the RTToolbox for MATLAB. Extensions to Ricci notation offer novel representations for entrywise, pagewise, and broadcasting operations popular in EMV frameworks for imaging. Complementing the EMV algebra computable with MATLAB, the RTToolbox demonstrates programmatic and computational efficiency via careful design of numeric tensor and dual-variant index classes. Compared to its closest competitor, also a numeric tensor framework that uses index notation, the RT framework enables superior ways to model imaging problems and, thereby, to develop solutions.
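As a small illustration of the kind of index-notation modeling the RT framework targets, the snippet below expresses a 2D discrete Fourier transform as an Einstein-summation contraction over a matrix "tensor". RTToolbox itself is a MATLAB framework; np.einsum is used here only as a widely available stand-in for the notation.

```python
import numpy as np

def dft_matrix(n):
    """DFT matrix F with F[p, m] = exp(-2*pi*i*p*m/n)."""
    k = np.arange(n)
    return np.exp(-2j * np.pi * np.outer(k, k) / n)

X = np.random.rand(8, 8)              # an image "tensor" X_{mn}
F = dft_matrix(8)

# Index form: Y_{pq} = F_{pm} X_{mn} F_{qn} (summation over repeated m, n).
Y = np.einsum('pm,mn,qn->pq', F, X, F)

assert np.allclose(Y, np.fft.fft2(X))  # agrees with the fast transform
```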
With rising quality requirements in industrial production, surface inspection of workpieces has become an indispensable step in manufacturing. Traditional methods for textured paper inspection suffer from low efficiency and large error; we therefore propose a machine-vision inspection method with a composite "photometric stereo vision + fast Fourier enhancement + feature fusion" structure. First, because images from a traditional CCD camera exhibit obvious noise, and scratches are difficult to distinguish from the background texture, we apply a photometric stereo vision algorithm to recover the surface gradient of the textured paper and thereby obtain richer gradient texture information; a secondary enhancement of the image is then achieved via Fourier transforms in the spatial and frequency domains. Second, because scratches on textured paper are hard to detect, their features hard to extract, and the threshold boundary hard to define, we propose dynamic threshold segmentation with multi-feature fusion to detect surface scratches on textured paper. We evaluated the method on more than 300 different textured papers; the results show that the proposed composite structure detection method is feasible and advantageous.
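A hedged sketch of the first two stages follows: least-squares photometric stereo for surface gradients, and a simple Fourier-domain high-boost filter. The light directions, filter shape, and gain are illustrative assumptions, and the multi-feature fusion stage is omitted.

```python
import numpy as np

def photometric_stereo(images, lights):
    """images: (k, H, W) intensities; lights: (k, 3) unit light directions.
    Returns surface gradients (p, q) = (-nx/nz, -ny/nz)."""
    k, H, W = images.shape
    I = images.reshape(k, -1)                        # (k, H*W)
    G, *_ = np.linalg.lstsq(lights, I, rcond=None)   # G = albedo * normal
    nx, ny, nz = G.reshape(3, H, W)
    nz = np.where(np.abs(nz) < 1e-6, 1e-6, nz)
    return -nx / nz, -ny / nz

def fourier_high_boost(img, cutoff=0.1, gain=2.0):
    """Boost high spatial frequencies to emphasize scratches over texture."""
    h, w = img.shape
    F = np.fft.fftshift(np.fft.fft2(img))
    yy, xx = np.mgrid[-h // 2:h - h // 2, -w // 2:w - w // 2]
    r = np.hypot(yy / (h / 2), xx / (w / 2))         # normalized radius
    mask = 1.0 + (gain - 1.0) * (r > cutoff)         # keep DC, amplify high freq.
    return np.real(np.fft.ifft2(np.fft.ifftshift(F * mask)))
```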
A single tone curve that globally remaps the brightness of each pixel is one of the simplest ways to enhance an image. Tone curves might result from individual user edits or from algorithmic processing, including in-camera processing pipelines. The precise shape of the tone curve is not strongly constrained other than that it is usually limited to increasing functions of brightness. In this paper we constrain the shape further and define a simple tone adjustment, mathematically, to be a tone curve that has either no or one inflexion point. It follows that a complex tone curve is one with more than one inflexion point, visually making the curve appear ‘wiggly’. Empirically, complex tone curves do not seem to be used very often. For any given tone curve we show how the closest simple approximation can be efficiently found. We apply our approximation method to the MIT-Adobe FiveK dataset, which comprises 5000 images that are manually tone-edited by 5 experts. For all 25,000 edited images, some of whose tone adjustments are complex, we find that they are all well approximated by simple tone curve adjustments.
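The simple-versus-complex criterion is easy to operationalize; the sketch below counts inflexion points of a sampled increasing tone curve as sign changes in its second difference. The smoothing and the noise threshold are illustrative assumptions.

```python
import numpy as np

def count_inflexions(curve):
    """Count sign changes of the second difference of a sampled curve."""
    d2 = np.diff(curve, n=2)
    d2 = np.convolve(d2, np.ones(5) / 5, mode='valid')  # light smoothing
    eps = 0.01 * np.max(np.abs(d2))                     # ignore near-zero noise
    signs = np.sign(d2[np.abs(d2) > eps])
    return int(np.sum(signs[1:] != signs[:-1]))

x = np.linspace(0, 1, 256)
gamma = x ** 0.45                           # no inflexion  -> simple
s_curve = 0.5 - 0.5 * np.cos(np.pi * x)     # one inflexion -> still simple
wiggly = x + 0.05 * np.sin(6 * np.pi * x)   # five inflexions -> complex

for c in (gamma, s_curve, wiggly):
    print(count_inflexions(c))
```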
Traditionally, the appearance of an object in an image is edited to elicit a preferred perception. However, the editing method might be arbitrary and might not consider the human perception mechanism. In this study, the authors explored image-based leather “authenticity” editing using an estimation model that considers a perception mechanism derived in their previous work. They created leather rendered images by emphasizing or suppressing image properties corresponding to the “authenticity.” Subsequently, they performed two subjective experiments, one using fully edited images and another using partially edited images whose specular reflection intensity was constant. Participants observed the leather rendered images and evaluated the differences in the perception of “authenticity.” The authors found that the “authenticity” perception could be changed by manipulating the intensity of specular reflection and the texture (grain and surface irregularity) in the images. The results of this study could be used to tune the properties of images to make them more appealing.
In addition to colors and shapes, factors of material appearance such as glossiness, translucency, and roughness are important for reproducing the realistic feeling of an image. In general, these perceptual qualities are often degraded when reproduced as a digital color image. The authors have aimed to edit the material appearance of an image captured by a general camera and reproduce it on a general display device. In a previous study, the authors found that the pupil diameter decreases slightly when observing the surface properties of an object and proposed an algorithm called “PuRet” for enhancing the material appearance based on physiological models of the pupil and retina. However, accurate reproduction required manually adjusting two adaptation parameters in PuRet, related to the retinal response, for each scene and for the particular characteristics of the display device. This study realizes material appearance management on display devices by automatically deriving the optimum PuRet parameters from captured RAW image data. The results indicate that the authors succeeded in estimating one adaptation parameter from the median scene luminance estimated from a RAW image, and the other from the average scene luminance together with the luminance contrast of the output display device. An experiment using an unknown display device, one not used to derive the estimation model, confirmed that the proposed model works properly.
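Schematically, the parameter derivation reduces to mapping scene statistics to the two adaptation parameters, as in the sketch below. The regression coefficients are placeholders rather than the paper's fitted values, and the input is assumed to be a linear luminance array already derived from the RAW data.

```python
import numpy as np

def estimate_puret_params(raw_luminance, display_contrast,
                          a1=1.0, b1=0.0, a2=1.0, b2=0.0, c2=0.0):
    """Hypothetical mappings from scene statistics to the two parameters."""
    med = np.median(raw_luminance)   # drives the first adaptation parameter
    avg = np.mean(raw_luminance)     # drives the second, with display contrast
    p1 = a1 * med + b1               # placeholder fitted relation
    p2 = a2 * avg + b2 * display_contrast + c2
    return p1, p2
```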
Nonlinear complementary metal-oxide semiconductor (CMOS) image sensors (CISs), such as logarithmic (log) and linear-logarithmic (linlog) sensors, achieve high/wide dynamic ranges in single exposures at video frame rates. As with linear CISs, fixed pattern noise (FPN) correction and salt-and-pepper noise (SPN) filtering are required to achieve high image quality. This paper presents a method to generate digital integrated circuits, suitable for any monotonic nonlinear CIS, to correct FPN in hard real time. It also presents a method to generate digital integrated circuits, suitable for any monochromatic nonlinear CIS, to filter SPN in hard real time. The methods are validated by implementing and testing generated circuits using field-programmable gate array (FPGA) tools from both Xilinx and Altera. Generated circuits are shown to be efficient, in terms of logic elements, memory bits, and power consumption. Scalability of the methods to full high-definition (FHD) video processing is also demonstrated. In particular, FPN correction and SPN filtering of over 140 megapixels per second are feasible, in hard real time, irrespective of the degree of nonlinearity.
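A software analogue of the two processing stages may help fix ideas; the circuits themselves are generated for FPGAs, and the per-pixel offset/gain calibration model below is an illustrative assumption (the paper's method accommodates any monotonic nonlinear response).

```python
import numpy as np
from scipy.ndimage import median_filter

def correct_fpn(frame, offset, gain):
    """frame, offset, gain: (H, W) arrays from a per-pixel calibration.
    Subtracting an offset and scaling models a simple two-point correction."""
    return (frame - offset) * gain

def filter_spn(frame, size=3):
    """A small median filter removes isolated salt-and-pepper outliers."""
    return median_filter(frame, size=size)
```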
We suggest a method for sharpening an image or video stream without using convolution, as in unsharp masking, or deconvolution, as in constrained least-squares filtering. Instead, our technique is based on a local analysis of phase congruency and hence focuses on perceptually important details. The image is partitioned into overlapping tiles, and is processed tile by tile. We perform a Fourier transform for each of the tiles, and define congruency for each of the components in such a way that it is large when the component's neighbours are aligned with it, and small otherwise. We then amplify weak components with high phase congruency and reduce strong components with low phase congruency. Following this method, we avoid strengthening the Fourier components corresponding to sharp edges, while amplifying those details that underwent a slight or moderate defocus blur. The tiles are then seamlessly stitched. As a result, the image sharpness is improved wherever perceptually important details are present.
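One plausible single-tile reading of the scheme is sketched below: each Fourier component's congruency is taken as the resultant length of the unit phasors in its 3×3 frequency neighbourhood, and the gain boosts weak-but-congruent components while damping strong-but-incongruent ones. The gain rule, the clipping range, and the omission of overlapping tiles and stitching are simplifying assumptions.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def sharpen_tile(tile, alpha=0.5, eps=1e-8):
    F = np.fft.fft2(tile)
    mag = np.abs(F)
    phasor = F / (mag + eps)                        # unit phasors
    # Resultant length of neighbouring phasors: 1 = aligned, ~0 = random.
    cong = np.abs(uniform_filter(phasor.real, 3) +
                  1j * uniform_filter(phasor.imag, 3))
    rel = mag / (uniform_filter(mag, 3) + eps)      # strength vs. neighbours
    gain = (cong / (rel + eps)) ** alpha            # weak & congruent -> gain > 1
    return np.real(np.fft.ifft2(F * np.clip(gain, 0.5, 2.0)))
```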
In this paper we describe two general parametric, non-symmetric 3×3 gradient models. Equations for calculating the coefficients of the gradient matrices are presented. These models for generating gradients in the x-direction include the known gradient operators as well as new operators that can be used in graphics, computer vision, robotics, imaging systems, visual surveillance, object enhancement, edge detection, and classification. The presented approach can easily be extended to larger windows.
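For illustration, a familiar one-parameter-per-row family of x-direction gradient kernels is sketched below; Prewitt (1, 1, 1), Sobel (1, 2, 1), and Scharr (3, 10, 3) arise as special cases, and unequal row weights give non-symmetric kernels. This is a textbook construction, not a reproduction of the paper's two general models.

```python
import numpy as np
from scipy.signal import convolve2d

def gradient_kernel_x(p, q, r):
    """Separable 3x3 x-gradient kernel: smoothing (p, q, r) across rows,
    central difference (-1, 0, 1) along columns."""
    s = np.array([[p], [q], [r]], dtype=float)
    d = np.array([[-1.0, 0.0, 1.0]])
    return s @ d

sobel_x = gradient_kernel_x(1, 2, 1)   # [[-1,0,1],[-2,0,2],[-1,0,1]]
img = np.random.rand(64, 64)
gx = convolve2d(img, sobel_x, mode='same', boundary='symm')
```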