IS&T | Library

Supervised Reconstruction for Silhouette Tomography

Abstract

In this paper, we explore a space-time geometric view of signal representation in machine learning models. The question we are interested in is whether we can identify what is causing signal representation errors – training data inadequacies, model insufficiencies, or both. Loosely expressed, this problem is stylistically similar to blind deconvolution problems. However, studies of space-time geometries might be able to partially solve this problem by considering the curvature produced by mass in (Anti-)de Sitter space. We study the effectiveness of our approach on the MNIST dataset.

Digital Library: EI

Published Online: February 2025

Proceedings

3D imaging
Binary image
Deep learning
Neural network
Supervised learning
U-Net
X-ray CT

Evan Bell, Michael T. McCann, Marc Klasky

DOI

10.2352/EI.2024.36.5.MLSI-298

Volume 36

Issue 5

Using simulation to quantify the performance of automotive perception systems

Abstract

In this paper, we introduce silhouette tomography, a novel formulation of X-ray computed tomography that relies only on the geometry of the imaging system. We formulate silhouette tomography mathematically and provide a simple method for obtaining a particular solution to the problem, assuming that any solution exists. We then propose a supervised reconstruction approach that uses a deep neural network to solve the silhouette tomography problem. We present experimental results on a synthetic dataset that demonstrate the effectiveness of the proposed method.

Digital Library: EI

Published Online: January 2024

Article

Autonomous driving
Image systems simulation
Automotive perception system
Neural network

Zhenyi Liu, Devesh Shah, Alireza Rahimpour, Devesh Upadhyay, Joyce Farrell, Brian Wandell

DOI

10.2352/EI.2023.35.16.AVM-118

Volume 35

Issue 16

Abstract

The design and evaluation of complex systems can benefit from a software simulation, sometimes called a digital twin. The simulation can be used to characterize system performance or to test its performance under conditions that are difficult to measure (e.g., nighttime for automotive perception systems). We describe the image system simulation software tools that we use to evaluate the performance of image systems for object (automobile) detection. We describe experiments with 13 different cameras with a variety of optics and pixel sizes. To measure the impact of camera spatial resolution, we designed a collection of driving scenes that had cars at many different distances. We quantified system performance by measuring average precision, and we report a trend relating system resolution and object detection performance. We also quantified the large performance degradation under nighttime conditions, compared to daytime, for all cameras and a COCO pre-trained network.

Digital Library: EI

Published Online: January 2023

7T MRI super-resolution with Generative Adversarial Network

GAN
Medical imaging
MRI
Neural network
Super-resolution

Huy Do, Pascal Bourdon, David Helbert, Mathieu Naudin, Remy Guillevin

Pages 106-1 - 106-7, January 2021, © Society for Imaging Science and Technology 2021

DOI

10.2352/ISSN.2470-1173.2021.18.3DIA-106

Volume 33

Issue 18

High-resolution magnetic resonance imaging (MRI) provides detailed anatomical information critical for clinical diagnosis. However, high-resolution MRI typically comes at the cost of long scan time, small spatial coverage, and low signal-to-noise ratio. Convolutional neural networks (CNNs) can be applied to the super-resolution task to recover high-resolution generic images from low-resolution inputs. Additionally, recent studies have shown the potential of generative adversarial networks (GANs) to generate high-quality super-resolution MRIs using learned image priors. Moreover, existing approaches require paired MRI images as training data, which are difficult to obtain with existing datasets when the alignment between high- and low-resolution images has to be done manually. This paper implements two different GAN-based models to handle super-resolution: Enhanced super-resolution GAN (ESRGAN) and CycleGAN. Unlike the generic model, the architecture of CycleGAN is modified to solve super-resolution on unpaired MRI data, and ESRGAN is implemented as a reference for comparing the performance of GAN-based methods. The GAN-based models generate high-resolution images with rich textures compared to the ground truth. Experiments are performed on both 3T and 7T MRI images, recovering different scales of resolution.

Digital Library: EI

Published Online: January 2021

Decision-making on image denoising expedience

Denoising efficiency
Denoising expedience
BM3D
Performance prediction
AWGN
Image quality
Neural network

Andrii Rubel, Oleksii Rubel, Vladimir Lukin, Karen Egiazarian

Pages 237-1 - 237-7, January 2021, © Society for Imaging Science and Technology 2021

DOI

10.2352/ISSN.2470-1173.2021.10.IPAS-237

Volume 33

Issue 10

Image denoising is a classical preprocessing stage used to enhance images. However, it is well known that there are many practical cases where different image denoising methods produce images with inappropriate visual quality, which makes applying image denoising useless. Because of this, it is desirable to detect such cases in advance and decide how expedient image denoising (filtering) is. This paper analyzes the problem for the case of the well-known BM3D denoiser. We propose an algorithm for decision-making on image denoising expedience for images corrupted by additive white Gaussian noise (AWGN). We also propose an algorithm that predicts subjective visual quality scores for denoised images using a trained artificial neural network. It is shown that this prediction is fast and accurate.

Digital Library: EI

Published Online: January 2021

10.2352/ISSN.2470-1173.2021.10.IPAS-238

Similarity search
Similarity metrics
Image noise
AWGN
Signal-dependent noise
Neural network

Oleksii Rubel, Rostyslav Tsekhmystro, Vladimir Lukin, Karen Egiazarian

Pages 238-1 - 238-7, January 2021, © Society for Imaging Science and Technology 2021

DOI

Volume 33

Issue 10

Similarity search in images has become a typical operation in many applications. The presence of noise in images greatly affects the correctness of detection of similar image blocks, reducing the efficiency of image processing methods such as non-local denoising. In this paper, we study the noise immunity of various distance measures (similarity metrics). Taking into account the wide variety of information content in real-life images and variations in noise type and intensity, we propose a set of test data and obtain preliminary results for several typical cases of image and noise properties. Recommendations for metric and threshold selection are given. A fast implementation of the proposed benchmark is realized using CUDA technology.
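As a rough illustration of why noise affects block matching, the sketch below corrupts two copies of the same block with independent AWGN and measures their squared Euclidean distance. The block contents, block size, noise level, and function names are all hypothetical, and squared Euclidean distance is only one possible distance measure, not necessarily one studied in the paper. For independent noise with standard deviation sigma, the expected squared distance between two noisy copies of an n-pixel block is 2·n·sigma², so identical blocks no longer have zero distance.

```python
import random

def add_awgn(block, sigma, rng):
    """Return a copy of `block` with zero-mean Gaussian noise of std `sigma`."""
    return [p + rng.gauss(0.0, sigma) for p in block]

def squared_euclidean(a, b):
    """Squared Euclidean distance between two equally sized blocks."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

rng = random.Random(0)                    # fixed seed for repeatability
block = [float(v) for v in range(64)]     # hypothetical flattened 8x8 block
sigma = 5.0                               # hypothetical noise standard deviation
trials = 2000

# Average distance between two independently corrupted copies of one block.
mean_dist = sum(
    squared_euclidean(add_awgn(block, sigma, rng), add_awgn(block, sigma, rng))
    for _ in range(trials)
) / trials

# Theoretical expectation: each pixel difference has variance 2 * sigma**2.
expected = 2 * len(block) * sigma ** 2

print(mean_dist, expected)
```

A similarity threshold chosen without accounting for this noise-induced offset would reject blocks that are in fact identical, which is one reason threshold selection depends on noise intensity.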

Digital Library: EI

Published Online: January 2021

Neural network-based assessment of the impact induced in video quality assessment by the semantic labels

Subjective quality evaluation
Semantic labeled grading scales
2D and 3D video
Neural network

C. Hernandez, Z. De La Lande Dolce, R. Bensaied, M. Mitrea

DOI

10.2352/ISSN.2470-1173.2021.9.IQSP-224

Volume 33

Issue 9

Subjective video quality assessment generally relies on semantically labeled evaluation scales (e.g., Excellent, Good, Fair, Poor and Bad on a single-stimulus, 5-level grading scale). While suspicions about a possible bias that these labels induce in the quality evaluation always arise, to the best of our knowledge, very few state-of-the-art studies target an objective assessment of such an impact. Our study presents a neural network solution in this respect. We designed a 5-class classifier with 2 hidden layers and a softmax output layer. An Adam optimizer coupled with a sparse categorical cross-entropy loss function is then used. The experimental results are obtained by processing a database composed of 440 observers scoring about 7 hours of video content of 4 types (high-quality stereoscopic video content, low-quality stereoscopic video content, high-quality 2D video, and low-quality 2D video). The experimental results are discussed and compared with the reference given by a probability-based estimation method. They show an overall good convergence between the two types of methods while pointing to some application-specific differences that are discussed and explained.
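To make the output stage of such a classifier concrete, here is a minimal sketch of a softmax layer followed by a sparse categorical cross-entropy loss for a 5-level grading scale. It is not the authors' implementation: the label order, the example logits, and the function names are assumptions for illustration, and the hidden layers and the Adam update are omitted.

```python
import math

# Assumed label order for the 5-level scale; the integer index is the class id.
LABELS = ["Bad", "Poor", "Fair", "Good", "Excellent"]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sparse_categorical_cross_entropy(probs, label):
    """Loss for one sample whose true class is the integer index `label`."""
    return -math.log(probs[label])

# Hypothetical logits from the last hidden layer: no class is preferred.
logits = [0.0, 0.0, 0.0, 0.0, 0.0]
probs = softmax(logits)

# The true class is given as an integer index ("sparse"), not a one-hot vector.
loss = sparse_categorical_cross_entropy(probs, LABELS.index("Good"))

print(probs, loss)
```

With uniform logits every class receives probability 1/5, so the loss equals ln 5; training would lower it by shifting probability mass toward the observed score.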

Digital Library: EI

Published Online: January 2021

Learning based demosaicing and color correction for RGB-IR patterned image sensors

RGB-IR
Demosaicing
Color correction
Neural network
Machine learning

Navinprashath R R, Radhesh Bhat

DOI

10.2352/ISSN.2470-1173.2019.15.AVM-045

Volume 31

Issue 15

An RGB-IR sensor combines the capabilities of an RGB sensor and an IR sensor in a single sensor. However, the additional IR pixel in the RGB-IR sensor reduces the effective number of pixels allocated to the visible region, introducing aliasing artifacts due to demosaicing. Also, the presence of IR content in the R, G and B channels poses new challenges for accurate color reproduction. Sharpness and color reproduction are very important image quality factors for visual aesthetics as well as for computer vision algorithms. The demosaicing and color correction modules are integral parts of any color image processing pipeline and are responsible for sharpness and color reproduction, respectively. The image processing pipeline has not been fully explored for RGB-IR sensors. We propose a neural network-based approach for demosaicing and color correction for RGB-IR patterned image sensors. In our experimental results, we show that our learning-based approach performs better than the existing demosaicing and color correction methods.

Digital Library: EI

Published Online: January 2019

A comparative study on wavelets and residuals in deep super resolution

Super resolution
Neural network
Deep learning
Wavelets

Ruofan Zhou, Fayez Lahoud, Majed El Helou, Sabine Süsstrunk

DOI

10.2352/ISSN.2470-1173.2019.13.COIMG-135

Volume 31

Issue 13