Pages A10-1 - A10-6,  © Society for Imaging Science and Technology 2021
Digital Library: EI
Published Online: January  2021
Pages 226-1 - 226-6,  © Society for Imaging Science and Technology 2021
Volume 33
Issue 10

Finding a point in the intersection of two closed convex sets is a common problem in image processing and other areas. Projections onto convex sets (POCS) is a standard algorithm for finding such a point. Dykstra's projection algorithm is a well-known alternative that finds the point in the intersection closest to a given point. A lesser-known alternative is the alternating direction method of multipliers (ADMM), which can be used for both purposes. In this paper we discuss the differences in the convergence of these algorithms in image processing problems. ADMM applied to finding an arbitrary point in the intersection converges much faster than POCS and than any algorithm for finding the nearest point in the intersection.
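The contrast between finding *some* feasible point (POCS) and the *nearest* feasible point (Dykstra) can be illustrated on a toy problem. The two convex sets below (the nonnegative orthant and a halfspace) are illustrative choices, not the constraints studied in the paper:

```python
import numpy as np

def proj_nonneg(x):
    # Projection onto the convex set A = {x : x >= 0}
    return np.maximum(x, 0.0)

def proj_halfspace(x, a, b):
    # Projection onto the halfspace B = {x : <a, x> <= b}
    viol = a @ x - b
    if viol <= 0:
        return x
    return x - (viol / (a @ a)) * a

def pocs(x0, pA, pB, iters=200):
    # Alternating projections: converges to *some* point of A ∩ B
    x = x0
    for _ in range(iters):
        x = pB(pA(x))
    return x

def dykstra(x0, pA, pB, iters=200):
    # Dykstra's algorithm: converges to the point of A ∩ B *nearest* x0
    x = x0
    p = np.zeros_like(x0)
    q = np.zeros_like(x0)
    for _ in range(iters):
        y = pA(x + p)
        p = x + p - y          # correction term for set A
        x = pB(y + q)
        q = y + q - x          # correction term for set B
    return x

a = np.ones(3)
x0 = np.array([2.0, -1.0, 0.5])
pA = proj_nonneg
pB = lambda x: proj_halfspace(x, a, 1.0)
x_pocs = pocs(x0, pA, pB)
x_dyk = dykstra(x0, pA, pB)
```

Both routines return a point satisfying both constraints; only Dykstra's is guaranteed closest to the starting point.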

Pages 234-1 - 234-8,  © Society for Imaging Science and Technology 2021

Vehicle re-identification (re-ID) is based on identity matching of vehicles across non-overlapping camera views. Recently, research on vehicle re-ID has attracted increasing attention, mainly due to its prominent industrial applications, such as post-crime analysis, traffic flow analysis, and wide-area vehicle tracking. Despite this interest, the problem remains challenging. One of the most significant difficulties of vehicle re-ID is the large viewpoint variation caused by non-standardized camera placements. In this study, to improve re-ID robustness against viewpoint variations while preserving algorithm efficiency, we exploit vehicle orientation information. First, we analyze and benchmark various deep learning architectures in terms of performance, memory use, and computational cost for orientation classification. Second, the extracted orientation information is used to improve the vehicle re-ID task: we propose a viewpoint-aware multi-branch network that improves vehicle re-ID performance without increasing the forward inference time. Third, we introduce a viewpoint-aware mini-batching approach that yields improved training and higher re-ID performance. The experiments show an increase of 4.0% mAP and 4.4% rank-1 score on the popular VeRi dataset with the proposed mini-batching strategy, and overall an increase of 2.2% mAP and 3.8% rank-1 score compared to the ResNet-50 baseline.
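One way to realize viewpoint-aware mini-batching is a PK-style sampler that, for each sampled identity, prefers instances from distinct orientation bins. The sketch below is a hypothetical strategy; the abstract does not spell out the exact scheme, and all names are illustrative:

```python
import random
from collections import defaultdict

def viewpoint_aware_batch(samples, ids_per_batch=4, instances_per_id=4, seed=0):
    """Sample a P x K mini-batch, preferring distinct orientation bins per identity.

    samples: list of (vehicle_id, orientation_bin, image_path) tuples.
    Returns a list of P*K sample tuples.
    """
    rng = random.Random(seed)
    by_id = defaultdict(lambda: defaultdict(list))
    for vid, ori, img in samples:
        by_id[vid][ori].append((vid, ori, img))
    batch = []
    for vid in rng.sample(sorted(by_id), ids_per_batch):
        bins = list(by_id[vid])
        rng.shuffle(bins)
        picked = []
        # First pass: one sample from each distinct orientation bin.
        for b in bins[:instances_per_id]:
            picked.append(rng.choice(by_id[vid][b]))
        # Top up from any bin if the identity has fewer bins than needed.
        pool = [s for b in bins for s in by_id[vid][b]]
        while len(picked) < instances_per_id:
            picked.append(rng.choice(pool))
        batch.extend(picked)
    return batch
```

Feeding a metric-learning loss batches that span viewpoints per identity is a plausible way to make the learned embedding orientation-robust.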

Pages 235-1 - 235-7,  © Society for Imaging Science and Technology 2021

In this paper, text recognition on variably curved cardboard pharmaceutical packages is studied from the photometric stereo imaging point of view, with a focus on developing a method for binarizing the expiration date and batch code texts. Adaptive filtering, specifically the Wiener filter, is used together with a haze-removal algorithm and fusion of LoG edge-detected sub-images, resulting in an Otsu-thresholded binary image of the expiration date and batch code texts for further analysis. The results presented appear promising for text binarization. Successful binarization is crucial for text character segmentation and subsequent automatic reading. Furthermore, some new ideas are presented that will be used in our future research work.
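Two classical ingredients of such a pipeline, a local adaptive (wiener2-style) filter and Otsu's threshold, can be sketched on a synthetic "text on cardboard" image. This is a sketch only: the haze-removal and LoG-fusion stages are omitted, and the synthetic image is an assumption:

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def wiener2(img, ksize=3, noise_var=None):
    """Local adaptive Wiener filter (in the spirit of MATLAB's wiener2)."""
    pad = ksize // 2
    p = np.pad(img, pad, mode="reflect")
    win = sliding_window_view(p, (ksize, ksize))
    mu = win.mean(axis=(-1, -2))           # local mean
    var = win.var(axis=(-1, -2))           # local variance
    if noise_var is None:
        noise_var = var.mean()             # common heuristic noise estimate
    gain = np.maximum(var - noise_var, 0.0) / np.maximum(var, 1e-12)
    return mu + gain * (img - mu)

def otsu_threshold(img, bins=256):
    """Threshold maximizing the between-class variance (Otsu's criterion)."""
    hist, edges = np.histogram(img.ravel(), bins=bins)
    hist = hist.astype(float)
    centers = (edges[:-1] + edges[1:]) / 2.0
    w = np.cumsum(hist)                    # cumulative class-0 weight
    m = np.cumsum(hist * centers)          # cumulative class-0 mass
    total_w, total_m = w[-1], m[-1]
    with np.errstate(divide="ignore", invalid="ignore"):
        sigma_b = (total_m * w - m * total_w) ** 2 / (w * (total_w - w))
    k = np.nanargmax(sigma_b)
    return (edges[k] + edges[k + 1]) / 2.0
```

Denoising before thresholding keeps noise from bleeding across Otsu's two-class split, which is what makes the subsequent character segmentation workable.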

Pages 237-1 - 237-7,  © Society for Imaging Science and Technology 2021

Image denoising is a classical preprocessing stage used to enhance images. However, it is well known that in many practical cases different image denoising methods produce images with inappropriate visual quality, which makes the application of image denoising useless. Because of this, it is desirable to detect such cases in advance and decide whether image denoising (filtering) is expedient. This problem is analyzed in this paper for the well-known BM3D denoiser. We propose a decision-making algorithm for image denoising expedience for images corrupted by additive white Gaussian noise (AWGN). An algorithm for predicting subjective visual quality scores of denoised images using a trained artificial neural network is proposed as well. It is shown that this prediction is fast and accurate.
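A much simpler stand-in for the trained predictor illustrates the decision itself: estimate the AWGN level blindly, then declare denoising expedient only when noise is non-negligible relative to image content. The difference-based estimator and the thresholds below are illustrative assumptions, not the paper's method:

```python
import numpy as np

def estimate_noise_sigma(img):
    # Robust AWGN std estimate from half-resolution diagonal differences
    # (a cheap stand-in for the wavelet-domain MAD estimator).
    a = img[0::2, 0::2]; b = img[0::2, 1::2]
    c = img[1::2, 0::2]; d = img[1::2, 1::2]
    h = min(a.shape[0], b.shape[0], c.shape[0], d.shape[0])
    w = min(a.shape[1], b.shape[1], c.shape[1], d.shape[1])
    diff = (a[:h, :w] - b[:h, :w] - c[:h, :w] + d[:h, :w]) / 2.0
    return np.median(np.abs(diff)) / 0.6745

def denoising_expedient(img, sigma_min=2.0, snr_max=30.0):
    """Heuristic stand-in for the paper's trained predictor: denoise only
    if the estimated noise is non-negligible relative to image content."""
    sigma = estimate_noise_sigma(img)
    if sigma < sigma_min:          # noise too weak to matter
        return False
    signal_std = img.std()
    snr_db = 20 * np.log10(max(signal_std, 1e-9) / sigma)
    return snr_db < snr_max        # denoise unless SNR is already high
```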

Pages 238-1 - 238-7,  © Society for Imaging Science and Technology 2021

Similarity search in images has become a typical operation in many applications. The presence of noise in images greatly affects the correctness of detecting similar image blocks, reducing the efficiency of image processing methods such as non-local denoising. In this paper, we study the noise immunity of various distance measures (similarity metrics), taking into account the wide variety of information content in real-life images and the variations in noise type and intensity. We propose a set of test data and obtain preliminary results for several typical cases of image and noise properties. Recommendations for metric and threshold selection are given. A fast implementation of the proposed benchmark is realized using CUDA.
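A toy version of such a noise-immunity probe compares common metrics on a synthetic patch, a distractor, and AWGN of increasing strength. The setup is illustrative, not the paper's test data:

```python
import numpy as np

def l1(a, b):  return np.mean(np.abs(a - b))
def l2(a, b):  return np.sqrt(np.mean((a - b) ** 2))

def ncc(a, b):
    # Normalized cross-correlation, mapped to a distance in [0, 2]
    a0, b0 = a - a.mean(), b - b.mean()
    denom = np.sqrt((a0 ** 2).sum() * (b0 ** 2).sum())
    return 1.0 - (a0 * b0).sum() / max(denom, 1e-12)

def match_rate(metric, patch, distractor, sigma, trials=200, seed=0):
    """Fraction of trials in which a noisy copy of `patch` is judged
    closer to `patch` than `distractor` is - a toy noise-immunity probe."""
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(trials):
        noisy = patch + rng.normal(0, sigma, patch.shape)
        if metric(noisy, patch) < metric(distractor, patch):
            hits += 1
    return hits / trials
```

Sweeping `sigma` for each metric traces out how quickly its matching decisions degrade with noise, which is the quantity a benchmark of this kind reports.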

Pages 239-1 - 239-7,  © Society for Imaging Science and Technology 2021

Skeleton-based action recognition plays a critical role in computer vision research, and its applications have been widely deployed in many areas. Currently, benefiting from graph convolutional networks (GCNs), the performance of this task has improved dramatically due to the powerful ability of GCNs to model non-Euclidean data. However, most of these works are designed for clean skeleton data, while in reality such data are usually noisy, since they are mostly obtained with a depth camera or even estimated from an RGB camera rather than recorded by a high-quality but extremely costly motion capture (MoCap) [1] system. Under this circumstance, we propose a novel GCN framework with adversarial training to deal with noisy skeleton data. Guided by the clean data at the semantic level, a reliable graph embedding can be extracted for noisy skeleton data. In addition, a discriminator is introduced so that the feature representation is further improved through adversarial learning. We empirically evaluate the proposed framework on the two largest current skeleton-based action recognition datasets. Comparison results show the superiority of our method over state-of-the-art methods under noisy settings.
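The "semantic-level guidance by clean data" can be caricatured as learning a map from noisy embeddings to clean ones. The ridge-regression sketch below is a deliberately linear stand-in: the GCN and the adversarial discriminator are omitted, and all names are hypothetical:

```python
import numpy as np

def fit_denoising_projection(noisy_feats, clean_feats, lam=1e-3):
    """Ridge-regression map from noisy to clean graph embeddings - a linear
    stand-in for guidance by clean data (the adversarial branch is omitted).

    noisy_feats, clean_feats: (n_samples, dim) paired feature matrices.
    Returns W such that noisy_feats @ W approximates clean_feats.
    """
    X, Y = noisy_feats, clean_feats
    d = X.shape[1]
    # Closed-form ridge solution: (X^T X + lam I)^-1 X^T Y
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ Y)
```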

Pages 240-1 - 240-7,  © Society for Imaging Science and Technology 2021

It has been rigorously demonstrated that an end-to-end (E2E) differentiable formulation of a deep neural network can turn a complex recognition problem into a unified optimization task solvable by a gradient descent method. Although E2E network optimization yields a powerful fitting ability, the joint optimization of layers is known to potentially bring situations where layers co-adapt to one another in a complex way that harms generalization ability. This work numerically evaluates the generalization ability of a particular non-E2E network optimization approach known as FOCA (Feature-extractor Optimization through Classifier Anonymization), which helps avoid such complex co-adaptation, with careful hyperparameter tuning. We present intriguing empirical results in which the non-E2E trained models consistently outperform the corresponding E2E trained models on three image-classification datasets. We further show that E2E network fine-tuning, applied after the feature-extractor optimization by FOCA and the subsequent classifier optimization with the fixed feature extractor, gives no improvement in test accuracy. The source code is available at https://github.com/DensoITLab/FOCA-v1.

Pages 241-1 - 241-5,  © Society for Imaging Science and Technology 2021

This paper proposes a novel method to correct saturated pixels in images. The method is based on the YCbCr color space and separately corrects the chrominance and the luminance of saturated pixels. In this algorithm, the saturated image is processed along the scan line, which benefits the hardware implementation while maintaining a good correction effect. Joint simulation with MATLAB and ModelSim shows that the hardware algorithm can use fewer resources to achieve fast correction. The Altera DE4 development platform is used for the hardware implementation. The results show that high-speed image and video processing on an FPGA is feasible and efficient, and can be performed frame by frame for high-definition video, giving the method broad practical application prospects.
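A software caricature of scan-line processing in YCbCr illustrates why this structure maps well to hardware: each pixel needs only state carried from its left neighbor. The correction rule below (saturated pixels inherit chroma from the nearest unsaturated pixel to their left; measured luma is kept) is a hypothetical variant, not the paper's exact algorithm:

```python
import numpy as np

# BT.601 full-range RGB -> YCbCr conversion matrix (rows: Y, Cb, Cr)
RGB2YCC = np.array([[ 0.299,   0.587,   0.114 ],
                    [-0.1687, -0.3313,  0.5   ],
                    [ 0.5,    -0.4187, -0.0813]])

def correct_saturation(rgb, thresh=250):
    """Scan-line saturation correction sketch (hypothetical variant):
    saturated pixels inherit chroma from the nearest unsaturated pixel
    to their left; luma is kept as measured."""
    ycc = rgb.astype(float) @ RGB2YCC.T
    sat = (rgb >= thresh).any(axis=2)          # any clipped channel
    out = ycc.copy()
    for row in range(rgb.shape[0]):
        last_cb, last_cr = 0.0, 0.0            # neutral gray if row starts saturated
        for col in range(rgb.shape[1]):
            if sat[row, col]:
                out[row, col, 1] = last_cb     # propagate chroma along scan line
                out[row, col, 2] = last_cr
            else:
                last_cb, last_cr = out[row, col, 1], out[row, col, 2]
    return out
```

Because the inner loop depends only on the previous pixel's chroma registers, an FPGA can evaluate it in a single pass per line at pixel-clock rate.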

Pages 246-1 - 246-6,  © Society for Imaging Science and Technology 2021

Accurate diagnosis of microcalcification (MC) lesions in mammograms as benign or malignant is a challenging clinical task. In this study, we investigate the potential discriminative power of deep learning features in MC lesion diagnosis. We consider two types of deep learning networks: a convolutional neural network developed for MC detection and a denoising autoencoder network. In the experiments, we evaluated both the separability between malignant and benign lesions and the classification performance of image features from these two networks using Fisher's linear discriminant analysis on a set of mammographic images. The results demonstrate that the deep learning features from the MC detection network are the most discriminative for classification of MC lesions, compared with both the features from the autoencoder network and traditional handcrafted texture features.
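The separability comparison rests on Fisher's criterion, which can be scored in closed form for two feature sets. A minimal sketch (with synthetic Gaussian "features" in the usage; the real inputs would be the network activations):

```python
import numpy as np

def fisher_score(feats_a, feats_b):
    """Fisher criterion J(w) at the closed-form optimum w = Sw^-1 (mu_a - mu_b).

    feats_a, feats_b: (n_samples, dim) feature matrices for the two classes.
    Larger J means more linearly separable features.
    """
    mu_a, mu_b = feats_a.mean(0), feats_b.mean(0)
    # Within-class scatter (sum of the two class covariances)
    Sw = np.cov(feats_a, rowvar=False) + np.cov(feats_b, rowvar=False)
    Sw += 1e-6 * np.eye(Sw.shape[0])      # regularize for numerical stability
    d = mu_a - mu_b
    w = np.linalg.solve(Sw, d)
    return float(d @ w)                   # J(w*) = d^T Sw^-1 d
```

Computing this score for each candidate feature set (detection-network features, autoencoder features, handcrafted textures) gives the kind of separability ranking the abstract describes.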

