The quality of low-light photographs has become a competitive edge for mobile phones. Maintaining it requires screening out, in advance, the dark defects that cause abnormalities in dark photos, dark blemish in particular. However, separating dark blemish patterns currently demands considerable manpower because the existing scoring method suffers from low consistency. This paper proposes a novel deep learning-based screening method to solve this problem. The proposed pipeline uses two ResNet-D models of different depths to perform classification and visibility regression, respectively, and then derives a new score that combines the outputs of both models into one. In addition, we collect a large-scale image set from real manufacturing processes to train the models and configure the dataset with two labeling systems, one suited to each model. Experimental results show the performance of the deep learning models trained and validated on the presented datasets. Our classification model significantly improves screening performance in terms of accuracy and F1-score compared to the conventional handcrafted method. The visibility regression method also shows a high Pearson correlation coefficient with the scores of 30 expert engineers, and the inference output of our regression model is consistent with them.
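A minimal sketch of the two-model idea in the abstract above: one network classifies the defect and another regresses its visibility, and the two outputs are fused into a single screening score. The fusion rule, network choices (plain ResNets stand in for ResNet-D), and all names here are illustrative assumptions, not the authors' exact formulation.

```python
import torch
import torchvision.models as models

# Stand-ins for the two ResNet-D models of different depths.
clf_net = models.resnet18(weights=None)
clf_net.fc = torch.nn.Linear(clf_net.fc.in_features, 2)   # blemish / no blemish
reg_net = models.resnet50(weights=None)
reg_net.fc = torch.nn.Linear(reg_net.fc.in_features, 1)   # visibility level

def screening_score(image: torch.Tensor, alpha: float = 0.5) -> float:
    """Fuse defect probability and visibility into one score (assumed rule)."""
    with torch.no_grad():
        p_defect = torch.softmax(clf_net(image), dim=1)[0, 1].item()
        visibility = torch.sigmoid(reg_net(image))[0, 0].item()
    return alpha * p_defect + (1.0 - alpha) * visibility

x = torch.randn(1, 3, 224, 224)  # dummy sensor image
print(f"screening score: {screening_score(x):.3f}")
```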
Accurate measurement of daily feed consumption in dairy cattle is an important metric for determining animal health and feed efficiency. Traditionally, manual measurements or average feed consumption for groups of animals have been used, which introduces human error and yields inconsistent measurements for individual animals. We therefore developed a scalable, non-invasive analytics system that leverages depth information from stereo cameras to consistently measure the feed offered and report findings throughout the day. A top-down array of cameras faces the available feed, measures feed depth, projects the depth to a three-dimensional (3D) mesh, and finally estimates feed volume from the 3D projection. Our successful experiments at the Purdue University Dairy, which houses 230 cows, demonstrate the system's robustness and scalability for larger operations, holding significant potential for optimizing feed management in dairy farms and thereby improving animal health and sustainability in the dairy industry.
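A minimal numpy sketch of the depth-to-volume step described above: feed height is the difference between an empty-bunk reference depth and the observed depth, and volume is that height integrated over each pixel's footprint. This flat-plane simplification stands in for the 3D mesh projection; camera height, pixel area, and array shapes are illustrative assumptions.

```python
import numpy as np

def feed_volume_m3(depth_m: np.ndarray, empty_depth_m: np.ndarray,
                   pixel_area_m2: float) -> float:
    """Estimate feed volume from a top-down depth map (simplified model)."""
    height = np.clip(empty_depth_m - depth_m, 0.0, None)  # feed rises toward the camera
    return float(height.sum() * pixel_area_m2)

empty = np.full((480, 640), 1.50)                                  # camera 1.5 m above an empty bunk
observed = empty - np.random.uniform(0.0, 0.10, size=empty.shape)  # up to 10 cm of feed
print(f"estimated volume: {feed_volume_m3(observed, empty, pixel_area_m2=1e-5):.4f} m^3")
```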
Speech emotions (SEs) are an essential component of human interaction and an efficient way of influencing human behavior. The recognition of emotions from speech is an emergent but challenging area of digital signal processing (DSP). Healthcare professionals are always looking for better ways to understand patient voices for diagnosis and treatment. Speech emotion recognition (SER) from the human voice, particularly for people with neurological disorders such as Parkinson's disease (PD), can expedite the diagnostic process. Patients with PD are currently diagnosed through expensive tests and continuous monitoring, which is time-consuming and costly. This research aims to develop a system that can accurately identify the common SEs that matter for PD patients: anger, happiness, sadness, and neutral speech. We propose a novel lightweight deep model to predict these SEs. Adaptive wavelet thresholding is employed to pre-process the audio data. Furthermore, we generate spectrograms from the speech data instead of processing the raw waveform directly, in order to extract more discriminative features. The proposed method is trained on spectrograms generated from the IEMOCAP dataset. The deep learning model contains convolutional layers for learning discriminative features from the spectrograms. The framework is evaluated on standard performance metrics and shows promising real-time results for PD patients.
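A minimal sketch of the pipeline described above: wavelet-denoise the audio, convert it to a spectrogram, and classify with a small CNN. PyWavelets and librosa are used as stand-ins; the soft-threshold rule, layer sizes, and the four-class head are illustrative assumptions, not the authors' exact configuration.

```python
import numpy as np
import pywt
import librosa
import torch
import torch.nn as nn

def wavelet_denoise(signal, wavelet="db4", level=4):
    """Soft-threshold wavelet coefficients (a simple adaptive-threshold variant)."""
    coeffs = pywt.wavedec(signal, wavelet, level=level)
    sigma = np.median(np.abs(coeffs[-1])) / 0.6745       # noise estimate from finest scale
    thr = sigma * np.sqrt(2 * np.log(len(signal)))       # universal threshold
    coeffs = [coeffs[0]] + [pywt.threshold(c, thr, mode="soft") for c in coeffs[1:]]
    return pywt.waverec(coeffs, wavelet)

audio = np.random.randn(16000).astype(np.float32)        # 1 s of dummy speech @ 16 kHz
clean = wavelet_denoise(audio)
spec_db = librosa.power_to_db(
    librosa.feature.melspectrogram(y=clean, sr=16000, n_mels=64))

cnn = nn.Sequential(                                     # lightweight CNN over spectrograms
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
    nn.Flatten(), nn.Linear(32, 4),                      # anger / happy / sad / neutral
)
logits = cnn(torch.from_numpy(spec_db).float()[None, None])
print(logits.shape)  # torch.Size([1, 4])
```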
As facial authentication systems become increasingly widespread, subtle inaccuracies for certain demographic subgroups grow in importance. As researchers apply data augmentation to increase subgroup accuracies, it is critical that these augmentation approaches be well understood. We specifically study the impact that one such augmentation, racial transformation, has on an individual's identity as perceived by a facial authentication network. This shows whether a racial transformation maintains the critical aspects of an individual's identity or whether it effectively creates an entirely new individual for networks to train on. We describe our racial transformation method, based on methods from leading prior work, compare the embedding-distance distribution of augmented faces with that of non-augmented faces, and explain to what extent racial transformation preserves the critical aspects of an individual's identity.
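A minimal sketch of the embedding-distance comparison described above: embed an original face, its transformed counterpart, and a different identity, then compare distances. The embedding network here is a generic stand-in, not the authors' facial authentication model, and the images and the additive perturbation standing in for the racial transform are placeholders.

```python
import torch
import torchvision.models as models
import torch.nn.functional as F

embedder = models.resnet18(weights=None)
embedder.fc = torch.nn.Identity()          # use the 512-d penultimate features

def embed(img: torch.Tensor) -> torch.Tensor:
    with torch.no_grad():
        return F.normalize(embedder(img), dim=1)

original = torch.randn(1, 3, 224, 224)
augmented = original + 0.05 * torch.randn_like(original)  # placeholder for the racial transform
other_id = torch.randn(1, 3, 224, 224)

d_same = 1 - F.cosine_similarity(embed(original), embed(augmented)).item()
d_diff = 1 - F.cosine_similarity(embed(original), embed(other_id)).item()
# If d_same stays well below d_diff, the transform preserves identity cues.
print(f"original vs augmented: {d_same:.3f}, original vs other identity: {d_diff:.3f}")
```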
Accurate diagnosis of microcalcification (MC) lesions in mammograms as benign or malignant is a challenging clinical task. In this study, we investigate the potential discriminative power of deep learning features for MC lesion diagnosis. We consider two types of deep learning networks: a convolutional neural network developed for MC detection, and a denoising autoencoder network. In the experiments, we evaluated both the separability between malignant and benign lesions and the classification performance of image features from these two networks, using Fisher's linear discriminant analysis on a set of mammographic images. The results demonstrate that the deep learning features from the MC detection network are more discriminative for classifying MC lesions than both the features from the autoencoder network and traditional handcrafted texture features.
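A minimal sketch of the evaluation described above: take feature vectors extracted from lesion images and assess benign/malignant separability with Fisher's linear discriminant. Random features stand in for the detection-network and autoencoder features; the feature dimension, sample counts, and class shift are illustrative assumptions.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
# 200 lesions x 256-d deep features; malignant features shifted to mimic separability.
features = rng.normal(size=(200, 256))
labels = np.repeat([0, 1], 100)                 # 0 = benign, 1 = malignant
features[labels == 1] += 0.3

lda = LinearDiscriminantAnalysis()              # Fisher's linear discriminant
acc = cross_val_score(lda, features, labels, cv=5).mean()
print(f"cross-validated LDA accuracy: {acc:.3f}")
```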
Classification of degraded images is very important in practice because images are usually degraded by compression, noise, blurring, etc. Nevertheless, most research on image classification focuses only on clean images without any degradation. Some papers have proposed deep convolutional neural networks that combine an image restoration network with a classification network to classify degraded images. This paper proposes an alternative approach in which a degraded image and an additional degradation parameter are used together for classification. The proposed classification network has two inputs: the degraded image and the degradation parameter. An estimation network for the degradation parameter is also incorporated for cases where the parameter of a degraded image is unknown. Experimental results show that the proposed method outperforms the straightforward approach in which the classification network is trained on degraded images only.
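A minimal sketch of the two-input classifier described above: the degradation parameter (e.g., a noise level) is fed alongside the image and concatenated with the image features before the final classification layer. The fusion point, backbone, and layer sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torchvision.models as models

class DegradationAwareClassifier(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        backbone = models.resnet18(weights=None)
        feat_dim = backbone.fc.in_features
        backbone.fc = nn.Identity()                       # expose the feature vector
        self.backbone = backbone
        self.head = nn.Linear(feat_dim + 1, num_classes)  # +1 for the parameter

    def forward(self, image: torch.Tensor, degradation: torch.Tensor) -> torch.Tensor:
        feats = self.backbone(image)
        return self.head(torch.cat([feats, degradation[:, None]], dim=1))

model = DegradationAwareClassifier()
x = torch.randn(4, 3, 224, 224)                           # degraded images
sigma = torch.tensor([5.0, 10.0, 15.0, 20.0]) / 50.0      # known noise levels, rescaled
print(model(x, sigma).shape)                              # torch.Size([4, 10])
```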
In this paper, we propose a patch-based system to classify non-small cell lung cancer (NSCLC) diagnostic whole slide images (WSIs) into two major histopathological subtypes: adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC). Classifying patients accurately is important for prognosis and therapy decisions. The proposed system was trained and tested on 876 subtyped NSCLC gigapixel-resolution diagnostic WSIs from 805 patients, 664 in the training set and 141 in the test set. The algorithm has modules for: 1) auto-generated tumor/non-tumor masking using a trained residual neural network (ResNet34), 2) cell-density map generation (based on color deconvolution, local drain segmentation, and watershed transformation), 3) patch-level feature extraction using a pre-trained ResNet34, 4) a tower of linear SVMs for different cell-density ranges, and 5) a majority-voting module for aggregating subtype predictions on unseen test WSIs. The system was trained and tested at several WSI magnifications ranging from 4× to 40×, with a best ROC AUC of 0.95 and an accuracy of 0.86 on test samples. This fully automated histopathology subtyping method outperforms similar published state-of-the-art methods on diagnostic WSIs.
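A minimal sketch of modules 3-5 described above: patch features feed a linear SVM, and a slide-level label comes from majority voting over that slide's patches. Random vectors stand in for the ResNet34 patch features, and a single SVM simplifies the tower of SVMs over cell-density ranges.

```python
import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
train_feats = rng.normal(size=(1000, 512))     # stand-in ResNet34 features of training patches
train_labels = rng.integers(0, 2, size=1000)   # 0 = LUAD, 1 = LUSC
svm = LinearSVC().fit(train_feats, train_labels)

def classify_slide(patch_feats: np.ndarray) -> int:
    """Majority vote over the SVM's patch-level subtype predictions."""
    votes = svm.predict(patch_feats)
    return int(np.bincount(votes, minlength=2).argmax())

wsi_patches = rng.normal(size=(250, 512))      # features of one unseen slide's patches
print("predicted subtype:", ["LUAD", "LUSC"][classify_slide(wsi_patches)])
```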
Overweight vehicles are a common source of pavement and bridge damage. Mobile crane vehicles in particular often exceed legal per-axle weight limits, since they carry their lifting blocks and ballast on the vehicle instead of on a separate trailer. To prevent road deterioration, detecting overweight cranes is desirable for law enforcement. As the sources of crane weight are visible, we propose a camera-based detection system built on convolutional neural networks. We label our dataset iteratively to vastly reduce the labeling effort, and extensively investigate the impact of image resolution, network depth, and dataset size to choose optimal parameters during iterative labeling. We show that iterative labeling with intelligently chosen image resolutions and network depths can vastly improve (up to 70×) the speed at which data can be labeled for training classification systems in practical surveillance applications. The experiments provide an estimate of the optimal amount of data required to train an effective classification system, which is valuable for classification problems in general. The proposed system achieves an AUC score of 0.985 for distinguishing cranes from other vehicles, and AUCs of 0.92 and 0.77 for lifting-block and ballast classification, respectively. The proposed classification system enables effective road monitoring for semi-automatic law enforcement and is attractive for rare-class extraction in general surveillance classification problems.
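A minimal sketch of an iterative-labeling loop in the spirit of the abstract above: train on the current labeled pool, let the model score the unlabeled pool, and hand only the most uncertain samples to a human each round. Logistic regression stands in for the CNN, the data is synthetic, and the batch sizes and number of rounds are assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 64))                  # image features (stand-in for pixels)
y = (X[:, 0] > 0).astype(int)                    # hidden truth: crane vs. other vehicle
labeled = list(rng.choice(len(X), size=50, replace=False))

for round_ in range(3):
    model = LogisticRegression(max_iter=1000).fit(X[labeled], y[labeled])
    unlabeled = np.setdiff1d(np.arange(len(X)), labeled)
    proba = model.predict_proba(X[unlabeled])[:, 1]
    uncertain = unlabeled[np.argsort(np.abs(proba - 0.5))[:100]]  # nearest the boundary
    labeled.extend(uncertain)                    # a human annotator would label these
    print(f"round {round_}: pool={len(labeled)}, acc={model.score(X, y):.3f}")
```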
Cosmologists face the problem of analyzing a huge quantity of data when observing the sky. Most methods used in cosmology rely on astrophysical models, and for classification they usually follow a two-step machine learning approach: first extracting features, then applying a classifier. In this paper, we specifically study the supernova phenomenon, and in particular the binary classification of Type Ia supernovae versus non-Type Ia supernovae. We present two Convolutional Neural Networks (CNNs) that outperform the current state of the art. The first is adapted to time series and thus to the treatment of supernova light curves. The second is based on a Siamese CNN and is suited to the nature of the data, i.e., its sparsity and scarcity (a small learning database).
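A minimal sketch of the first network described above: a 1D CNN over multi-band supernova light curves treated as time series, producing a Type Ia vs. non-Ia logit. The number of photometric bands, sequence length, and layer sizes are illustrative assumptions, not the authors' architecture.

```python
import torch
import torch.nn as nn

lightcurve_cnn = nn.Sequential(
    nn.Conv1d(4, 32, kernel_size=5, padding=2), nn.ReLU(),   # 4 photometric bands
    nn.Conv1d(32, 64, kernel_size=5, padding=2), nn.ReLU(),
    nn.AdaptiveAvgPool1d(1), nn.Flatten(),
    nn.Linear(64, 1),                                        # Ia vs. non-Ia logit
)

# One batch of 8 light curves, each sampled at 100 time steps in 4 bands.
flux = torch.randn(8, 4, 100)
print(torch.sigmoid(lightcurve_cnn(flux)).shape)  # torch.Size([8, 1])
```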
Recently, the use of neural networks for image classification has become widespread. Thanks to the availability of increased computational power, better-performing architectures such as deep neural networks have been designed. In this work, we propose a novel image representation framework exploiting the deep p-Fibonacci scattering network. The architecture is based on structured p-Fibonacci scattering over graph data. This approach provides good classification accuracy while reducing computational complexity. Experimental results demonstrate that the performance of the proposed method is comparable to state-of-the-art unsupervised methods while being computationally more efficient.
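A minimal sketch of a scattering-style representation in the spirit of the network described above: fixed (untrained) convolution filters, a modulus nonlinearity, and pooling, cascaded over two layers. This is a generic scattering illustration only; the p-Fibonacci structuring over graph data is not reproduced here.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
filters1 = torch.randn(8, 1, 5, 5)     # fixed first-layer filters (no training)
filters2 = torch.randn(16, 8, 5, 5)    # fixed second-layer filters

def scatter(image: torch.Tensor) -> torch.Tensor:
    """Two-layer cascade: |conv| -> pool -> |conv| -> global pool -> features."""
    s1 = F.avg_pool2d(F.conv2d(image, filters1, padding=2).abs(), 2)
    s2 = F.conv2d(s1, filters2, padding=2).abs()
    return F.adaptive_avg_pool2d(s2, 1).flatten(1)  # one descriptor per image

x = torch.randn(4, 1, 32, 32)          # e.g., grayscale CIFAR-like inputs
print(scatter(x).shape)                # torch.Size([4, 16])
```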