Scientific and technological advances during the last decade in the fields of image acquisition, data processing, telecommunications, and computer graphics have contributed to the emergence of new multimedia, especially 3D digital data. Modern 3D imaging technologies allow for the acquisition of 3D and 4D (3D video) data at higher speeds, resolutions, and accuracies. With the ability to capture increasingly complex 3D/4D information, advancements have also been made in the areas of 3D data processing (e.g., filtering, reconstruction, compression). As such, 3D/4D technologies are now being used in a large variety of applications, such as medicine, forensic science, cultural heritage, manufacturing, autonomous vehicles, security, and bioinformatics. Further, with mixed reality (AR, VR, XR), 3D/4D technologies may also change the ways we work, play, and communicate with each other every day.
In recent years, behavioral biometric authentication, which uses habitual behavioral characteristics for personal authentication, has attracted attention as a higher-security authentication method, since behavioral traits cannot be mimicked as easily as fingerprints or faces. Among behavioral biometrics, much research has been performed on voiceprints. However, few authentication technologies utilize the habits of hand and finger movements during hand gestures, and conventional hand gesture authentication methods use either color images or depth images, but not both. In this research, we therefore propose to find individual habits in RGB-D videos of finger movements and build a personal authentication system on them. A 3D CNN, a deep learning-based network, is used to extract the individual habits. An F-measure of 0.97 is achieved when rock-paper-scissors is used as the authentication operation, and likewise an F-measure of 0.97 when a hand-disinfection motion is used. These results show the effectiveness of using RGB-D video for personal authentication.
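The abstract names a 3D CNN but gives no architecture details, so the following is only a minimal sketch of the core operation such a network applies to an RGB-D clip: a 3D convolution that slides a kernel jointly over time and both spatial axes, which is what lets the network pick up motion habits rather than single-frame appearance. All shapes here (4 input channels for RGB-D, 3×3×3 kernels) are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

def conv3d(volume, kernels, stride=1):
    """Valid 3D convolution.

    volume:  (C, T, H, W)       -- channels, frames, height, width
    kernels: (K, C, kt, kh, kw) -- K filters spanning all input channels
    returns: (K, T', H', W')
    """
    C, T, H, W = volume.shape
    K, Ck, kt, kh, kw = kernels.shape
    assert C == Ck, "kernel channel count must match input"
    To = (T - kt) // stride + 1
    Ho = (H - kh) // stride + 1
    Wo = (W - kw) // stride + 1
    out = np.zeros((K, To, Ho, Wo))
    for k in range(K):
        for t in range(To):
            for y in range(Ho):
                for x in range(Wo):
                    # each output value is a dot product of one
                    # spatio-temporal patch with one filter
                    patch = volume[:, t*stride:t*stride+kt,
                                      y*stride:y*stride+kh,
                                      x*stride:x*stride+kw]
                    out[k, t, y, x] = np.sum(patch * kernels[k])
    return out

# Illustrative RGB-D clip: 4 channels, 16 frames, 32x32 pixels,
# filtered by 8 assumed 3x3x3 kernels.
clip = np.random.rand(4, 16, 32, 32)
features = conv3d(clip, np.random.rand(8, 4, 3, 3, 3))
```

In a real network this layer would be followed by nonlinearities, pooling over time and space, and a classification head; the sketch only shows why the temporal kernel dimension gives the model access to movement dynamics.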
This paper presents a novel method for accurately encoding 3D range geometry within the color channels of a 2D RGB image that allows the encoding frequency, and therefore the encoding precision, to be uniquely determined for each coordinate. The proposed method can thus be used to balance encoding precision against file size by encoding geometry along a normal distribution: encoding more precisely where the density of data is high and less precisely where it is low. Alternative distributions may be followed to produce encodings optimized for specific applications. In general, the nature of the proposed encoding method is such that the precision of each point can be freely controlled or derived from an arbitrary distribution, making the method suitable for a wide range of applications.
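As an illustration of how encoding frequency governs precision, here is a hedged sketch (an assumed scheme, not the paper's actual algorithm) of packing a normalized depth value into three 8-bit channels: a high-frequency sine/cosine pair, whose frequency `f` sets the precision, plus a coarse copy of the depth used to unwrap the periodic phase on decode.

```python
import numpy as np

def encode(z, f=16):
    """Encode normalized depth z in [0, 1) into three 8-bit channels.

    R, G carry sin/cos of the depth at frequency f (high precision,
    but periodic); B carries a coarse copy for phase unwrapping.
    """
    q = lambda v: np.round(255 * v) / 255  # simulate 8-bit quantization
    r = q(0.5 + 0.5 * np.sin(2 * np.pi * f * z))
    g = q(0.5 + 0.5 * np.cos(2 * np.pi * f * z))
    b = q(z)
    return r, g, b

def decode(r, g, b, f=16):
    """Recover depth: fractional period from the phase, integer period
    from the coarse channel."""
    frac = (np.arctan2(2*r - 1, 2*g - 1) / (2 * np.pi)) % 1.0  # f*z mod 1
    k = np.round(f * b - frac)                                  # period index
    return (k + frac) / f

z = np.linspace(0.0, 0.999, 101)
z_rec = decode(*encode(z))
```

Raising `f` shrinks the depth range each sine period spans, so the fixed 8-bit phase quantization error maps to a smaller depth error; a per-coordinate `f`, as the paper proposes, makes that precision spatially adaptive.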
This paper describes the development of a low-cost, low-power, accurate sensor designed for precise feedback control of an autonomous vehicle to a hitch. The solution that has been developed uses an active stereo vision system, combining classical stereo vision with a low-cost, low-power laser speckle projection system, which solves the correspondence problem experienced by classic stereo vision sensors. A third camera is added to the sensor for texture mapping. A model test of the hitching problem was developed using an RC car and a target representing a hitch. A control system is implemented to precisely guide the vehicle to the hitch. The system can successfully control the vehicle from within 35° of perpendicular to the hitch to a final position with an overall standard deviation of 3.0 mm of lateral error and 1.5° of angular error.
X-ray phase contrast tomography (XPCT) is widely used for 3D imaging of objects with weak contrast in the X-ray absorption index but strong contrast in the refractive index decrement. To reconstruct an object imaged using XPCT, phase retrieval algorithms are first used to estimate the X-ray phase projections, which are the 2D projections of the refractive index decrement, at each view. Phase retrieval is followed by reconstruction of the refractive index decrement from the phase projections using an algorithm such as filtered back projection (FBP). In practice, phase retrieval is most commonly solved by approximating it as a linear inverse problem. However, this linear approximation often results in artifacts and blurring when the conditions for the approximation are violated. In this paper, we formulate phase retrieval as a non-linear inverse problem, in which we solve for the transmission function, the negative exponential of the projections, from XPCT measurements. We use a constraint to enforce proportionality between the phase and absorption projections. We do not rely on assumptions such as a large Fresnel number, slowly varying phase, or the Born/Rytov approximations. Our approach also does not require any regularization parameter tuning since there is no explicit sparsity-enforcing regularization function. We validate the performance of our non-linear phase retrieval (NLPR) method using both simulated and real synchrotron datasets. We compare NLPR with a popular linear phase retrieval (LPR) approach and show that NLPR achieves sharper reconstructions with higher quantitative accuracy.
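For context, one common way to write the quantities involved (a sketch in assumed notation, not necessarily the paper's) is: with complex refractive index $n = 1 - \delta + i\beta$ and wavelength $\lambda$, the transmission function at each view is

```latex
T(x,y) = \exp\bigl[-A(x,y) - i\,\phi(x,y)\bigr],
\qquad
\phi(x,y) = \frac{2\pi}{\lambda}\int \delta \,\mathrm{d}z,
\qquad
A(x,y) = \frac{2\pi}{\lambda}\int \beta \,\mathrm{d}z,
```

so $T$ is indeed the negative exponential of the (absorption and phase) projections, and the proportionality constraint corresponds to assuming $A \propto \phi$, i.e., a fixed $\beta/\delta$ ratio across the object.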
In this paper, we present a novel Lidar imaging system for a heads-up display. The imaging system consists of a one-dimensional laser distance sensor and IMU sensors, including an accelerometer and a gyroscope. By fusing the sensory data as the user moves their head, the system creates a three-dimensional point cloud that maps the surrounding space. Compared to prevailing 2D and 3D Lidar imaging systems, the proposed system has no moving parts; it is simple, lightweight, and affordable. Our tests show that the horizontal and vertical profile accuracy of the points versus the floor plan is 3 cm on average. For bump detection, the minimal detectable step height is 2.5 cm. The system can be applied to first-response scenarios such as firefighting, and to detecting bumps on pavement for low-vision pedestrians.
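The fusion step described above can be sketched as follows, under simplifying assumptions that are ours rather than the paper's: the IMU yields head yaw and pitch, roll is negligible, and the ranging axis is aligned with the head's forward direction. Each range reading then becomes one point of the cloud.

```python
import numpy as np

def ray_to_point(r, yaw_deg, pitch_deg, origin=np.zeros(3)):
    """Convert a 1D range reading r plus head orientation from the IMU
    into a 3D point. Frame convention (assumed): x forward, y left,
    z up; roll is taken as zero."""
    yaw, pitch = np.radians([yaw_deg, pitch_deg])
    direction = np.array([np.cos(pitch) * np.cos(yaw),
                          np.cos(pitch) * np.sin(yaw),
                          np.sin(pitch)])
    return origin + r * direction

# Sweeping the head across yaw at a fixed pitch traces one scan line
# of the point cloud, one sample per (range, orientation) pair.
cloud = np.array([ray_to_point(2.0, yaw, -10.0)
                  for yaw in range(-45, 46, 5)])
```

In the real system, accelerometer and gyroscope data would be filtered into a drift-corrected orientation estimate before this projection; the sketch only shows the geometric core of turning 1D ranges into a 3D map.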
A range image of a scene is produced with a solid-state time-of-flight system that uses active illumination and a time-gated single photon avalanche diode (SPAD) array. The distance from the imager to a target is measured by delaying the time gate in small steps and counting the photons detected by the pixels in each delay step over successive measurements. To achieve a high frame rate, the number of delay steps needed is minimized by limiting the depth scan to the range of interest. To be able to measure scenes with objects at different ranges, the array has been divided into groups of pixels with independently controlled time gating. This paper demonstrates an algorithm that controls the time gating of the pixel groups in the sensor array to achieve depth maps of the scene with the time-gated SPAD array in real time at a 70 Hz frame rate.
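A minimal sketch of such gate control (an assumed strategy, not the paper's actual algorithm): for each pixel group, center the next frame's gate window on the delay bin that collected the most photons in the previous frame, clipped to the sensor's delay-step range. This keeps the per-frame scan short while letting each group track its own object depth.

```python
def next_gate_window(counts, width, n_steps):
    """Choose the next frame's gate window for one pixel group.

    counts:  photon counts per delay step from the previous frame
    width:   number of delay steps to scan in the next frame
    n_steps: total delay steps available on the sensor
    returns: (start, stop) half-open window of delay steps
    """
    # Delay bin with the photon-count peak approximates the target depth.
    peak = max(range(len(counts)), key=counts.__getitem__)
    # Center the window on the peak, clipped to the valid step range.
    start = min(max(peak - width // 2, 0), n_steps - width)
    return start, start + width

# A group whose counts peak at step 12 gets a 5-step window around it.
window = next_gate_window([0] * 12 + [9] + [0] * 51, width=5, n_steps=64)
```

A production controller would also need to handle flat histograms (no target in range, fall back to a full sweep) and multiple returns per group; the sketch shows only the tracking step.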
Holostream is a novel platform that enables high-quality 3D video communication on mobile devices (e.g., iPhones, iPads) over existing standard wireless networks. The major contributions are: (1) a novel high-quality 3D video compression method that drastically reduces both 3D geometry and color texture data sizes so they can be transmitted within the bandwidths provided by existing wireless networks; (2) a novel pipeline for 3D video recording, encoding, compression, decompression, visualization, and interaction; and (3) a demonstration system that successfully delivered video-rate, photorealistic 3D video content through a standard wireless network to mobile devices. The novel platform improves the quality and expands upon the capabilities of popular applications already utilizing real-time 3D data delivery, such as teleconferencing and telepresence. This technology could also enable emerging applications that require high-resolution, high-accuracy 3D video data delivery, such as remote robotic surgery and telemedicine.