Bumpy Appearance Editing of Object Surfaces in Digital Images

Yusuke Manabe; Midori Tanaka; Takahiko Horiuchi

doi:10.2352/J.ImagingSci.Technol.2022.66.5.050403

Abstract

Objects in nature have diverse appearances, and appearance is one of the elements constituting the unique visual aspect of an object. However, previous studies have shown that when an object is represented as a digital image, its appearance can change compared to the real object depending on the material. In this study, the focus was on the “bumpiness” of the object surface, whereby a method was proposed to edit the bumpiness of the object in the image. First, the statistics obtained from images were analyzed in relation to the sensibility value of the bumpiness perceived by humans. Since the statistics on the original image could not fully explain the perception of bumpiness, analyses were conducted on multiscale images. The results suggested that the mean, standard deviation, and top, defined as the average of the luminance values within the top 10 [percentile], were highly influential as cues for bumpiness perception. The results also indicated that the components in the range of 5–65 [cycles per image-width] were highly influential. Based on these analysis results, a method is proposed to edit bumpiness perception by modulating components in the low and medium frequency bands. The effectiveness of the proposed method is demonstrated by image modulation experiments on objects of various materials.

jist

JIMTE6

Journal of Imaging Science and Technology

J. Imaging Sci. Technol.

1062-3701

1943-3522

Society for Imaging Science and Technology

050403

10.2352/J.ImagingSci.Technol.2022.66.5.050403

1321

Papers based on Work Presented at CIC30: Color and Imaging 2022

Bumpy Appearance Editing of Object Surfaces in Digital Images

Bumpy appearance editing of object surfaces in digital images

ManabeYusuke

▴

Graduate School of Science and Engineering, Chiba University, Japan

ayga3355@chiba-u.jp

TanakaMidori

▴

Graduate School of Global and Transdisciplinary Studies, Chiba University, Japan

HoriuchiTakahiko

▴

Graduate School of Science and Engineering, Chiba University, Japan

Manabe, Tanaka, and Horiuchi

▴

IS&T Members.

092022

050403-1

050403-15

1352022

2672022

2022

Abstract

ccc

1062-3701/2022/66(5)/050403/15/$25.00

printed

Printed in the USA

Introduction

Appearance is one of the elements that constitutes the unique visual aspect of an object. In recent years, considerable research has been conducted on appearance [1, 2]. A typical example of appearance research is the measurement, analysis, and reproduction of object gloss, which is perceived as a shiny or matte appearance. There is tremendous interest in the perception of gloss. As a result of a number of factors, including advances in technologies such as computer graphics and the development of new experimental techniques and measurements, efforts are underway to analyze the complex interplay between variables such as illumination and surface reflectance properties to determine the perception of gloss by the observer [3–10].

In recent years, opportunities for individuals to take photographs have increased dramatically with the reach of digital content such as on smartphones and social networking services. The higher resolution, brightness, and saturation of on-board cameras and display screens have made it possible to easily reproduce high-quality digital images. On the other hand, captured image data are encoded and compressed before being stored or transmitted. The joint photographic experts group (JPEG) [11] is a representative lossy compression method that has been actively used in various media as an international standard encoding method. Tanaka et al. showed that gloss, one of the appearance features, can be degraded in JPEG image coding [12]. They also showed that even if the color, brightness, and size of an object are faithfully reproduced on a display, the appearance can change depending on the material [13]. In general, appearance can easily change during the reproduction of digital images.

In this paper, the authors focus on the perception of bumpiness of object surfaces. In nature, object surfaces have a certain degree of bumpiness. Bumpiness is an appearance that affects a wide range of products, including metals, leather, and clothing. When the bumpy shape of an object changes, its appearance changes as well. Figure 1 shows an example of how JPEG image compression significantly changes the perceived bumpiness. Thus, because the bumpiness of an image can change, a method to edit the appearance in images is required.

Figure 1.

Change in perceived bumpiness by JPEG image compression.

Tanaka et al. proposed “PuRet” as a method to edit the appearance of objects in images [14]. This method focuses on the physiological phenomenon in which the pupil diameter decreases when observing the appearance of a material on the surface of an object, and a tone-mapping algorithm enhances the perceived appearance of the material by contracting the pupil diameter based on a retinal response model. Using this method, it is possible to emphasize the appearance of the material of the entire image, but it is impossible to edit only a specific component of the appearance of the material, such as gloss. An example of a method for editing the specific appearance of an object in an image is “band-sifting” [15], proposed by Boyadzhiev et al.. They demonstrated that sub-bands of spatial frequencies in an image can represent specific appearance information, and they edited this information by increasing or decreasing the signal in the sub-bands. By decomposing the image into frequencies and modulating it based on frequency, amplitude, and sign, it is possible to manipulate not only glossiness but also various other components of appearance. However, this manipulation requires trial and error during the editing process because the physical function is not clearly understood. In a previous study, the authors proposed a method using color space and object reflectance characteristics to edit the glossiness of the object surfaces of various materials in digital images [5]. This method was applied to test images containing objects of various materials. The effectiveness of the method was confirmed by subjective evaluation based on visual inspection and objective evaluation based on the image statistics proposed in Refs. [6–9]. However, because the method considered only glossiness, it could not edit other appearance components, including bumpiness. Ho et al. studied the perception of “roughness” [16, 17], which is a perceptual quality similar to that of bumpiness. They conducted an experiment using computer graphic (CG) images as stimulus images in which the shape and size of the bumps as well as albedo were fixed. The experimental results suggest that roughness is perceived by the proportion of shadows, mean pixel luminance, variation in pixel luminance, and texture contrast, irrespective of illumination or viewpoint angle. Although these studies on the perception of roughness have been conducted through psychological evaluation experiments using CG, the perceived sense of roughness may differ depending on whether the subject is a real object or a CG image because objects in nature have a complex combination of factors such as reflective characteristics, shape, motion, and illumination, affecting the perception of the observer.

The purpose of this study was to propose a method for editing bumpiness using images of real objects made of various materials as subjects. Here, “bumpiness” is defined as the bumpy appearance of an object’s surface. To this end, first, the relationship between the statistics obtained from the images and the sensibility value of the bumpiness perceived by humans was analyzed. Based on the results of these analyses, a bumpiness editing method was proposed and its effectiveness was experimentally verified.

Subjective Evaluation of Bumpy Appearance

A subjective evaluation experiment was conducted to analyze the factors affecting the perception of bumpiness. In this experiment, bumpiness was defined as the bumpy appearance of an object’s surface, and each observer was asked to rate the bumpiness based on the subjectivity of each observer. In Section 2.1, the experimental method of the subjective evaluation experiment and in Section 2.2, the results of the analysis of the experimental results, are described.

2.1

Experimental Method

The images used in the experiment are presented in Figure A1 of the Appendix. Fifty images of size 256×256 [pixels] were created for the visual stimuli used in the experiment by cropping only the object surface in the royalty-free image. The images used were specifically selected for the impression of bumpiness of the object surface, which remained unchanged, regardless of which part of the object surface in the image was cropped out. The viewing distance was set to 100 cm, and the subjects were asked to view the natural gray color for 3 min to adjust to it before being asked to rate the bumpiness of the image on a display on a 5-point scale. The evaluation was conducted under a fluorescent light equivalent to D50 to reproduce a natural viewing environment. A low evaluation value for the sense of bumpiness indicated that the object in the image displayed little bumpiness. By contrast, a high evaluation value for bumpiness indicated that there were many bumpy objects in the image. Nine students in their 20s (average age 22.7 ± 0.67) were tested twice, and the average score was used as the evaluation value for the bumpiness in the image. The display used in this experiment was an EIZO ColorEdge CG277, and the viewing angle on one side of the image was

3.4 2^{\circ}

. The maximum luminance was 260 [cd∕m2].

2.2

Result

The experiments were conducted for an average of 61 min with rest at arbitrary times. Figure 2 shows the average score for each test image for the results obtained in the experiments of Section 2.1. Error bars in Fig. 2(a) and 2(b) indicate within-subject and between-subject deviations, respectively. In the figure, the mean scores are sorted in ascending order. The averages of within-subject and between-subject deviations were 0.42 and 0.55, respectively, confirming stable evaluation of the subjects. The maximum standard deviation (SD) was 1.16, and no outliers were found in the measured data when the residuals were defined as outliers with values greater than 3SD. The standard deviation was large for texture images of rock surfaces and walls made of bricks and stones. This may be attributed to the fact that the texture’s bumpiness varied from place to place and was evaluated differently depending on the region of interest. On the other hand, the SD of the evaluation values was small for images with a weak perception of bumpiness, such as images of paper, metal, and smooth-surfaced boards, independent of the region of interest.

Figure 2.

Scores of bumpiness for 50 images obtained from the experiment in Section 2.1.

The results obtained in this experiment showed that the amplitude and size of the bumps had a significant impact on the perceived bumpiness, and the perceived bumpiness was not affected by the number of bumps. There was also a tendency for sharper images to be rated higher in terms of bumpiness when the amplitude, size, and randomness of the bumps were of similar magnitude, and lower when the bumps varied smoothly.

First, a linear regression model was used to determine whether the statistics used in Ref. [9] for gloss perception could be used to explain both bumpiness and gloss. The image statistics obtained from luminance histograms consisted of the mean (Mean), standard deviation (Std), skewness (Skew), kurtosis (Kurt), contrast (Cont), and top (Top). Contrast was defined as the SD divided by the mean of the luminance histogram, and Top was defined as the average luminance value within the top 10 [percentile]. A linear regression model was constructed using the explanatory variables, that is, the statistics of the 50 texture images used in the subjective evaluation experiment described in Section 2.1, obtaining a multiple correlation of R = 0.757 and adjusted coefficient of determination R2 = 0.514, which were not sufficiently accurate. Based on these results, it was difficult to use linear regression on the image statistics to adequately explain the perceived amount of bumpiness.

Hayashi et al. showed that the visual perception of bumpy shape is related to spatial frequency [18]. In Section 2.1, in the bumpiness evaluation experiments, 22 images with evaluation values ranging from 1 to 2.5 were classified as “low bumpiness”; 13 images with evaluation values ranging from 2.5 to 3.5 were classified as “middle bumpiness”; and 15 images with evaluation values ranging from 3.5 to 5 were classified as “high bumpiness.” Some of the classified images are displayed in Figure 3. The results of averaging the power spectrum in the spatial frequency domain for each group of the three classified images are shown in Figure 4, where differences are observed in the mean values of the power spectrum in the low- and medium-frequency bands. Based on this result, it was concluded that the perception of bumpiness was related to spatial frequency, and a regression model was constructed using the statistics obtained from the multiscale images as explanatory variables. The procedure for obtaining a multiscale image is illustrated in Figure 5. Cyclic bandpass filters with a width of 20 cycles per image-width (cpi) were applied to the power spectrum obtained by applying Fourier transform to the original image. However, because the components with 0–5 cpi mainly represent the brightness of the entire image, they were treated as a single band. Eight images were obtained using the bandpass filter. The statistics obtained from these images were standardized and used as explanatory variables. The results obtained using this model had a multiple correlation of R = 0.997 and an adjusted coefficient of determination R2 = 0.876, confirming a significant improvement in prediction accuracy compared to the model constructed using image statistics obtained from the original images. However, as shown in Table I, the results did not meet the 5% significance level because of the low t-value and high p-value.

Figure 3.

Examples of images classified into three groups.

Figure 4.

Results of averaging the power spectrum in the spatial frequency domain for each of the three groups of images classified according to bumpiness.

Table I.

Coefficients, t-values, and p-values resulting from regression modeling using image statistics obtained from multiscale images. The number after the component statistics represents the pass range of the bandpass filter.

Component	Coefficient	t-value	p-value	Component	Coefficient	t-value	p-value
Intercept	1.10E-13	0	1
Mean 0–5	−4.555	−2.096	0.171	Kurt 0–5	−0.2964	−1.318	0.318
Mean 5–25	3.296	1.056	0.402	Kurt 5–25	0.6732	0.352	0.759
Mean 25–45	−10.1	−0.505	0.664	Kurt 25–45	−0.5872	−0.334	0.77
Mean 45–65	13.02	0.625	0.596	Kurt 45–65	3.504	0.535	0.646
Mean 65–85	26.37	1.074	0.395	Kurt 65–85	−1.899	−0.317	0.781
Mean 85–105	−56.59	−2.407	0.138	Kurt 85–105	−2.08	−0.811	0.503
Mean 105–125	39.93	1.723	0.227	Kurt 105–125	−2.177	−1.037	0.409
Mean 125–	NA	NA	NA	Kurt 125–	−0.9277	−0.895	0.465
Std 0–5	−2.102	−1.694	0.232	Cont 0–5	0.9382	0.843	0.488
Std 5–25	−28.98	−1.658	0.239	Cont 5–25	0.7309	1.115	0.381
Std 25–45	2.237	0.03	0.979	Cont 25–45	−0.2585	−0.136	0.904
Std 45–565	−6.51	−0.06	0.957	Cont 45–65	−1.919	−0.447	0.699
Std 65–85	−74.82	−1.763	0.22	Cont 65–85	7.639	1.384	0.301
Std 85–105	216.4	2.667	0.117	Cont 85–105	−9.701	−2.502	0.129
Std 105–125	−465.6	−0.925	0.453	Cont 105–125	20.81	1.787	0.216
Std 125–	345.7	0.836	0.491	Cont 125–	−16.4	−1.517	0.269
Skew 0–5	−0.08513	−0.471	0.684	Top 0–5	4.338	2.177	0.161
Skew 5–25	−1.498	−1.172	0.362	Top 5–25	26.25	1.553	0.261
Skew 25–45	0.3386	0.168	0.882	Top 25–45	9.039	0.162	0.886
Skew 45–65	−0.8833	−0.287	0.801	Top 45–65	−9.148	−0.094	0.934
Skew 65–85	0.7325	0.176	0.877	Top 65–85	51.54	1.102	0.385
Skew 85–105	0.6179	0.4	0.728	Top 85–105	−161.1	−2.028	0.18
Skew 105–125	2.067	1.468	0.28	Top 105–125	388.2	0.743	0.535
Skew 125–	0.569	0.54	0.643	Top 125–	−308.5	−0.677	0.568

Figure 5.

Procedure for obtaining a multiscale image. The multiscale image was visualized after amplifying the components by a factor of 5, excluding the 0–5 cpi range.

A regression model was constructed for each image statistic to reduce the number of explanatory variables. The results of multiple correlations R and adjusted coefficient of determination R2 calculated from luminance statistics are shown in Table II. Higher accuracy was obtained from the regression model applied to the Mean, Std, and Top than those to the other statistics, namely, Skew, Kurt, and Cont, indicating that Skew and Kurt, in particular, did not affect the perception of bumpiness. As an example, the coefficients, t-values, and p-values of the linear regression model applied to the Mean are shown in Table III for different cpi ranges. From the t-values and p-values of the results in Table III, it is observed that the influence of the components within the 5–85 cpi range is large, and in particular, based on the t-values and p-values of the Mean, Std, and Top, it is concluded that the influence of the components within the 5–65 cpi range is large. This result indicates that the components in the low- and medium-frequency ranges influence the perception of bumpiness, as observed in the difference in the mean of the power spectrum for the three groups in Fig. 4. These cues are consistent with the results of Ho et al. [16, 17] for mean pixel value. However, the contrasts did not agree. This might be due to the difference between Ho et al.’s method of determining contrast and the method [9] used in this study.

Table II.

Multiple correlation R and adjusted coefficient of determination R2 when a regression model was constructed for each image statistic.

Statistic	R	R2
Mean	0.877	0.706
Std	0.854	0.676
Skew	0.328	0.067
Kurt	0.35	0.049
Cont	0.746	0.471
Top	0.859	0.687

Table III.

Coefficients, t-values, and p-values of the linear model applied to the Mean.

Frequence (cpi)	Coefficient	t-value	p-value
Intercept	1.021	2.391	0.0214
0–5	0.0029	1.275	0.309
5–25	0.278	4.715	2.67E-05
25–45	−0.449	−1.9	0.0644
45–65	1.465	3.095	0.0035
65–85	−1.93	−3.234	0.00238
85–105	0.959	1.36	0.181
105–125	0	65535	NA
125–	−0.118	−0.254	NA

Bumpy Appearance Editing Method

Based on the results of the analysis of bumpiness perception in Section 2.2, bumpiness perception can be modified by modulating the power spectrum in the low- and middle-frequency bands. The bumpy appearance-editing method of Figure 6 was proposed for this purpose.

Figure 6.

Overview of the proposed editing method.

In processing color images, the proposed method first converts the RGB image into the YCbCr color space, and then decomposes it into the Y component (luminance) and CbCr component (color difference) as follows:

(1)

\begin{matrix} (\begin{matrix} Y \\ C b \\ C r \end{matrix}) = (\begin{matrix} 0.299 & 0.587 & 0.114 \\ - 0.16874 & - 0.33126 & 0.500 \\ 0.500 & - 0.41869 & - 0.08131 \end{matrix}) \\ (\begin{matrix} R \\ G \\ B \end{matrix}) + (\begin{matrix} 0 \\ 128 \\ 128 \end{matrix}) . \end{matrix}

The Y -component is then processed using Fourier transform to obtain the power spectrum, which modulates the intensity in the frequency range of 5–65 cpi based on the results of the regression model constructed in Section 2.2. To modulate the Mean, Std, and Top, the intensity of the power spectrum was amplified to emphasize, and subsequently attenuated to suppress bumpiness. The modulated power spectrum was then processed using inverse Fourier transform to obtain the Y component, which was then combined with CbCr and color difference components to obtain the color image with edited bumpiness. In this study, two methods of modulating the power spectrum were examined. The first method modulated the intensity uniformly within the applicable range of the power spectrum (Method 1) according to Eq. (2), and the degree of modulation was controlled by the scale factor a.

(2)

L^{'} (a, f) = \{\begin{matrix} a \cdot L & 5 \leq frequency \leq 65 [cpi] \\ L & otherwise, \end{matrix}

where L is the intensity of the power spectrum before modulation; L′ is the intensity of the power spectrum after modulation; and f is the spatial frequency. Bumpiness is emphasized when the scale factor a is a > 1, whereas it is suppressed when a < 1.

The second modulation method considered contrast sensitivity (Method 2). As shown in Figure 7, the peak of the contrast sensitivity curve overlapped the range of the average difference in the power spectrum intensity. Thus, contrast sensitivity also had an effect on the perception of bumpiness and modulated the intensity of the power spectrum along the contrast sensitivity curve. After modulation, the luminance of the entire image was adjusted to produce an image with modulated bumpiness perception. The contrast sensitivity curve is expressed by Eq. (3) [19],

(3)

W (f) = \frac{(K + a \cdot f^{c}) \cdot e^{- b f}}{K}

where f represents the spatial frequency, and the constants a, b, c, and K are a = 75, b = 0.2, c = 0.9, and K = 46. To align the parameters of the scale factor a of Method 1 to the scale of the modulation equation, Method 2 was designed as shown in Eq. (4).

(4)

L^{'} (b, f) = \{\begin{matrix} \frac{W (f) - 1}{W_{max} - 1} \cdot (b - 1) + 1 & 5 \leq f \leq 65 [c p i] \\ L & otherwise. \end{matrix}

where L and L′ are the intensities of the power spectrum before and after modulation; Wmax is the maximum value of the contrast sensitivity curve; b is the scale factor; and f is the spatial frequency. The degree of modulation of bumpiness was controlled using scale factor b. The bumpiness was enhanced when b > 1 or suppressed when b < 1.

Figure 7.

Average of power spectrum and contrast sensitivity curve.

The results of modulation using Eq. (4) are shown in Figure 8. The luminance of the entire image was too high when bumpiness was suppressed and vice versa. To avoid this problem, the luminance components were adjusted to match the average luminance of the edited image to that of the original image using Eq. (5).

(5)

V_{i, j}^{^{'}} = V_{i, j} + (\frac{1}{N} \sum V_{s r c_{i, j}} - \frac{1}{N} \sum V_{i, j}),

where V i, j′ is the luminance value of each pixel after luminance adjustment; V i, j is the luminance value of each pixel after editing the bumpiness using Eq. (4); V srci, j is the luminance value of each pixel in the original image; and N is the number of pixels in the image. The results of adjusting the luminance component according to Eq. (5) are shown in Figure 9. The image generated after adjusting the luminance component according to Eq. (5) constitutes the output of Method 2.

Figure 8.

Results of editing the bumpiness according to Eq. (4). The results are shown for scale factor b = 0.5 and b = 1.5.

Figure 9.

Results of adjusting the luminance component according to Eq. (5).

Experimental Results

In this section, the verification of the effectiveness of the editing method through subjective evaluation is discussed. The images used in the subjective evaluation experiment are presented in Fig. A1. The results of editing using the band-sifting algorithm [14] are also used for comparison.

Figures 10–12 show examples of the results of editing bumpiness using Methods 1 and 2. In all three figures, (a) and (b) display the results suppressed by Method 1; (d) and (e) display the results enhanced by Method 1; (f) and (g) display the results suppressed by Method 2; and (h) and (i) display the results enhanced by Method 2; (j) displays the evaluated values of bumpiness from (a) to (i), and (k) displays the value of unnaturalness evaluated on the images of (a) to (i), and the error bars indicate standard errors. In this experiment, to facilitate the interpretation of bumpiness, the scale factors a and b shown in Eqs. (1) and (3) were set to 1.25 and 1.50, respectively, to enhance the bumpiness and to 0.75 and 0.50, respectively, to suppress it. The subjects were seven students in their 20s, and the experiment was conducted in the same environment used for the subjective evaluation experiment described in Section 2.1. The subjects were asked to evaluate two items: “bumpiness” and “unnaturalness”. Here, unnaturalness is defined as an appearance contrary to nature that does not exist in reality. Subjects were asked to conduct the experiment twice for each item and rate bumpiness and unnaturalness on a 5-point scale, whereby the average of the scores from all subjects was used for the evaluation of bumpiness and unnaturalness of the image. The results of all images and script used in this experiment can be found at the URL shown in the appendix.

Figure 10.

Results of editing bumpiness of the image on wrinkled paper.

Figure 11.

Results of editing bumpiness on a rough wood surface image.

Figure 12.

Results of editing bumpiness on a glossy stone surface image.

As shown in Figs. 10–12, most images used in the experiment demonstrate that the scale factor of Methods 1 and 2 control the sense of bumpiness. In addition, in all edited images, the bumpiness could be edited within an acceptable range, as the unnaturalness rating was never higher than 4. As seen in Figs. 11(k) and 12(k), there are no differences in the unnaturalness results of Methods 1 and 2 applied to bumpiness suppression; however, in the bumpiness enhancement process, there is a large increase in the evaluated value of unnaturalness in Method 2.

Figure 13 shows the results of applying the proposed method to an original image with low bumpiness evaluation value. Methods 1 and 2 control the perception of bumpiness by amplifying components that are considered to affect the perception of bumpiness. Therefore, the proposed method cannot amplify the bumpiness perception of the original image. Thus, an image with original low bumpiness is still perceived as such. The power spectrum of the original image in Figure 14 has strong intensity in the low-frequency band and weak intensity in the middle- and high-frequency bands. It can be confirmed that the suppression process of Method 1 does not work well for such images, and that the suppression process of Method 2 is more effective than that of Method 1. As the peak of the contrast sensitivity curve overlaps the peak of the frequency band with high intensity in the power spectrum, the suppression of bumpiness in the entire image can be effectively suppressed by emphasizing the degree of suppression along the contrast sensitivity rather than by changing the power spectrum uniformly.

Figure 13.

Results of applying the proposed methods to an image with low evaluation value of bumpiness.

Figure 14.

Results of editing the bumpiness on a cracked rock surface image.

Several test images were used to compare the proposed method with band-sifting. Figure 15 shows an example of the application of the proposed method and band-sifting. Bumpiness is emphasized by amplifying the high-frequency high-amplitude negative-sign (HHN) component and the high-frequency low-amplitude negative-sign (HLN) component. Amplifying the HHN and HLN components emphasizes the shadows on the surface of the object and the bumpiness of the surface. As a result, there is no significant difference between Fig. 15(b) edited by Method 1 and Fig. 15(c) edited by band-sifting in terms of the perceived bumpiness and unnaturalness. However, as shown in Figure 16(a), amplifying the HHN and HLN components did not improve the perceived bumpiness of an image formed by the reflection of light (Fig. 16(b)). To enhance the bumpiness of the original image in Fig. 16(a), it is necessary to amplify the HHP and HLP components. The results of editing bumpiness using band-sifting based on the amplification of the HHN and HLN components are not consistent. However, the proposed method can consistently edit bumpiness even when the bumpiness is attributed to shading or light reflection.

Figure 15.

Comparison of proposed method and the band-sifting algorithm.

Figure 16.

Example where amplification of HHN and HLN components does not emphasize bumpiness.

Limitations

In this section, the limitations of the proposed method are discussed. As shown in Fig. 13, the proposed method cannot enhance the bumpiness of an image with a low degree of bumpiness because both Methods 1 and 2 enhance the bumpiness by amplifying the intensity in the range of 5–65 cpi of the power spectrum. Therefore, the proposed method cannot sufficiently enhance the images with low bumpiness. For images with low bumpiness, it is necessary to use the power spectrum. Adding bumpiness would require considering the shape and other factors, making the process difficult.

Figure 17 shows the results of applying the proposed method to an image with large amplitude and size of bumps and low randomness. In this case, the suppression process is effective because the evaluated value of the sense of bumpiness is lower than that of the original image, thus suppressing the sense of bumpiness. However, when bumpiness is enhanced, there is no significant change in the evaluated value of bumpiness compared to the original image. It is also observed that the enhancement process in Method 2 significantly increases the unnaturalness of the image. For an image with a structure, such as that shown in Fig. 17, the suppression process works effectively, but the enhancement process cannot increase the sense of bumpiness. In addition, the enhancement process may increase the unnaturalness, which is a drawback of the proposed method.

Figure 17.

Results of applying the proposed methods to images with large amplitude and size of bumps and low randomness in the image.

In this study, a method for analyzing and editing bumpiness perception is proposed using texture images created by cropping only the surface of an object. Therefore, the proposed method did not perform well on natural images. The results of applying the proposed method to natural images are shown in Figure 18. Looking at the area of the stone wall, it appears that the bumpiness perception was edited according to the scale factor. However, the suppression process caused background buildings and birds to appear blurry, and the area of the sea, which does not have a bumpy shape in Method 1, was edited unnaturally. In addition, the enhancement process in Method 1 also unnaturally edited the sea area with no irregularity. These results indicate that the proposed method is not effective on natural images. This may be due to the fact that the analysis in this study was conducted on texture images, and that the range of statistics and spatial frequencies that have a large impact on the perception of bumpiness in natural images may be different from those in texture images, and also the proposed method was applied to the entire image.

Figure 18.

Results of applying the proposed method to a natural image.

Conclusions

In this study, focusing on the bumpiness of object surfaces in images, a method was proposed to edit the bumpiness of these surfaces. First, to analyze the factors influencing the perception of bumpiness, the subjects were asked to evaluate the bumpiness of 50 texture images. The bumpiness perception obtained from the experimental results of the subjective evaluation experiment, as well as the image statistics obtained from the multiscale images, were analyzed and evaluated, and a regression model was constructed using Mean, Std, and Top. The model showed high accuracy, demonstrating that these statistics had an influence on bumpiness perception. In addition, as a common result, the frequency band from 5–65 cpi had a large influence on the perception of bumpiness. Based on these results, we proposed a method to edit the sense of bumpiness.

In color images, the proposed method first transforms the RGB color space to YCbCr color space. Two modulation methods were proposed for the Fourier transform of the Y component, which is the luminance component, to modulate the Mean, Std, and Top. The first method modulates the intensity of the power spectrum uniformly in the frequency band of 5–65 cpi. The second method modulates the intensity of the power spectrum by varying the degree of modulation along the contrast sensitivity curve. Upon applying inverse Fourier transform to the modulated Y-component back to the RGB color space using the color difference component CbCr, an image is generated with edited bumpiness. Subjective evaluation experiments were conducted on the images to which the proposed method was applied to evaluate the bumpiness and unnaturalness of the images after bumpiness-editing. The effectiveness of the proposed method was confirmed through visual evaluation of many images.

Two modulation methods were designed for the proposed method. In Method 1, both the enhancement and suppression processes edit the bumpiness without significantly increasing the unnaturalness. However, suppression does not work effectively for images wherein the intensity is high in the low-frequency band of the power spectrum and low in the middle- and high-frequency bands. In contrast, in Method 2, the suppression process works effectively for such images. However, Method 2 has the disadvantage of being prone to artifacts.

The proposed method may not be effective for images with low bumpiness or images with structures such as Fig. 17. In addition, when applied to natural images, unnatural images can be generated. This is because the proposed method was designed to analyze texture images, and the results of the analysis reflect this. Because the proposed method is applied to the entire image, it is necessary to make it applicable only to areas where objects and textures exist. Furthermore, the study of roughness is not limited to 2D images, but has been applied to 2.5D prints [20]. According to Kadyrova et al. the attributes that observers look for when evaluating the naturalness of 2.5D prints are color, roughness, gloss, elevation, and lightness. Further development of our research to achieve more natural 2.5D and 3D prints is also a future challenge.

Acknowledgment

This work was supported by JSPS KAKENHI (Grant Numbers 20H05957).

Appendix A.

The 50 texture images used in Section 2.1 are shown in Fig. A1. The script used in Section 4 and the bumpiness and unnaturalness scores obtained from all images can be found at the following: https://github.com/YManabe-ChibaUniv/BumpinessEditing.git.

Figure A1.

50 texture images used in the experiment in Section 2.1.

References

1FlemingR. W.NishidaS.GegenfurtnerK. R.2015Perception of material properties (Part I)Vis. Res.109123236123–23610.1016/j.visres.2015.01.014

2FlemingR. W.NishidaS.GegenfurtnerK. R.2015Perception of material properties (Part II)Vis. Res.115157302157–30210.1016/j.visres.2015.08.006

3StorrsK. R.AndersonB. L.FlemingR. W.2021Unsupervised learning predicts human perception and misperception of glossNature Hum. Behav.5140214171402–1710.1038/s41562-021-01097-6

4DelanoyJ.LagunasM.CondorJ.GutierrezD.MasiaB.2022A generative framework for image-based editing of material appearance using perceptual attributesComput. Graph. Forum41453464453–6410.1111/cgf.14446

5ManabeY.TanakaM.HoriuchiT.2021Glossy appearance editing for heterogeneous material objectsJ. Imaging Sci. Technol.6560406-160406-1460406-1–60406-1410.2352/J.ImagingSci.Technol.2021.65.6.060406

6MotoyoshiI.NishidaS.SharanL.AdelsonE. H.2007Image statistics and the perception of surface qualitiesNature447207209207–910.1038/nature05724

7MarlowP. J.KimJ.AndersonB. L.2012The perception and misperception of specular surface reflectanceCurr. Biol.22R865R866R865–610.1016/j.cub.2012.08.009

8MarlowP. J.AndersonB. L.2013Generative constraints on image cues for perceived glossJ. Vis.131231–2310.1167/13.14.2

9ToscaniM.GuarneraD.GuarneraC.HeardebergJ. Y.GegenfurtnerK.2020Three perceptual dimensions for specular and diffuse reflectionACM Trans. Appl. Perception171261–2610.1145/3380741

10GigilashviliD.ShiW.WangZ.PedersenM.2021The role of subsurface scattering in glossiness perceptionACM Trans. Appl. Perception181261–2610.1145/3458438

11ITU-T T.81, “Information technology–Digital compression and coding of continuous-tone still images–Requirements and guidelines” (ITU-T Publications, Geneva)

12TanakaM.TakanashiT.HoriuchiT.2020Glossiness-aware image coding in JPEG frameworkJ. Imaging Sci. Technol.64050409-1050409-15050409-1–050409-1510.2352/J.ImagingSci.Technol.2020.64.5.050409

13TanakaM.HoriuchiT.2015Investigating perceptual qualities of static surface appearance using real materials and displayed imagesVis. Res.115246258246–5810.1016/j.visres.2014.11.016

14TanakaM.AraiR.HoriuchiT.2017PuRet: Material appearance enhancement considering pupil and retina behaviorsJ. Imaging Sci. Technol.61040401-1040401-8040401-1–810.2352/J.ImagingSci.Technol.2017.61.4.040401

15BoyadzheievI.BalaK.ParisS.AdelsonE.2015Band-sifting decomposition for image based material editingACM Trans. Graph.341161–1610.1145/2809796

16HoX.MaloneyL. T.LandyM. S.2006How direction of illumination affects visually perceived surface roughnessJ. Vis.6634648634–4810.1167/6.5.8

17HoX.MaloneyL. T.LandyM. S.2007The effect of viewpoint on perceived visual roughnessJ. Vis.71161–1610.1167/7.1.1

18HayashiH.UraM.MiyataK.2016The Influence of human texture recognition and the spatial frequency of texturesITE Technical Report40131613–6in Japanese

19ISO 15739, “Photography—Electronic still-picture imaging—Noise measurements”, 2013

20KadyrovaA.PedersenM.WestlandS.2022Effect of elevation and surface roughness on naturalness perception of 2.5D decor printsMaterials151151–1510.3390/ma15093372