We previously reported the performance of four color difference equations around the CIE 1978 blue color center (NCSU-B1) using various statistical measures [1,2]. In this study we employed the standardized residual sum of squares (STRESS) [3] index to test the performance of twelve
colordifference formulae using two experimental NCSU datasets. The first dataset (NCSU-B1) included 66 sample pairs around the CIE 1978 blue color center and the second dataset (NCSU-2) contained 69 sample pairs around 13 color centers [4]. In the first dataset 26 observers made a total of
5148 assessments of sample pairs with small color differences (ΔE*_{ab}<5) while the second dataset involved 20,700 assessments by 100 observers from four different geographical regions of the world (25 in each region). Each pair in both sets was assessed by each color normal
observer in three separate sittings on separate days and the average of assessments was calculated. For the samples in the first dataset a custom AATCC standard gray scale was employed to assess the magnitude of difference between colored samples. A third-degree polynomial equation was used
to convert gray scale ratings to visual differences (ΔV). In the second study a novel perceptually linear gray scale was developed and a linear function was used to obtain visual differences. Based on the analysis of STRESS index results the DIN99d equation gave the best results for both
datasets, and the CIELAB equation the worst.