
We propose a method for tuning the parameters of a color adjustment Image Signal Processor (ISP) algorithmic “block” using language prompts. This enables the user to impart a particular visual style to the ISP-processed image simply by describing that style in a text prompt. To do this, we first implement the ISP block in a differentiable manner. Then, we define an objective function using an off-the-shelf, pretrained vision-language model (VLM) such that the objective is minimized when the ISP-processed image most closely matches the input language prompt. Finally, we optimize the ISP parameters using gradient descent. Experimental results demonstrate the tuning of ISP parameters with different language prompts and compare the performance of different pretrained VLMs and optimization strategies.
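The three steps above (an ISP block, a prompt-driven objective, gradient descent) can be illustrated with a minimal, dependency-free sketch. Everything in it is an assumption made for illustration and is not the authors' implementation: the ISP block is reduced to three per-channel gains, the VLM is replaced by a toy stand-in (mean RGB as the "image embedding" and a hand-written lookup table as the "text embedding"), and gradients are estimated by central differences rather than by automatic differentiation through a differentiable implementation. All function and variable names are hypothetical.

```python
import math

# Hypothetical stand-in for a VLM text encoder: a fixed 3-D "embedding" per prompt.
TOY_TEXT_EMBEDDINGS = {
    "warm": [1.0, 0.6, 0.3],
    "cool": [0.3, 0.6, 1.0],
}

def isp_color_block(pixels, gains):
    """Toy color-adjustment block: multiply each RGB channel by its gain."""
    return [[p[c] * gains[c] for c in range(3)] for p in pixels]

def toy_image_embedding(pixels):
    """Stand-in for a VLM image encoder: the mean RGB value of the image."""
    n = len(pixels)
    return [sum(p[c] for p in pixels) / n for c in range(3)]

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def objective(gains, pixels, prompt):
    """Loss is smallest when the processed image's embedding aligns with the prompt's."""
    processed = isp_color_block(pixels, gains)
    image_vec = toy_image_embedding(processed)
    text_vec = TOY_TEXT_EMBEDDINGS[prompt]
    return 1.0 - cosine_similarity(image_vec, text_vec)

def numerical_gradient(f, params, eps=1e-5):
    """Central-difference gradient estimate (assumption: used in place of autodiff)."""
    grad = []
    for i in range(len(params)):
        upper = list(params)
        lower = list(params)
        upper[i] += eps
        lower[i] -= eps
        grad.append((f(upper) - f(lower)) / (2 * eps))
    return grad

def tune_isp(pixels, prompt, steps=200, lr=0.5):
    """Plain gradient descent on the ISP gains, starting from the identity setting."""
    gains = [1.0, 1.0, 1.0]
    for _ in range(steps):
        grad = numerical_gradient(lambda g: objective(g, pixels, prompt), gains)
        gains = [g - lr * d for g, d in zip(gains, grad)]
    return gains

# A tiny three-pixel "image" with RGB values in [0, 1].
pixels = [[0.5, 0.5, 0.5], [0.2, 0.4, 0.6], [0.8, 0.6, 0.4]]
tuned = tune_isp(pixels, "warm")
print("tuned gains:", tuned)
print("loss before:", objective([1.0, 1.0, 1.0], pixels, "warm"))
print("loss after: ", objective(tuned, pixels, "warm"))
```

In this toy setting the "warm" prompt pushes the red gain above the green gain and the green gain above the blue gain. With a real pretrained VLM, the two stand-in encoders would be replaced by the model's image and text encoders, and the finite-difference gradient by backpropagation through the differentiable ISP block.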
Owen Mayer, Shohei Noguchi, Alexander Berestov, Jiro Takatori, "Language-based Color ISP Tuning," in Color and Imaging Conference, 2025, pp. 203-208, https://doi.org/10.2352/CIC.2025.33.1.38