Interpretable Aesthetic Assessment and Prompt-Guided Image Retouching

Conference proceedings article


Authors/Editors


Strategic Research Themes


Publication Details

Author listViriyavisuthisakul, S.; Sanguansat, P.; Yamasaki, T.

PublisherInstitute of Electrical and Electronics Engineers Inc.

Publication year2025

Start page566

End page571

Number of pages6

ISBN9798350351422; 9798331594657

ISSN27704327

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-105025025527&doi=10.1109%2FMIPR67560.2025.00096&partnerID=40&md5=ba22998a85e3f8fc47139cd738668aa0

LanguagesEnglish-Great Britain (EN-GB)


View on publisher site


Abstract

Image aesthetic evaluation is subjective and has traditionally relied on human judgment. Vision-Language models, such as CLIP, offer a novel approach by aligning visual features with textual prompts, enabling interpretable assessments. CLIP-based Image Quality Assessment (CLIP-IQA) methods utilize antonym prompt pairs to capture perceptual attributes in images, but the interpretability of these features in relation to human perception remains an open question. Recently, explainable aesthetic evaluation using vision-language models has been proposed. This approach explores the contribution of prompt-based features by combining CLIP-derived embeddings with Light Gradient Boosting Machine (LightGBM) for aesthetic scoring, supported by SHapley Additive exPlanations (SHAP) for feature attribution. A multimodal large language model (MLLM) is used to generate explanations based on these attributions. In this paper, we aim to extend this framework to be a practical application. SHAP values are used to guide prompt construction for image manipulation. Deblurring and super-resolution models are applied to improve the quality of image detail. A diffusion-based generative model synthesizes new images based on prompts, then reevaluates the impact of specific feature changes. This approach bridges interpretability and controllability in aesthetic assessment, offering new insights into the relationship between image attributes and perceived quality. © 2025 IEEE.


Keywords

No matching items found.


Last updated on 2026-14-02 at 00:00