In this paper we investigate the use of a deep Convolutional Neural Network (CNN) to predict image aesthetics. To this end we finetune a canonical CNN architecture, originally trained to classify objects and scenes, by casting the image aesthetic prediction as a regression problem. We also investigate whether image aesthetic is a global or local attribute, and the role played by bottom-up and top-down salient regions to the prediction of the global image aesthetic. Experimental results on the canonical Aesthetic Visual Analysis (AVA) dataset show the robustness of the solution proposed, which outperforms the best solution in the state of the art by almost 17% in terms of Mean Residual Sum of Squares Error (MRSSE)
Bianco, S., Celona, L., Napoletano, P., Schettini, R. (2016). Predicting image aesthetics with deep learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp.117-125). Springer Verlag [10.1007/978-3-319-48680-2_11].
Predicting image aesthetics with deep learning
BIANCO, SIMONEPrimo
;CELONA, LUIGISecondo
;NAPOLETANO, PAOLO
;SCHETTINI, RAIMONDOUltimo
2016
Abstract
In this paper we investigate the use of a deep Convolutional Neural Network (CNN) to predict image aesthetics. To this end we finetune a canonical CNN architecture, originally trained to classify objects and scenes, by casting the image aesthetic prediction as a regression problem. We also investigate whether image aesthetic is a global or local attribute, and the role played by bottom-up and top-down salient regions to the prediction of the global image aesthetic. Experimental results on the canonical Aesthetic Visual Analysis (AVA) dataset show the robustness of the solution proposed, which outperforms the best solution in the state of the art by almost 17% in terms of Mean Residual Sum of Squares Error (MRSSE)I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.