Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation | |
Zhang, Luming1; Gao, Yue2; Zimmermann, Roger1; Tian, Qi3; Li, Xuelong4![]() | |
作者部门 | 光学影像学习与分析中心 |
2014-03-01 | |
发表期刊 | IEEE TRANSACTIONS ON IMAGE PROCESSING
![]() |
ISSN | 1070-986X |
卷号 | 23期号:3 |
摘要 | Photo aesthetic quality evaluation is a fundamental yet under addressed task in computer vision and image processing fields. Conventional approaches are frustrated by the following two drawbacks. First, both the local and global spatial arrangements of image regions play an important role in photo aesthetics. However, existing rules, e. g., visual balance, heuristically define which spatial distribution among the salient regions of a photo is aesthetically pleasing. Second, it is difficult to adjust visual cues from multiple channels automatically in photo aesthetics assessment. To solve these problems, we propose a new photo aesthetics evaluation framework, focusing on learning the image descriptors that characterize local and global structural aesthetics from multiple visual channels. In particular, to describe the spatial structure of the image local regions, we construct graphlets small-sized connected graphs by connecting spatially adjacent atomic regions. Since spatially adjacent graphlets distribute closely in their feature space, we project them onto a manifold and subsequently propose an embedding algorithm. The embedding algorithm encodes the photo global spatial layout into graphlets. Simultaneously, the importance of graphlets from multiple visual channels are dynamically adjusted. Finally, these post-embedding graphlets are integrated for photo aesthetics evaluation using a probabilistic model. Experimental results show that: 1) the visualized graphlets explicitly capture the aesthetically arranged atomic regions; 2) the proposed approach generalizes and improves four prominent aesthetic rules; and 3) our approach significantly outperforms state-of-the-art algorithms in photo aesthetics prediction. |
文章类型 | Article |
关键词 | Multi-channel Structural Cues Aesthetic Evaluation Probabilistic Model |
WOS标题词 | Science & Technology ; Technology |
DOI | 10.1109/TIP.2014.2303650 |
收录类别 | SCI ; EI |
关键词[WOS] | IMAGE CLASSIFICATION ; MANIFOLD ; SEARCH |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000335390100005 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.opt.ac.cn/handle/181661/22377 |
专题 | 光谱成像技术研究室 |
作者单位 | 1.Natl Univ Singapore, Sch Comp, Singapore 119613, Singapore 2.Tsinghua Univ, Beijing 100086, Peoples R China 3.Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA 4.Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning, Xian 710119, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Luming,Gao, Yue,Zimmermann, Roger,et al. Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2014,23(3). |
APA | Zhang, Luming,Gao, Yue,Zimmermann, Roger,Tian, Qi,&Li, Xuelong.(2014).Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation.IEEE TRANSACTIONS ON IMAGE PROCESSING,23(3). |
MLA | Zhang, Luming,et al."Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation".IEEE TRANSACTIONS ON IMAGE PROCESSING 23.3(2014). |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Fusion of Multichann(2522KB) | 期刊论文 | 出版稿 | 限制开放 | CC BY | 请求全文 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论