OPT OpenIR  > 光谱成像技术研究室
Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation
Zhang, Luming1; Gao, Yue2; Zimmermann, Roger1; Tian, Qi3; Li, Xuelong4
作者部门光学影像学习与分析中心
2014-03-01
发表期刊IEEE TRANSACTIONS ON IMAGE PROCESSING
ISSN1070-986X
卷号23期号:3
摘要Photo aesthetic quality evaluation is a fundamental yet under addressed task in computer vision and image processing fields. Conventional approaches are frustrated by the following two drawbacks. First, both the local and global spatial arrangements of image regions play an important role in photo aesthetics. However, existing rules, e. g., visual balance, heuristically define which spatial distribution among the salient regions of a photo is aesthetically pleasing. Second, it is difficult to adjust visual cues from multiple channels automatically in photo aesthetics assessment. To solve these problems, we propose a new photo aesthetics evaluation framework, focusing on learning the image descriptors that characterize local and global structural aesthetics from multiple visual channels. In particular, to describe the spatial structure of the image local regions, we construct graphlets small-sized connected graphs by connecting spatially adjacent atomic regions. Since spatially adjacent graphlets distribute closely in their feature space, we project them onto a manifold and subsequently propose an embedding algorithm. The embedding algorithm encodes the photo global spatial layout into graphlets. Simultaneously, the importance of graphlets from multiple visual channels are dynamically adjusted. Finally, these post-embedding graphlets are integrated for photo aesthetics evaluation using a probabilistic model. Experimental results show that: 1) the visualized graphlets explicitly capture the aesthetically arranged atomic regions; 2) the proposed approach generalizes and improves four prominent aesthetic rules; and 3) our approach significantly outperforms state-of-the-art algorithms in photo aesthetics prediction.
文章类型Article
关键词Multi-channel Structural Cues Aesthetic Evaluation Probabilistic Model
WOS标题词Science & Technology ; Technology
DOI10.1109/TIP.2014.2303650
收录类别SCI ; EI
关键词[WOS]IMAGE CLASSIFICATION ; MANIFOLD ; SEARCH
语种英语
WOS研究方向Computer Science ; Engineering
WOS类目Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS记录号WOS:000335390100005
引用统计
被引频次:135[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.opt.ac.cn/handle/181661/22377
专题光谱成像技术研究室
作者单位1.Natl Univ Singapore, Sch Comp, Singapore 119613, Singapore
2.Tsinghua Univ, Beijing 100086, Peoples R China
3.Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
4.Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning, Xian 710119, Peoples R China
推荐引用方式
GB/T 7714
Zhang, Luming,Gao, Yue,Zimmermann, Roger,et al. Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2014,23(3).
APA Zhang, Luming,Gao, Yue,Zimmermann, Roger,Tian, Qi,&Li, Xuelong.(2014).Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation.IEEE TRANSACTIONS ON IMAGE PROCESSING,23(3).
MLA Zhang, Luming,et al."Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation".IEEE TRANSACTIONS ON IMAGE PROCESSING 23.3(2014).
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Fusion of Multichann(2522KB)期刊论文出版稿限制开放CC BY请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhang, Luming]的文章
[Gao, Yue]的文章
[Zimmermann, Roger]的文章
百度学术
百度学术中相似的文章
[Zhang, Luming]的文章
[Gao, Yue]的文章
[Zimmermann, Roger]的文章
必应学术
必应学术中相似的文章
[Zhang, Luming]的文章
[Gao, Yue]的文章
[Zimmermann, Roger]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。