Fine-Grained Visual Categorization by Localizing Object Parts With Single Image

doi:10.1109/TMM.2020.2993960


Authors	Zheng, Xiangtao (1); Qi, Lei (1); Ren, Yutao (2); Lu, Xiaoqiang (1)
Author department	Spectral Imaging Technology Laboratory
Year	2021
Journal	IEEE TRANSACTIONS ON MULTIMEDIA
ISSN	1520-9210;1941-0077
Volume	23; pages 1187-1199
Ownership rank	1
Abstract	Fine-grained visual categorization (FGVC) refers to assigning fine-grained labels to images that belong to the same base category. Because of the high inter-class similarity, it is challenging to distinguish fine-grained images of different subcategories. Recently, researchers have proposed first localizing key object parts within images and then finding discriminative clues on those parts. To localize object parts, existing methods train detectors for different kinds of object parts. However, because the same kind of object part often varies greatly in appearance across images, existing methods have two shortcomings: 1) training part detectors for object parts with diverse appearance is laborious; 2) discriminative parts with unusual appearance may be missed by the trained part detectors. To localize the key object parts efficiently and accurately, this paper proposes a novel FGVC method. Its main novelty is that it localizes the key object parts within each image using only that single image, and hence avoids the influence of part diversity across images. The proposed method consists of two key steps. First, it localizes the key parts in each image independently: potential object parts in each image are identified and then merged to generate the final representative object parts. Second, two kinds of features are extracted to describe simultaneously the discriminative clues within each part and the relationships between object parts. In addition, a part-based dropout learning technique is adopted to further boost classification performance. The proposed method is evaluated in comparison experiments, and the results show that it achieves performance comparable to or better than state-of-the-art methods.
Keywords	Feature extraction; Detectors; Training; Image representation; Visualization; Semantics; Birds; Fine-grained visual categorization; Part localization; Part relationship; Spectral clustering; Dropout learning
DOI	10.1109/TMM.2020.2993960
Indexed by	SCI
Language	English
WOS accession number	WOS:000645068200003
Publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Citation statistics	Times cited: 18 [WOS]
Document type	Journal article
Item identifier	http://ir.opt.ac.cn/handle/181661/94750
Collection	Spectral Imaging Technology Laboratory
Corresponding author	Lu, Xiaoqiang
Author affiliations	1. Chinese Acad Sci, Key Lab Spectral Imaging Technol CAS, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China; 2. Wuhan Univ Technol, Wuhan 430070, Peoples R China
Recommended citation GB/T 7714	Zheng, Xiangtao, Qi, Lei, Ren, Yutao, et al. Fine-Grained Visual Categorization by Localizing Object Parts With Single Image[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23: 1187-1199.
APA	Zheng, Xiangtao, Qi, Lei, Ren, Yutao, & Lu, Xiaoqiang. (2021). Fine-Grained Visual Categorization by Localizing Object Parts With Single Image. IEEE TRANSACTIONS ON MULTIMEDIA, 23, 1187-1199.
MLA	Zheng, Xiangtao, et al. "Fine-Grained Visual Categorization by Localizing Object Parts With Single Image." IEEE TRANSACTIONS ON MULTIMEDIA 23 (2021): 1187-1199.
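
The abstract mentions a part-based dropout learning technique but gives no details of it. The sketch below illustrates only the general idea under stated assumptions: during training, the feature vector of each localized part is zeroed out as a whole with some probability, so a classifier cannot rely on any single part. The function name `part_dropout`, the `drop_prob` parameter, and the rule of always keeping at least one part are hypothetical choices for this illustration and are not taken from the paper.

```python
import random
from typing import List, Optional, Sequence


def part_dropout(
    part_features: Sequence[Sequence[float]],
    drop_prob: float = 0.5,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Zero out whole parts at random, then concatenate into one descriptor.

    Hypothetical helper: the abstract does not specify how its part-based
    dropout works, so this only illustrates the general idea.
    """
    if not part_features:
        raise ValueError("need at least one part")
    if not 0.0 <= drop_prob < 1.0:
        raise ValueError("drop_prob must be in [0, 1)")
    rng = rng or random.Random()

    # Decide independently for each part whether it survives this pass.
    keep = [rng.random() >= drop_prob for _ in part_features]
    if not any(keep):
        # Assumption: keep at least one part so the descriptor is never all zeros.
        keep[rng.randrange(len(keep))] = True

    # Concatenate per-part features; dropped parts contribute zeros of equal length.
    descriptor: List[float] = []
    for kept, feats in zip(keep, part_features):
        descriptor.extend(float(x) if kept else 0.0 for x in feats)
    return descriptor
```

Calling `part_dropout([[1.0, 2.0], [3.0, 4.0]], drop_prob=0.0)` returns the plain concatenation of the part features, which is how such a function would typically be used at test time.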

Files in this item
File name/size	Document type	Version type	Access type	License
Fine-Grained Visual (3601KB)	Journal article	Published version	Restricted access	CC BY-NC-SA