OPT OpenIR  > 光谱成像技术研究室
A semi-supervised cross-modal memory bank for cross-modal retrieval
Huang, Yingying1,2,3; Hu, Bingliang3; Zhang, Yipeng1,2,3; Gao, Chi1,2,3; Wang, Quan1,3
作者部门光谱成像技术研究室
2024-04-28
发表期刊NEUROCOMPUTING
ISSN0925-2312;1872-8286
卷号579
产权排序1
摘要

The core of semi -supervised cross -modal retrieval tasks lies in leveraging limited supervised information to measure the similarity between cross -modal data. Current approaches assume an association between unlabelled data and pre -defined k -nearest neighbour data, relying on classifier performance for this selection. With diminishing labelled data, classifier performance weakens, resulting in erroneous associations among unlabelled instances. Moreover, the lack of interpretability in class probabilities of unlabelled data hinders classifier learning. Thus, this paper focuses on learning pseudo -labels for unlabelled data, providing pseudosupervision to aid classifier learning. Specifically, a cross -modal memory bank is proposed, dynamically storing feature representations in a common space and class probability representations in a label space for each cross -modal data. Pseudo -labels are derived by computing feature representation similarity and adjusting class probabilities. During this process, imposing constraints on the classification loss between labelled data and contrastive losses between paired cross -modal data is a prerequisite for the successful learning of pseudolabels. This procedure significantly contributes to enhancing the credibility of these pseudo -labels. Empirical findings demonstrate that using only 10% labelled data, compared to prevailing semi -supervised techniques, this method achieves improvements of 2.6%, 1.8%, and 4.9% in MAP@50 on the Wikipedia, NUS -WIDE, and MS-COCO datasets, respectively.

关键词Common space Cross-modal memory bank Pseudo-labels Class probability
DOI10.1016/j.neucom.2024.127430
收录类别SCI
语种英语
WOS记录号WOS:001198409500001
出版者ELSEVIER
引用统计
文献类型期刊论文
条目标识符http://ir.opt.ac.cn/handle/181661/97392
专题光谱成像技术研究室
通讯作者Wang, Quan
作者单位1.Chinese Acad Sci, Key Lab Spectral Imaging Technol, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.Key Lab Biomed Spect, Xian 710119, Shaanxi, Peoples R China
推荐引用方式
GB/T 7714
Huang, Yingying,Hu, Bingliang,Zhang, Yipeng,et al. A semi-supervised cross-modal memory bank for cross-modal retrieval[J]. NEUROCOMPUTING,2024,579.
APA Huang, Yingying,Hu, Bingliang,Zhang, Yipeng,Gao, Chi,&Wang, Quan.(2024).A semi-supervised cross-modal memory bank for cross-modal retrieval.NEUROCOMPUTING,579.
MLA Huang, Yingying,et al."A semi-supervised cross-modal memory bank for cross-modal retrieval".NEUROCOMPUTING 579(2024).
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
A semi-supervised cr(3286KB)期刊论文出版稿限制开放CC BY-NC-SA请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Huang, Yingying]的文章
[Hu, Bingliang]的文章
[Zhang, Yipeng]的文章
百度学术
百度学术中相似的文章
[Huang, Yingying]的文章
[Hu, Bingliang]的文章
[Zhang, Yipeng]的文章
必应学术
必应学术中相似的文章
[Huang, Yingying]的文章
[Hu, Bingliang]的文章
[Zhang, Yipeng]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。