OPT OpenIR  > 光谱成像技术研究室
Deep Cross-Modal ImageVoice Retrieval in Remote Sensing
Chen, Yaxiong1,2; Lu, Xiaoqiang1; Wang, Shuai1
作者部门光谱成像技术研究室
2020-10
发表期刊IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
ISSN0196-2892;1558-0644
卷号58期号:10页码:7049-7061
产权排序1
摘要

With the rapid progress of satellite and aircraft technologies, cross-modal remote sensing imagevoice retrieval has been studied in geography recently. However, there still exist some bottlenecks: how to consider the characteristics of remote sensing data adequately and how to reduce the memory and improve the retrieval efficiency in large-scale remote sensing data. In this article, we propose a novel deep cross-modal remote sensing imagevoice retrieval approach, namely, deep imagevoice retrieval (DIVR), to capture more information of remote sensing data to generate hash codes with low memory and fast retrieval properties. Especially, the DIVR approach proposes inception dilated convolution module to capture multiscale contextual information of remote sensing images and voices. Moreover, in order to enhance cross-modal similarity, the deep features similarity term is designed to make paired similar deep features as close as possible and paired dissimilar deep features as mutually far as possible. In addition, the quantization error term is designed to drive hash-like codes to approximate hash codes, which can effectively reduce the quantization error for hash codes learning. Extensive experimental results on three remote sensing imagevoice data sets show that the proposed DIVR approach can outperform other cross-modal retrieval approaches.

关键词Remote sensing Convolution Quantization (signal) Image retrieval Deep learning Convolutional codes Hamming distance Cross-modal remote sensing image-voice retrieval deep features' similarity deep hash codes quantization error
DOI10.1109/TGRS.2020.2979273
收录类别SCI ; EI
语种英语
WOS记录号WOS:000573923100021
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
EI入藏号20204209349066
引用统计
被引频次:55[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.opt.ac.cn/handle/181661/93727
专题光谱成像技术研究室
通讯作者Lu, Xiaoqiang
作者单位1.Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
推荐引用方式
GB/T 7714
Chen, Yaxiong,Lu, Xiaoqiang,Wang, Shuai. Deep Cross-Modal ImageVoice Retrieval in Remote Sensing[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING,2020,58(10):7049-7061.
APA Chen, Yaxiong,Lu, Xiaoqiang,&Wang, Shuai.(2020).Deep Cross-Modal ImageVoice Retrieval in Remote Sensing.IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING,58(10),7049-7061.
MLA Chen, Yaxiong,et al."Deep Cross-Modal ImageVoice Retrieval in Remote Sensing".IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 58.10(2020):7049-7061.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Deep Cross-Modal Ima(3711KB)期刊论文出版稿限制开放CC BY-NC-SA请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Chen, Yaxiong]的文章
[Lu, Xiaoqiang]的文章
[Wang, Shuai]的文章
百度学术
百度学术中相似的文章
[Chen, Yaxiong]的文章
[Lu, Xiaoqiang]的文章
[Wang, Shuai]的文章
必应学术
必应学术中相似的文章
[Chen, Yaxiong]的文章
[Lu, Xiaoqiang]的文章
[Wang, Shuai]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。