Remote Sensing Image Generation From Audio
Zheng, Zhiyuan (1,2); Chen, Jun (1); Zheng, Xiangtao (2); Lu, Xiaoqiang (2)
Author department: Spectral Imaging Technology Laboratory
2021-06
Journal: IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
ISSN: 1545-598X; 1558-0571
Volume: 18, Issue: 6, Pages: 994-998
Property rights ranking: 1
Abstract

Generating images from data in other modalities has attracted much attention in cross-modal studies, since a generated image offers intuitive visual information. Unlike previous works, which generate an image from text, a novel task is introduced: generating an image from audio. However, a semantic gap intrinsically exists in cross-modal data, which disturbs the generated results. To explore the relevance between audio and image, a novel reranking audio-image translation method is proposed. The proposed method: 1) maps the audio and image into a uniform feature space; 2) designs an audio-audio matching network to match the related audio; and 3) adopts an audio-image matching network for every matched audio to generate a related image, and the most frequent image is voted as the final result. Extensive experiments on two remote sensing cross-modal data sets demonstrate that the proposed method can visualize the content of audio.
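
The three steps listed in the abstract can be illustrated with a small retrieval-style sketch. This is not the authors' implementation: the learned audio-audio and audio-image matching networks are replaced here by plain cosine similarity, the embeddings are hand-made toy vectors assumed to already lie in one shared feature space, and the function names and the parameter `k` are hypothetical.

```python
from collections import Counter

import numpy as np


def l2_normalize(x):
    """Scale each row to unit length so dot products equal cosine similarity."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def rerank_audio_to_image(query_audio, gallery_audio, gallery_images, k=3):
    """Return the index of the gallery image voted for by the k matched audios.

    Assumption: all inputs are embeddings already mapped into one shared
    feature space (step 1). Cosine similarity stands in for the learned
    matching networks described in the abstract.
    """
    q = l2_normalize(query_audio[None, :])[0]
    audio = l2_normalize(gallery_audio)
    images = l2_normalize(gallery_images)

    # Step 2 (audio-audio matching): the k gallery audios closest to the query.
    matched = np.argsort(-(audio @ q))[:k]

    # Step 3 (audio-image matching): each matched audio picks its closest image.
    picks = [int(np.argmax(images @ audio[i])) for i in matched]

    # Vote: the most frequently picked image is the final result.
    return Counter(picks).most_common(1)[0][0]


# Toy data: three images on the coordinate axes and four gallery audios.
gallery_images = np.eye(3)
gallery_audio = np.array([
    [1.0, 0.1, 0.0],
    [0.9, 0.2, 0.0],
    [0.1, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])
query = np.array([1.0, 0.3, 0.0])

result = rerank_audio_to_image(query, gallery_audio, gallery_images, k=3)
print(result)  # 0: two of the three matched audios vote for image 0
```

Voting over several matched audios, rather than trusting the single nearest one, is what makes this a reranking scheme: one poor match can be outvoted by the others.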

Keywords: Remote sensing; Semantics; Feature extraction; Gallium nitride; Neural networks; Sensors; Mel frequency cepstral coefficient; Cross-modal generation; reranking
DOI: 10.1109/LGRS.2020.2992324
Indexed by: SCI; EI
Language: English
WOS accession number: WOS:000652799700012
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
EI accession number: 20212210436960
Citation statistics
Times cited: 4 [WOS]
Document type: Journal article
Item identifier: http://ir.opt.ac.cn/handle/181661/94861
Collection: Spectral Imaging Technology Laboratory
Corresponding author: Zheng, Xiangtao
Affiliations: 1. Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
2. Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Peoples R China
Recommended citation
GB/T 7714
Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,et al. Remote Sensing Image Generation From Audio[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,2021,18(6):994-998.
APA Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,&Lu, Xiaoqiang.(2021).Remote Sensing Image Generation From Audio.IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,18(6),994-998.
MLA Zheng, Zhiyuan,et al."Remote Sensing Image Generation From Audio".IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6(2021):994-998.
Files in this item
File name / size | Document type | Version | Access type | License
Remote Sensing Image (2017KB) | Journal article | Published version | Restricted access | CC BY-NC-SA
Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.