Remote Sensing Image Generation From Audio | |
Zheng, Zhiyuan1,2; Chen, Jun1; Zheng, Xiangtao2; Lu, Xiaoqiang2 | |
Author department | Spectral Imaging Technology Laboratory |
2021-06 | |
Journal | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS |
ISSN | 1545-598X;1558-0571 |
Volume | 18, Issue: 6, Pages: 994-998 |
Ownership ranking | 1 |
Abstract | Generating image from other modal data has attracted much attention in cross-modal studies, since the generated image offers intuitive vision information. Unlike the previous works which generate an image from text, a novel task is introduced, generating an image from audio. However, semantic gap intrinsically exists in cross-modal data, which disturbs the generative results. In order to explore the relevance between the audio and image, a novel reranking audio-image translation method is proposed. The proposed method: 1) maps the audio and image into a uniform feature space; 2) designs an audio-audio matching network to match the related audio; and 3) adopts an audio-image matching network for every matched audio to generate a related image, and the most frequent image is voted as the final result. Extensive experiments on two remote sensing cross-modal data sets demonstrate that the proposed method can visualize the content of audio. |
Keywords | Remote sensing; Semantics; Feature extraction; Gallium nitride; Neural networks; Sensors; Mel frequency cepstral coefficient; Cross-modal generation reranking |
DOI | 10.1109/LGRS.2020.2992324 |
Indexed by | SCI ; EI |
Language | English |
WOS accession number | WOS:000652799700012 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
EI accession number | 20212210436960 |
Document type | Journal article |
Item identifier | http://ir.opt.ac.cn/handle/181661/94861 |
Collection | Spectral Imaging Technology Laboratory |
Corresponding author | Zheng, Xiangtao |
Author affiliations | 1. Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China; 2. Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Peoples R China |
Recommended citation (GB/T 7714) | Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,et al. Remote Sensing Image Generation From Audio[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,2021,18(6):994-998. |
APA | Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,&Lu, Xiaoqiang.(2021).Remote Sensing Image Generation From Audio.IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,18(6),994-998. |
MLA | Zheng, Zhiyuan,et al."Remote Sensing Image Generation From Audio".IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6(2021):994-998. |
Files in this item | ||||||
File name/size | Document type | Version | Access type | License | ||
Remote Sensing Image(2017KB) | Journal article | Published version | Restricted access | CC BY-NC-SA | |
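
The abstract above describes a three-step reranking procedure: map audio and images into one feature space, match related audio, then let each matched audio nominate an image and vote. The sketch below illustrates only the match-then-vote logic. It is not the authors' implementation: it assumes the embeddings already exist in a shared space, it substitutes plain cosine similarity for the learned audio-audio and audio-image matching networks, it retrieves an existing image rather than generating one, and the names `l2_normalize`, `rerank_vote`, and `k` are hypothetical.

```python
from collections import Counter

import numpy as np


def l2_normalize(x):
    """Scale vectors to unit length so dot products equal cosine similarity."""
    x = np.asarray(x, dtype=float)
    return x / np.linalg.norm(x, axis=-1, keepdims=True)


def rerank_vote(query_audio, gallery_audio, gallery_images, k=3):
    """Return the index of the image voted for by the k audio clips nearest the query.

    Assumption: all inputs are embeddings already mapped into one shared
    feature space (step 1 in the abstract). Cosine similarity is a stand-in
    for the learned matching networks, which are not reproduced here.
    """
    q = l2_normalize(query_audio)
    a = l2_normalize(gallery_audio)
    v = l2_normalize(gallery_images)

    # Step 2 (stand-in): pick the k gallery audio clips most similar to the query.
    audio_scores = a @ q
    matched = np.argsort(-audio_scores, kind="stable")[:k]

    # Step 3 (stand-in): each matched clip nominates its most similar image.
    nominations = [int(np.argmax(v @ a[i])) for i in matched]

    # The most frequently nominated image is the final result.
    return Counter(nominations).most_common(1)[0][0]
```

Voting over several matched clips, rather than trusting the single query embedding, is what makes this a reranking step: one noisy audio-image match can be outvoted by the others.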