Cross-model retrieval with deep learning for business application | |
Wang, Yufei1; Wang, Huanting2,3; Yang, Jiating2; Chen, Jianbo3 | |
2021-03-09 | |
会议名称 | 2020 7th International Conference on Computer-Aided Design, Manufacturing, Modeling and Simulation, CDMMS 2020 |
会议录名称 | 7th International Conference on Computer-Aided Design, Manufacturing, Modeling and Simulation, CDMMS 2020 - 2. Algorithm Design and Computational Science |
卷号 | 1802 |
期号 | 3 |
会议日期 | 2020-11-14 |
会议地点 | Busan, Korea, Republic of |
出版者 | IOP Publishing Ltd |
产权排序 | 2 |
摘要 | Cross-modal retravel has been used in many fields, such as business and search engines. Most search engines for business are text-based, but text-based search engines are limited by equipment and the strict requirement for knowledge. Text-based search needs keyboards to finish the search process, which requires users to have the knowledge of using keyboards. Compared to the text-based search, audio-based search has advantages. First, it avoids the traditional ways of inputting information. And it gets rid of the gap in time between inputting information for searching and getting useful information. In this paper, we propose a way to use audio to search images for business applications. We use deep learning to implement cross-modal retrieval systems between images and audio. We first extract features from images and audio respectively. And then we implement a neural network with two identical networks to learn the correspondence between images and audio. The first network extracts the features from images and audio further for calculation, and the second network learns whether two features from different modalities are related. This research provides a new way for business applications to search for information more instantly. © Published under licence by IOP Publishing Ltd. |
关键词 | Cross-modal retrieval Audio features Deep hashing Useful information |
作者部门 | 光谱成像技术研究室 |
DOI | 10.1088/1742-6596/1802/3/032035 |
收录类别 | EI |
语种 | 英语 |
ISSN号 | 17551307;17551315 |
EI入藏号 | 20211210123555 |
引用统计 | |
文献类型 | 会议论文 |
条目标识符 | http://ir.opt.ac.cn/handle/181661/94577 |
专题 | 光谱成像技术研究室 |
通讯作者 | Yang, Jiating |
作者单位 | 1.Simon Fraser University, 8888 University Dr, Bumaby; BC; V5A 1S6, Canada 2.Xian Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xian, China 3.University of Chinese Academy of Sciences, Beijing; 100049, China |
推荐引用方式 GB/T 7714 | Wang, Yufei,Wang, Huanting,Yang, Jiating,et al. Cross-model retrieval with deep learning for business application[C]:IOP Publishing Ltd,2021. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Wang_2021_J._Phys.__(1219KB) | 会议论文 | 限制开放 | CC BY-NC-SA | 请求全文 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论