Simulating Human Visual System Based on Vision Transformer | |
Qiu, Mengyu1; Guo, Yi2; Zhang, Mingguang1; Zhang, Jingwei1; Lan, Tian1; Liu, Zhilin1 | |
2023 | |
会议名称 | 11th ACM Symposium on Spatial User Interaction (SUI) |
会议录名称 | ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023 |
会议日期 | 2023-10-13 |
会议地点 | Sydney, AUSTRALIA |
出版者 | ASSOC COMPUTING MACHINERY |
产权排序 | 2 |
摘要 | The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods. |
关键词 | Visual scanpath prediction fixation duration prediction saccade Sequences visual attention scene analysis |
作者部门 | 光谱成像技术研究室 |
DOI | 10.1145/3607822.3616408 |
收录类别 | CPCI |
ISBN号 | 979-8-4007-0281-5 |
语种 | 英语 |
WOS记录号 | WOS:001138802600058 |
引用统计 | |
文献类型 | 会议论文 |
条目标识符 | http://ir.opt.ac.cn/handle/181661/97182 |
专题 | 光谱成像技术研究室 |
通讯作者 | Zhang, Mingguang |
作者单位 | 1.Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China 2.Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China |
推荐引用方式 GB/T 7714 | Qiu, Mengyu,Guo, Yi,Zhang, Mingguang,et al. Simulating Human Visual System Based on Vision Transformer[C]:ASSOC COMPUTING MACHINERY,2023. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Simulating Human Vis(1919KB) | 会议论文 | 限制开放 | CC BY-NC-SA | 请求全文 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论