OPT OpenIR  > 光谱成像技术研究室
Action recognition by spatio-temporal oriented energies
Zhen, Xiantong1,2; Shao, Ling1,2; Li, Xuelong3
作者部门光学影像学习与分析中心
2014-10-10
发表期刊INFORMATION SCIENCES
ISSN0020-0255
卷号281页码:295-309
摘要In this paper, we present a unified representation based on the spatio-temporal steerable pyramid (STSP) for the holistic representation of human actions. A video sequence is viewed as a spatio-temporal volume preserving all the appearance and motion information of an action in it. By decomposing the spatio-temporal volumes into band-passed sub-volumes, the spatio-temporal Laplacian pyramid provides an effective technique for multi-scale analysis of video sequences, and spatio-temporal patterns with different scales could be well localized and captured. To efficiently explore the underlying local spatio-temporal orientation structures at multiple scales, a bank of three-dimensional separable steerable filters are conducted on each of the sub-volume from the Laplacian pyramid. The outputs of the quad-rature pair of steerable filters are squared and summed to yield a more robust oriented energy representation. To be further invariant and compact, a spatio-temporal max pooling operation is performed between responses of the filtering at adjacent scales and over spatio-temporal neighbourhoods. In order to capture the appearance, local geometric structure and motion of an action, we apply the STSP on the intensity, 3D gradients and optical flow of video sequences, yielding a unified holistic representation of human actions.
文章类型Article
关键词Action Recognition Steerable Filters Spatio-temporal Oriented Energies Spatio-temporal Laplacian Pyramid
WOS标题词Science & Technology ; Technology
DOI10.1016/j.ins.2014.05.021
收录类别SCI ; EI
关键词[WOS]SCENE CLASSIFICATION ; VISUAL-ATTENTION ; REPRESENTATION ; MOTION ; MODELS
语种英语
WOS研究方向Computer Science
WOS类目Computer Science, Information Systems
WOS记录号WOS:000340315600019
引用统计
被引频次:44[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.opt.ac.cn/handle/181661/22386
专题光谱成像技术研究室
作者单位1.Nanjing Univ Informat Sci & Technol, Coll Elect & Informat Engn, Nanjing 210044, Jiangsu, Peoples R China
2.Univ Sheffield, Dept Elect & Elect Engn, Sheffield S1 3JD, S Yorkshire, England
3.Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Xian 710119, Shaanxi, Peoples R China
推荐引用方式
GB/T 7714
Zhen, Xiantong,Shao, Ling,Li, Xuelong. Action recognition by spatio-temporal oriented energies[J]. INFORMATION SCIENCES,2014,281:295-309.
APA Zhen, Xiantong,Shao, Ling,&Li, Xuelong.(2014).Action recognition by spatio-temporal oriented energies.INFORMATION SCIENCES,281,295-309.
MLA Zhen, Xiantong,et al."Action recognition by spatio-temporal oriented energies".INFORMATION SCIENCES 281(2014):295-309.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Action recognition b(2053KB)期刊论文出版稿限制开放CC BY请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhen, Xiantong]的文章
[Shao, Ling]的文章
[Li, Xuelong]的文章
百度学术
百度学术中相似的文章
[Zhen, Xiantong]的文章
[Shao, Ling]的文章
[Li, Xuelong]的文章
必应学术
必应学术中相似的文章
[Zhen, Xiantong]的文章
[Shao, Ling]的文章
[Li, Xuelong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。