Action recognition using spatial-optical data organization and sequential learning framework

doi:10.1016/j.neucom.2018.06.071

OPT OpenIR > 光谱成像技术研究室

	Action recognition using spatial-optical data organization and sequential learning framework
	Yuan, Yuan 1; Zhao, Yang 1,2; Wang, Qi 3,4
作者部门	光学影像学习与分析中心
	2018-11-13
发表期刊	NEUROCOMPUTING
ISSN	0925-2312;1872-8286
卷号	315 页码:221-233
产权排序	1
摘要	Recognizing human actions in videos is a challenging problem owning to complex motion appearance, various backgrounds and semantic gap between low-level features and high-level semantics. Existing methods have scored some achievements and many new thoughts have been proposed for action recognition. They focus on designing a robust feature description and training an elaborate learning model, and many of them can benefit from a two-stream network with a stack of RGB frames and optical flow frames. However, these features for human action representation are struggling with the limited feature representation as RGB videos are confused by static appearance redundancy and optical flow videos cannot represent the detailed appearance. To solve these problems, we propose an efficient algorithm based on the spatial-optical data organization and the sequential learning framework. There are two contributions of our method: a novel data organization based on hierarchical weighting segmentation and optical flow for video representation, and a lightweight deep learning model based on the Convolutional 3D (C3D) network and the Recurrent Neural Network (RNN) for complicated action recognition. The new data organization aggregates the merits of motion appearance, movement trajectories and optical flow in a creative way to highlight the meaningful information. And the proposed lightweight model has an insight into patterns and semantics of sequential data by low-level spatiotemporal feature extraction and high-level information mining. The proposed method is evaluated on the state-of-the-art dataset and the results demonstrate that our method have a good performance for complex human action recognition. (c) 2018 Elsevier B.V. All rights reserved.
关键词	Action Recognition Spatiotemporal Feature Deep Learning Sequential Learning Framework
DOI	10.1016/j.neucom.2018.06.071
收录类别	SCI ; EI
语种	英语
WOS记录号	WOS:000445934400022
出版者	ELSEVIER SCIENCE BV
EI入藏号	20183005605340
引用统计	被引频次：29[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://ir.opt.ac.cn/handle/181661/30659
专题	光谱成像技术研究室
通讯作者	Wang, Qi
作者单位	1.Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China 2.Univ Chinese Acad Sci, Beijing 049, Peoples R China 3.Northwestern Polytech Univ, Sch Comp Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China 4.Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shaanxi, Peoples R China
推荐引用方式 GB/T 7714	Yuan, Yuan,Zhao, Yang,Wang, Qi. Action recognition using spatial-optical data organization and sequential learning framework[J]. NEUROCOMPUTING,2018,315:221-233.
APA	Yuan, Yuan,Zhao, Yang,&Wang, Qi.(2018).Action recognition using spatial-optical data organization and sequential learning framework.NEUROCOMPUTING,315,221-233.
MLA	Yuan, Yuan,et al."Action recognition using spatial-optical data organization and sequential learning framework".NEUROCOMPUTING 315(2018):221-233.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
Action recognition u（2476KB）	期刊论文	出版稿	限制开放	CC BY-NC-SA	请求全文