Towards convolutional neural networks compression via global error reconstruction | |
Lin, Shaohui1,2; Ji, Rongrong1,2; Guo, Xiaowei3; Li, Xuelong4![]() | |
2016 | |
会议名称 | 25th International Joint Conference on Artificial Intelligence, IJCAI 2016 |
会议录名称 | Proceedings of the 25th International Joint Conference on Artificial Intelligence, IJCAI 2016 |
卷号 | 2016-January |
页码 | 1753-1759 |
会议日期 | 2016-07-09 |
会议地点 | New York, NY, United states |
出版者 | International Joint Conferences on Artificial Intelligence |
产权排序 | 4 |
摘要 | In recent years, convolutional neural networks (CNNs) have achieved remarkable success in various applications such as image classification, object detection, object parsing and face alignment. Such CNN models are extremely powerful to deal with massive amounts of training data by using millions and billions of parameters. However, these models are typically deficient due to the heavy cost in model storage, which prohibits their usage on resource-limited applications like mobile or embedded devices. In this paper, we target at compressing CNN models to an extreme without significantly losing their discriminability. Our main idea is to explicitly model the output reconstruction error between the original and compressed CNNs, which error is minimized to pursuit a satisfactory rate-distortion after compression. In particular, a global error reconstruction method termed GER is presented, which firstly leverages an SVD-based low-rank approximation to coarsely compress the parameters in the fully connected layers in a layerwise manner. Subsequently, such layer-wise initial compressions are jointly optimized in a global perspective via back-propagation. The proposed GER method is evaluated on the ILSVRC2012 image classification benchmark, with implementations on two widely-adopted convolutional neural networks, i.e., the AlexNet and VGGNet-19. Comparing to several state-of-the-art and alternative methods of CNN compression, the proposed scheme has demonstrated the best rate-distortion performance on both networks. |
关键词 | Approximation Theory Artificial Intelligence Backpropagation Compaction Convolution Data Compression Digital Storage Electric Distortion Errors Image Coding Neural Networks Signal Distortion |
学科领域 | Electricity: Basic Concepts And Phenomena |
作者部门 | 光学影像学习与分析中心 |
收录类别 | EI |
语种 | 英语 |
ISSN号 | 10450823 |
文献类型 | 会议论文 |
条目标识符 | http://ir.opt.ac.cn/handle/181661/28569 |
专题 | 光谱成像技术研究室 |
通讯作者 | Ji, Rongrong (rrji@xmu.edu.cn) |
作者单位 | 1.Fujian Key Laboratory of Sensing and Computing for Smart City, Xiamen University, 361005, China 2.School of Information Science and Engineering, Xiamen University, 361005, China 3.BestImage, Tencent Technology (Shanghai) Co., Ltd, China 4.Xi'An Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an, China |
推荐引用方式 GB/T 7714 | Lin, Shaohui,Ji, Rongrong,Guo, Xiaowei,et al. Towards convolutional neural networks compression via global error reconstruction[C]:International Joint Conferences on Artificial Intelligence,2016:1753-1759. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Towards convolutiona(928KB) | 会议论文 | 限制开放 | CC BY-NC-SA | 请求全文 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论