PixelLink: Detecting scene text via instance segmentation | |
Deng, Dan1,3,5; Liu, Haifeng1; Li, Xuelong4![]() | |
2018 | |
会议名称 | 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 |
会议录名称 | 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 |
页码 | 6773-6780 |
会议日期 | 2018-02-02 |
会议地点 | New Orleans, LA, United states |
出版者 | AAAI press |
产权排序 | 4 |
摘要 | Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text bounding boxes are then extracted directly from the segmentation result without location regression. Experiments show that, compared with regression-based methods, PixelLink can achieve better or comparable performance on several benchmarks, while requiring many fewer training iterations and less training data. Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. |
作者部门 | 光谱成像技术研究室 |
收录类别 | EI ; CPCI |
ISBN号 | 9781577358008 |
语种 | 英语 |
WOS记录号 | WOS:000485488906105 |
EI入藏号 | 20190506436164 |
引用统计 | |
文献类型 | 会议论文 |
条目标识符 | http://ir.opt.ac.cn/handle/181661/31241 |
专题 | 光谱成像技术研究室 |
作者单位 | 1.State Key Lab of CAD and CG, College of Computer Science, Zhejiang University, China; 2.Alibaba-Zhejiang University Joint Institute of Frontier Technologies, China; 3.CVTE Research, China; 4.Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, China; 5.Visual Computing Group, CVTE Research, China |
推荐引用方式 GB/T 7714 | Deng, Dan,Liu, Haifeng,Li, Xuelong,et al. PixelLink: Detecting scene text via instance segmentation[C]:AAAI press,2018:6773-6780. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
PixelLink: Detecting(798KB) | 会议论文 | 限制开放 | CC BY-NC-SA | 请求全文 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论