[简体中文]

Scene Text Detection and Recognition

time: 2017-12-01 17:55:53, category:

Mask TextSpotter:

Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes. ECCV, 2018.

CornerText:

Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai. Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.  IEEE CVPR, 2018. [pdf]

RRD:

Minghui Liao, Zhen Zhu, Baoguang Shi, Gui-song Xia, Xiang Bai. Rotation-Sensitive Regression for Oriented Scene Text Detection. IEEE CVPR, 2018. [pdf]

SegLink:

Baoguang Shi, Xiang Bai, Serge Belongie. Detecting Oriented Text in Natural Images by Linking Segments.  IEEE CVPR, 2017. [pdf][code]

TextBoxes:

Minghui Liao, Baoguang Shi, Xiang Bai, et al. TextBoxes: A Fast Text Detector with a Single Deep Neural Network. The 31st AAAI Conference on Artificial Intelligence (AAAI), 2017. [pdf][code]

FCN_Text:

Zheng Zhang, Chengquan Zhang, Xiang Bai, et al. Multi-Oriented Text Detection with Fully Convolutional Networks. IEEE CVPR, 2016. [pdf][code]

Symmetry-Based Text Line Detection:

Symmetry-Based Text Line Detection in Natural Scenes. Zheng Zhang, Wei Shen, Cong Yao, Xiang Bai. IEEE CVPR, 2015. [pdf][code]

A Unified Framework for Multi-Oriented Text Detection and Recognition:

Cong Yao, Xiang Bai, Wenyu Liu. A Unified Framework for Multi-Oriented Text Detection and Recognition. IEEE Transactions on Image Processing (TIP), 2014. [pdf][HUST-TR400 Dataset]

ASTER:

Baoguang Shi, Mingkun Yang, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai. ASTER: An Attentional Scene Text Recognizer with Flexible Rectification IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, accepted. [pdf]

CRNN:

Baoguang Shi, Xiang Bai, Cong Yao. An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition.  IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, accepted.  [pdf][code]

Strokelets:

Xiang Bai, Cong Yao, Wenyu Liu. Strokelets: A Learned Multi-Scale Mid-Level Representation for Scene Text Recognition. IEEE Transations on Image Proc., 2016. [pdf][code]

Join the Discussion