├── README.md └── datasets ├── .DS_Store ├── CASIA-10K.md ├── _vnote.json └── svt-p.zip /README.md: -------------------------------------------------------------------------------- 1 | # SceneTextPapers 2 | Tracking the latest progress in Scene Text Detection and Recognition: must-read papers well organized 3 | 4 | ## Information about this repository 5 | This repo serves as a complement to our IJCV paper: 6 | 7 | - __Scene Text Detection and Recognition: The Deep Learning Era__. _Shangbang Long, Xin He, Cong Yao_. [\[arxiv preprint\]](https://arxiv.org/abs/1811.04256) [\[IJCV version\]](https://link.springer.com/article/10.1007/s11263-020-01369-0) 8 | 9 | 10 | ## Citing this work 11 | 12 | If you find this paper helpful in understanding the latest history of scene text detection&recognition algorithms as well as designing new ones , you are highly encouraged 13 | (though not required) to cite our paper 14 | 15 | ``` 16 | @article{long2020scene, 17 | title={Scene text detection and recognition: The deep learning era}, 18 | author={Long, Shangbang and He, Xin and Yao, Cong}, 19 | journal={International Journal of Computer Vision}, 20 | pages={1--24}, 21 | year={2020}, 22 | publisher={Springer} 23 | } 24 | ``` 25 | 26 | 27 | ## Papers 28 | 29 | ### I. Other Survey Papers: 30 | 1. __Scene text detection and recognition: Recent advances and future trends.__ _Zhu, Yingying and Yao, Cong and Bai, Xiang_. _Frontiers of Computer Science_, 2016[\[paper\]](https://link.springer.com/article/10.1007/s11704-015-4488-0) 31 | 2. __Text detection, tracking and recognition in video: A comprehensive survey.__ _Yin, Xu-Cheng and Zuo, Ze-Yu and Tian, Shu and Liu, Cheng-Lin_. __TIP__, 2016 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/7452620/) 32 | 3. __Text detection and recognition in imagery: A survey.__ _Ye, Qixiang and Doermann, David_. __TPAMI__, 2015 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/6945320/) 33 | 4. __Text localization and recognition in images and video.__ _Uchida, Seiichi_. 2014 [\[paper\]](https://link.springer.com/referenceworkentry/10.1007%2F978-0-85729-859-1_28) 34 | 35 | ### II. Main: Scene Text Detection and Recognition 36 | 37 | #### 2.1 Detection 38 | 39 | ##### 2.1.1 Pipeline Simplification 40 | 41 | ###### Anchor-based methods 42 | 1. __Single Shot Text Detector With Regional Attention.__ _He, Pan and Huang, Weilin and He, Tong and Zhu, Qile and Qiao, Yu and Li, Xiaolin_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/He_Single_Shot_Text_ICCV_2017_paper.pdf) [\[code\]](https://github.com/BestSonny/SSTD) 43 | 2. __TextBoxes: A Fast Text Detector with a Single Deep Neural Network.__ _Liao, Minghui and Shi, Baoguang and Bai, Xiang and Wang, Xinggang and Liu, Wenyu_. __AAAI__, 2017 [\[paper\]](https://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/download/14202/14295) [\[code\]](https://github.com/MhLiao/TextBoxes) 44 | 3. __Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection.__ _Liu, Yuliang and Jin, Lianwen_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Liu_Deep_Matching_Prior_CVPR_2017_paper.pdf) 45 | 4. __Detecting Oriented Text in Natural Images by Linking Segments.__ _Shi, Baoguang and Bai, Xiang and Belongie, Serge_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Shi_Detecting_Oriented_Text_CVPR_2017_paper.pdf) [\[code\]](https://github.com/bgshih/seglink) 46 | 5. __EAST: An Efficient and Accurate Scene Text Detector.__ _Zhou, Xinyu and Yao, Cong and Wen, He and Wang, Yuzhi and Zhou, Shuchang and He, Weiran and Liang, Jiajun_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Zhou_EAST_An_Efficient_CVPR_2017_paper.pdf) [\[code\]](https://github.com/zxytim/EAST) 47 | 48 | ###### Region proposal methods 49 | 1. __Detecting Curve Text in the Wild: New Dataset and New Solution.__ _Yuliang, Liu and Lianwen, Jin and Shuaitao, Zhang and Sheng, Zhang_. 2017 [\[paper\]](https://arxiv.org/abs/1712.02170) [\[code\]](https://github.com/Yuliang-Liu/Curve-Text-Detector) 50 | 2. __R2CNN: rotational region CNN for orientation robust scene text detection.__ _Jiang, Yingying and Zhu, Xiangyu and Wang, Xiaobing and Yang, Shuli and Li, Wei and Wang, Hua and Fu, Pei and Luo, Zhenbo_. 2017 [\[paper\]](https://arxiv.org/abs/1706.09579) 51 | 3. __Arbitrary-Oriented Scene Text Detection via Rotation Proposals.__ _Ma, Jianqi and Shao, Weiyuan and Ye, Hao and Wang, Li and Wang, Hong and Zheng, Yingbin and Xue, Xiangyang_. __T MULTIMEDIA__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8323240/) [\[code\]](https://github.com/mjq11302010044/RRPN) 52 | 4. __weakly supervised text attention network for generating text proposals in scene images.__ _Rong, Li and MengYi, En and JianQiang, Li and HaiBin, Zhang_. __ICDAR__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8269992/) 53 | 5. __Rotation-Sensitive Regression for Oriented Scene Text Detection.__ _Liao, Minghui and Zhu, Zhen and Shi, Baoguang and Xia, Gui-song and Bai, Xiang_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/papers/Liao_Rotation-Sensitive_Regression_for_CVPR_2018_paper.pdf) [\[code\]](https://github.com/MhLiao/RRD) 54 | 6. __Feature Enhancement Network: A Refined Scene Text Detector.__ _Sheng, Zhang and Yuliang, Liu and Lianwen, Jin and Canjie, Luo_. __AAAI__, 2017 [\[paper\]](https://arxiv.org/abs/1711.04249) 55 | 56 | ##### 2.1.2 Differnt Prediction Units 57 | 58 | ###### Text instance level 59 | 1. __Detecting Curve Text in the Wild: New Dataset and New Solution.__ _Yuliang, Liu and Lianwen, Jin and Shuaitao, Zhang and Sheng, Zhang_. 2017 [\[paper\]](https://arxiv.org/abs/1712.02170) [\[code\]](https://github.com/Yuliang-Liu/Curve-Text-Detector) 60 | 2. __TextBoxes: A Fast Text Detector with a Single Deep Neural Network.__ _Liao, Minghui and Shi, Baoguang and Bai, Xiang and Wang, Xinggang and Liu, Wenyu_. __AAAI__, 2017 [\[paper\]](https://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/download/14202/14295) [\[code\]](https://github.com/MhLiao/TextBoxes) 61 | 3. __EAST: An Efficient and Accurate Scene Text Detector.__ _Zhou, Xinyu and Yao, Cong and Wen, He and Wang, Yuzhi and Zhou, Shuchang and He, Weiran and Liang, Jiajun_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Zhou_EAST_An_Efficient_CVPR_2017_paper.pdf) [\[code\]](https://github.com/zxytim/EAST) 62 | 4. __R2CNN: rotational region CNN for orientation robust scene text detection.__ _Jiang, Yingying and Zhu, Xiangyu and Wang, Xiaobing and Yang, Shuli and Li, Wei and Wang, Hua and Fu, Pei and Luo, Zhenbo_. 2017 [\[paper\]](https://arxiv.org/abs/1706.09579) 63 | 5. __Arbitrary-Oriented Scene Text Detection via Rotation Proposals.__ _Ma, Jianqi and Shao, Weiyuan and Ye, Hao and Wang, Li and Wang, Hong and Zheng, Yingbin and Xue, Xiangyang_. __T MULTIMEDIA__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8323240/) [\[code\]](https://github.com/mjq11302010044/RRPN) 64 | 6. __Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection.__ _Liu, Yuliang and Jin, Lianwen_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Liu_Deep_Matching_Prior_CVPR_2017_paper.pdf) 65 | 7. __Deep Direct Regression for Multi-Oriented Scene Text Detection.__ _He, Wenhao and Zhang, Xu-Yao and Yin, Fei and Liu, Cheng-Lin_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/He_Deep_Direct_Regression_ICCV_2017_paper.pdf) 66 | 8. __Fused Text Segmentation Networks for Multi-oriented Scene Text Detection.__ _Dai, Yuchen and Huang, Zheng and Gao, Yuting and Chen, Kai_. 2017 [\[paper\]](https://arxiv.org/abs/1709.03272) 67 | 9. __Feature Enhancement Network: A Refined Scene Text Detector.__ _Sheng, Zhang and Yuliang, Liu and Lianwen, Jin and Canjie, Luo_. __AAAI__, 2017 [\[paper\]](https://arxiv.org/abs/1711.04249) 68 | 10. __Rotation-Sensitive Regression for Oriented Scene Text Detection.__ _Liao, Minghui and Zhu, Zhen and Shi, Baoguang and Xia, Gui-song and Bai, Xiang_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/papers/Liao_Rotation-Sensitive_Regression_for_CVPR_2018_paper.pdf) [\[code\]](https://github.com/MhLiao/RRD) 69 | 70 | ###### Bottom-up (Pixel) 71 | 1. __Scene text detection via holistic, multi-channel prediction.__ _Yao, Cong and Bai, Xiang and Sang, Nong and Zhou, Xinyu and Zhou, Shuchang and Cao, Zhimin_. 2016 [\[paper\]](https://arxiv.org/abs/1606.09002) 72 | 2. __Multi-oriented text detection with fully convolutional networks.__ _Zhang, Zheng and Zhang, Chengquan and Shen, Wei and Yao, Cong and Liu, Wenyu and Bai, Xiang_. __CVPR__, 2016 [\[paper\]](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Zhang_Multi-Oriented_Text_Detection_CVPR_2016_paper.html) [\[code\]](https://github.com/stupidZZ/FCN_Text) 73 | 3. __Self-organized Text Detection with Minimal Post-processing via Border Learning.__ _Wu, Yue and Natarajan, Prem_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Wu_Self-Organized_Text_Detection_ICCV_2017_paper.pdf) 74 | 4. __Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild.__ _He, Dafang and Yang, Xiao and Liang, Chen and Zhou, Zihan and Ororbia, Alexander G and Kifer, Daniel and Giles, C Lee_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/He_Multi-Scale_FCN_With_CVPR_2017_paper.pdf) 75 | 5. __Single Shot Text Detector With Regional Attention.__ _He, Pan and Huang, Weilin and He, Tong and Zhu, Qile and Qiao, Yu and Li, Xiaolin_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/He_Single_Shot_Text_ICCV_2017_paper.pdf) [\[code\]](https://github.com/BestSonny/SSTD) 76 | 6. __PixelLink: Detecting Scene Text via Instance Segmentation.__ _Dan, Deng and Haifeng, Liu and Xuelong, Li and Deng, Cai_. __AAAI__, 2018 [\[paper\]](https://arxiv.org/abs/1801.01315) [\[code\]](https://github.com/ZJULearning/pixel_link) 77 | 78 | ###### Bottom-up (Components) 79 | 1. __Detecting text in natural image with connectionist text proposal network.__ _Tian, Zhi and Huang, Weilin and He, Tong and He, Pan and Qiao, Yu_. __ECCV__, 2016 [\[paper\]](https://link.springer.com/chapter/10.1007/978-3-319-46484-8_4) [\[code\]](https://github.com/tianzhi0549/CTPN) 80 | 2. __Aggregating local context for accurate scene text detection.__ _He, Dafang and Yang, Xiao and Huang, Wenyi and Zhou, Zihan and Kifer, Daniel and Giles, C Lee_. __ACCV__, 2016 [\[paper\]](https://link.springer.com/chapter/10.1007/978-3-319-54193-8_18) 81 | 3. __Detecting Oriented Text in Natural Images by Linking Segments.__ _Shi, Baoguang and Bai, Xiang and Belongie, Serge_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Shi_Detecting_Oriented_Text_CVPR_2017_paper.pdf) [\[code\]](https://github.com/bgshih/seglink) 82 | 4. __Scene Text Detection with Novel Superpixel Based Character Candidate Extraction.__ _Wang, Cong and Yin, Fei and Liu, Cheng-Lin_. 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8270087/) 83 | 5. __Deep Residual Text Detection Network for Scene Text.__ _Zhu, Xiangyu and Jiang, Yingying and Yang, Shuli and Wang, Xiaobing and Li, Wei and Fu, Pei and Wang, Hua and Luo, Zhenbo_. __ICDAR__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8270068/) 84 | 6. __Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.__ _Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/CameraReady/1688.pdf) 85 | 86 | ##### 2.1.3 Specific Targets 87 | 88 | ###### Long text 89 | 1. __Detecting Oriented Text in Natural Images by Linking Segments.__ _Shi, Baoguang and Bai, Xiang and Belongie, Serge_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Shi_Detecting_Oriented_Text_CVPR_2017_paper.pdf) [\[code\]](https://github.com/bgshih/seglink) 90 | 2. __R2CNN: rotational region CNN for orientation robust scene text detection.__ _Jiang, Yingying and Zhu, Xiangyu and Wang, Xiaobing and Yang, Shuli and Li, Wei and Wang, Hua and Fu, Pei and Luo, Zhenbo_. 2017 [\[paper\]](https://arxiv.org/abs/1706.09579) 91 | 3. __Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.__ _Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/CameraReady/1688.pdf) 92 | 93 | ###### Multi-oriented text 94 | 1. __R2CNN: rotational region CNN for orientation robust scene text detection.__ _Jiang, Yingying and Zhu, Xiangyu and Wang, Xiaobing and Yang, Shuli and Li, Wei and Wang, Hua and Fu, Pei and Luo, Zhenbo_. 2017 [\[paper\]](https://arxiv.org/abs/1706.09579) 95 | 2. __TextBoxes: A Fast Text Detector with a Single Deep Neural Network.__ _Liao, Minghui and Shi, Baoguang and Bai, Xiang and Wang, Xinggang and Liu, Wenyu_. __AAAI__, 2017 [\[paper\]](https://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/download/14202/14295) [\[code\]](https://github.com/MhLiao/TextBoxes) 96 | 3. __Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection.__ _Liu, Yuliang and Jin, Lianwen_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Liu_Deep_Matching_Prior_CVPR_2017_paper.pdf) 97 | 4. __Arbitrary-Oriented Scene Text Detection via Rotation Proposals.__ _Ma, Jianqi and Shao, Weiyuan and Ye, Hao and Wang, Li and Wang, Hong and Zheng, Yingbin and Xue, Xiangyang_. __T MULTIMEDIA__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8323240/) [\[code\]](https://github.com/mjq11302010044/RRPN) 98 | 5. __Detecting Oriented Text in Natural Images by Linking Segments.__ _Shi, Baoguang and Bai, Xiang and Belongie, Serge_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Shi_Detecting_Oriented_Text_CVPR_2017_paper.pdf) [\[code\]](https://github.com/bgshih/seglink) 99 | 6. __EAST: An Efficient and Accurate Scene Text Detector.__ _Zhou, Xinyu and Yao, Cong and Wen, He and Wang, Yuzhi and Zhou, Shuchang and He, Weiran and Liang, Jiajun_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Zhou_EAST_An_Efficient_CVPR_2017_paper.pdf) [\[code\]](https://github.com/zxytim/EAST) 100 | 7. __Rotation-Sensitive Regression for Oriented Scene Text Detection.__ _Liao, Minghui and Zhu, Zhen and Shi, Baoguang and Xia, Gui-song and Bai, Xiang_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/papers/Liao_Rotation-Sensitive_Regression_for_CVPR_2018_paper.pdf) [\[code\]](https://github.com/MhLiao/RRD) 101 | 8. __Geometry-Aware Scene Text Detection With Instance Transformation Network.__ _Wang, Fangfang and Zhao, Liming and Li, Xi and Wang, Xinchao and Tao, Dacheng_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/papers/Wang_Geometry-Aware_Scene_Text_CVPR_2018_paper.pdf) [\[code\]](https://github.com/zlmzju/itn) 102 | 103 | ###### Irregular text 104 | 1. __Detecting Curve Text in the Wild: New Dataset and New Solution.__ _Yuliang, Liu and Lianwen, Jin and Shuaitao, Zhang and Sheng, Zhang_. 2017 [\[paper\]](https://arxiv.org/abs/1712.02170) [\[code\]](https://github.com/Yuliang-Liu/Curve-Text-Detector) 105 | 2. __Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes.__ _Lyu, Pengyuan and Liao, Minghui and Yao, Cong and Wu, Wenhao and Bai, Xiang_. __ECCV__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_ECCV_2018/papers/Pengyuan_Lyu_Mask_TextSpotter_An_ECCV_2018_paper.pdf) 106 | 3. __TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes.__ _Long, Shangbang and Ruan, Jiaqiang and Zhang, Wenjie and He, Xin and Wu, Wenhao and Yao, Cong_. __ECCV__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_ECCV_2018/papers/Shangbang_Long_TextSnake_A_Flexible_ECCV_2018_paper.pdf) 107 | 4. __Scene Text Detection with Supervised Pyramid Context Network.__ _Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li_. __AAAI__, 2019 [\[paper\]](https://arxiv.org/pdf/1811.08605.pdf) 108 | 5. __Learning Shape-Aware Embedding for Scene Text Detection.__ _Zhuotao Tian, Michelle Shu, Pengyuan Lyu, Ruiyu Li, Chao Zhou, Xiaoyong Shen, Jiaya Jia_. __CVPR__, 2019 [\[paper\]](http://jiaya.me/papers/textdetection_cvpr19.pdf) 109 | 6. __Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation.__ _Xiaobing Wang, Yingying Jiang, Zhenbo Luo, Cheng-Lin Liu, Hyunsoo Choi, Sungjin Kim_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1905.05980) 110 | 7. __Towards Robust Curve Text Detection With Conditional Spatial Expansion.__ _Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1903.08836) 111 | 8. __Shape Robust Text Detection With Progressive Scale Expansion Network.__ _Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1806.02559) 112 | 9. __Character Region Awareness for Text Detection.__ _Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1904.01941) 113 | 10. __Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes.__ _Chengquan Zhang, Borong Liang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding, Xinghao Ding_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1904.06535) 114 | 11. __Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network.__ _Wang, Wenhai and Xie, Enze and Song, Xiaoge and Zang, Yuhang and Wang, Wenjia and Lu, Tong and Yu, Gang and Shen, Chunhua_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Wang_Efficient_and_Accurate_Arbitrary-Shaped_Text_Detection_With_Pixel_Aggregation_Network_ICCV_2019_paper.pdf) 115 | 116 | 117 | 118 | 119 | ###### Speed up 120 | 1. __EAST: An Efficient and Accurate Scene Text Detector.__ _Zhou, Xinyu and Yao, Cong and Wen, He and Wang, Yuzhi and Zhou, Shuchang and He, Weiran and Liang, Jiajun_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Zhou_EAST_An_Efficient_CVPR_2017_paper.pdf) [\[code\]](https://github.com/zxytim/EAST) 121 | 122 | ###### Easy instance segmentation 123 | 1. __Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild.__ _He, Dafang and Yang, Xiao and Liang, Chen and Zhou, Zihan and Ororbia, Alexander G and Kifer, Daniel and Giles, C Lee_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/He_Multi-Scale_FCN_With_CVPR_2017_paper.pdf) 124 | 2. __Self-organized Text Detection with Minimal Post-processing via Border Learning.__ _Wu, Yue and Natarajan, Prem_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Wu_Self-Organized_Text_Detection_ICCV_2017_paper.pdf) 125 | 3. __WordFence: Text Detection in Natural Images with Border Awareness.__ _Polzounov, Andrei and Ablavatski, Artsiom and Escalera, Sergio and Lu, Shijian and Cai, Jianfei_. __ICIP__, 2017 [\[paper\]](https://arxiv.org/abs/1705.05483) 126 | 4. __PixelLink: Detecting Scene Text via Instance Segmentation.__ _Dan, Deng and Haifeng, Liu and Xuelong, Li and Deng, Cai_. __AAAI__, 2018 [\[paper\]](https://arxiv.org/abs/1801.01315) [\[code\]](https://github.com/ZJULearning/pixel_link) 127 | 128 | ###### Retrieving designated text 129 | 1. __Unambiguous text localization and retrieval for cluttered scenes.__ _Rong, Xuejian and Yi, Chucai and Tian, Yingli_. __CVPR__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2017/papers/Rong_Unambiguous_Text_Localization_CVPR_2017_paper.pdf) 130 | 131 | ###### Against complex background 132 | 1. __Single Shot Text Detector With Regional Attention.__ _He, Pan and Huang, Weilin and He, Tong and Zhu, Qile and Qiao, Yu and Li, Xiaolin_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/He_Single_Shot_Text_ICCV_2017_paper.pdf) [\[code\]](https://github.com/BestSonny/SSTD) 133 | 134 | #### 2.2 Recognition 135 | 136 | ##### 2.2.1 CTC based methods 137 | 1. __Unconstrained on-line handwriting recognition with recurrent neural networks.__ _Graves, Alex and Liwicki, Marcus and Bunke, Horst and Schmidhuber, Jurgen and Fernandez, Santiago_. __NIPS__, 2008 [\[paper\]](http://papers.nips.cc/paper/3213-unconstrained-on-line-handwriting-recognition-with) 138 | 2. __Accurate scene text recognition based on recurrent neural network.__ _Su, Bolan and Lu, Shijian_. __ACCV__, 2014 [\[paper\]](https://link.springer.com/chapter/10.1007/978-3-319-16865-4_3) 139 | 3. __STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition.__ _Liu, Wei and Chen, Chaofeng and Wong, Kwan-Yee K and Su, Zhizhong and Han, Junyu_. __BMVC__, 140 | 2016 [\[paper\]](https://pdfs.semanticscholar.org/1cc5/1b093378caa887d25f8d70001ee52874d948.pdf) 141 | 4. __An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition.__ _Shi, Baoguang and Bai, Xiang and Yao, Cong_. __TPAMI__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/7801919/) [\[code\]](https://github.com/bgshih/crnn) 142 | 5. __Reading Scene Text with Attention Convolutional Sequence Modeling.__ _Gao, Yunze and Chen, Yingying and Wang, Jinqiao and Lu, Hanqing_. 2017 [\[paper\]](https://arxiv.org/abs/1709.04303), 143 | 6. __Scene Text Recognition with Sliding Convolutional Character Models.__ _Yin, Fei and Wu, Yi-Chao and Zhang, Xu-Yao and Liu, Cheng-Lin_. 2017 [\[paper\]](https://arxiv.org/abs/1709.01727) 144 | 145 | 146 | ##### 2.2.2 Attention based methods 147 | 1. __Robust scene text recognition with automatic rectification.__ _Shi, Baoguang and Wang, Xinggang and Lyu, Pengyuan and Yao, Cong and Bai, Xiang_. __CVPR__, 2016 [\[paper\]](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Shi_Robust_Scene_Text_CVPR_2016_paper.html) 148 | 2. __Recursive recurrent nets with attention modeling for ocr in the wild.__ _Lee, Chen-Yu and Osindero, Simon_. __CVPR__, 2016 [\[paper\]](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Lee_Recursive_Recurrent_Nets_CVPR_2016_paper.html) 149 | 3. __Visual attention models for scene text recognition.__ _Ghosh, Suman K and Valveny, Ernest and Bagdanov, Andrew D_. __ICDAR__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8270089/) 150 | 4. __Focusing Attention: Towards Accurate Text Recognition in Natural Images.__ _Cheng, Zhanzhan and Bai, Fan and Xu, Yunlu and Zheng, Gang and Pu, Shiliang and Zhou, Shuigeng_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Cheng_Focusing_Attention_Towards_ICCV_2017_paper.pdf) 151 | 5. __Learning to Read Irregular Text with Attention Mechanisms.__ _Yang, Xiao and He, Dafang and Zhou, Zihan and Kifer, Daniel and Giles, C Lee_. __IJCAI__, 2017 [\[paper\]](http://www.ijcai.org/proceedings/2017/0458.pdf) 152 | 6. __Arbitrarily-Oriented Text Recognition.__ _Cheng, Zhanzhan and Liu, Xuyang and Bai, Fan and Niu, Yi and Pu, Shiliang and Zhou, Shuigeng_. __CVPR__, 2017 [\[paper\]](https://arxiv.org/abs/1711.04226) 153 | 7. __Edit Probability for Scene Text Recognition.__, _Bai, Fan and Cheng, Zhanzhan and Niu, Yi and Pu, Shiliang and Zhou, Shuigeng_. __CVPR__, 2018 [\[paper\]](https://arxiv.org/pdf/1805.03384.pdf) 154 | 8. __SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network.__ _Liu, Zichuan and Li, Yixing and Ren, Fengbo and Yu, Hao and Goh, Wangling_. __AAAI__, 2018 [\[paper\]](https://ren-fengbo.lab.asu.edu/sites/default/files/16354-77074-1-pb.pdf) 155 | 9. __Show, attend and read: a simple and strong baseline for recognising irregular text.__ _Hui Li, Peng Wang, Chunhua Shen, Guyu Zhang_. __AAAI__, 2019 [\[paper\]](https://arxiv.org/pdf/1811.00751.pdf) 156 | 10. __Scene Text Recognition from Two-Dimensional Perspective.__ _Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai_. __AAAI__, 2019 [\[paper\]](https://arxiv.org/pdf/1809.06508.pdf) 157 | 11. __ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification.__ _Fangneng Zhan, Shijian Lu_. __CVPR__, 2019 [\[paper\]](https://arxiv.org/abs/1812.05824) 158 | 12. __What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis.__ _Baek, Jeonghun and Kim, Geewook and Lee, Junyeop and Park, Sungrae and Han, Dongyoon and Yun, Sangdoo and Oh, Seong Joon and Lee, Hwalsuk_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Baek_What_Is_Wrong_With_Scene_Text_Recognition_Model_Comparisons_Dataset_ICCV_2019_paper.pdf) 159 | 13. __Symmetry-Constrained Rectification Network for Scene Text Recognition.__ _Yang, Mingkun and Guan, Yushuo and Liao, Minghui and He, Xin and Bian, Kaigui and Bai, Song and Yao, Cong and Bai, Xiang_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Yang_Symmetry-Constrained_Rectification_Network_for_Scene_Text_Recognition_ICCV_2019_paper.pdf) 160 | 161 | #### 2.3 End-to-End Text Spotting 162 | ##### 2.3.1 Separately Trained Two-Stage Methods 163 | 1. __Reading text in the wild with convolutional neural networks.__ _Jaderberg, Max and Simonyan, Karen and Vedaldi, Andrea and Zisserman, Andrew_. __IJCV__, 2016 [\[paper\]](https://link.springer.com/article/10.1007/s11263-015-0823-z) 164 | 2. __Synthetic data for text localisation in natural images.__ _Gupta, Ankush and Vedaldi, Andrea and Zisserman, Andrew_. __CVPR__, 2016 [\[paper\]](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Gupta_Synthetic_Data_for_CVPR_2016_paper.html) [\[code\]](https://github.com/ankush-me/SynthText) 165 | 3. __TextBoxes: A Fast Text Detector with a Single Deep Neural Network.__ _Liao, Minghui and Shi, Baoguang and Bai, Xiang and Wang, Xinggang and Liu, Wenyu_. __AAAI__, 2017 [\[paper\]](https://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/download/14202/14295) [\[code\]](https://github.com/MhLiao/TextBoxes) 166 | 167 | ##### 2.3.2 Jointly Trained Two-Stage Methods 168 | 1. __SEE: Towards Semi-Supervised End-to-End Scene Text Recognition.__ _Bartz, Christian and Yang, Haojin and Meinel, Christoph_. 2017 [\[paper\]](https://arxiv.org/abs/1712.05404) [\[code\]](https://github.com/Bartzi/see) 169 | 2. __Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework.__ _Busta, Michal and Neumann, Lukas and Matas, Jiri_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Busta_Deep_TextSpotter_An_ICCV_2017_paper.pdf) [\[code\]](https://github.com/MichalBusta/DeepTextSpotter) 170 | 3. __Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks.__ _Li, Hui and Wang, Peng and Shen, Chunhua_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Li_Towards_End-To-End_Text_ICCV_2017_paper.pdf) 171 | 4. __An End-to-End TextSpotter With Explicit Alignment and Attention.__ _He, Tong and Tian, Zhi and Huang, Weilin and Shen, Chunhua and Qiao, Yu and Sun, Changming_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/CameraReady/1390.pdf) 172 | 5. __FOTS: Fast Oriented Text Spotting with a Unified Network.__ _Liu, Xuebo and Liang, Ding and Yan, Shi and Chen, Dagui and Qiao, Yu and Yan, Junjie_. __CVPR__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_cvpr_2018/CameraReady/1699.pdf) 173 | 6. __Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes.__ _Lyu, Pengyuan and Liao, Minghui and Yao, Cong and Wu, Wenhao and Bai, Xiang_. __ECCV__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_ECCV_2018/papers/Pengyuan_Lyu_Mask_TextSpotter_An_ECCV_2018_paper.pdf) 174 | 7. __Towards Unconstrained End-to-End Text Spotting.__ _Qin, Siyang and Bissacco, Alessandro and Raptis, Michalis and Fujii, Yasuhisa and Xiao, Ying_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Qin_Towards_Unconstrained_End-to-End_Text_Spotting_ICCV_2019_paper.pdf) 175 | 8. __TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting.__ _Feng, Wei and He, Wenhao and Yin, Fei and Zhang, Xu-Yao and Liu, Cheng-Lin_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Feng_TextDragon_An_End-to-End_Framework_for_Arbitrary_Shaped_Text_Spotting_ICCV_2019_paper.pdf) 176 | 9. __Convolutional character networks.__ _Xing, Linjie and Tian, Zhi and Huang, Weilin and Scott, Matthew R_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Xing_Convolutional_Character_Networks_ICCV_2019_paper.pdf) 177 | 178 | #### 2.4 Auxilliary Techs 179 | 180 | ##### 2.4.1 Synthetic Data 181 | 1. __Synthetic data and artificial neural networks for natural scene text recognition.__ _Jaderberg, Max and Simonyan, Karen and Vedaldi, Andrea and Zisserman, Andrew_. __NIPS__, 2014 [\[paper\]](https://arxiv.org/abs/1406.2227) 182 | 2. __Synthetic data for text localisation in natural images.__ _Gupta, Ankush and Vedaldi, Andrea and Zisserman, Andrew_. __CVPR__, 2016 [\[paper\]](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Gupta_Synthetic_Data_for_CVPR_2016_paper.html) [\[code\]](https://github.com/ankush-me/SynthText) 183 | 3. __Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes.__ _Zhan, Fangneng and Lu, Shijian and Xue, Chuhui_. __ECCV__, 2018 [\[paper\]](https://link.springer.com/chapter/10.1007/978-3-030-01237-3_16) [\[code\]](https://github.com/fnzhan/Verisimilar-Image-Synthesis-for-Accurate-Detection-and-Recognition-of-Texts-in-Scenes) 184 | 4. __UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World.__ _Long, Shangbang and Yao, Cong_. __CVPR__, 2020, [\[paper\]](https://arxiv.org/pdf/2003.10608.pdf) [\[code\]](https://jyouhou.github.io/UnrealText/) 185 | 186 | ##### 2.4.2 Weak/Semi-Supervision 187 | 1. __Wetext: Scene text detection under weak supervision.__ _Tian, Shangxuan and Lu, Shijian and Li, Chongshou_. __ICCV__, 2017 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Tian_WeText_Scene_Text_ICCV_2017_paper.pdf) 188 | 2. __weakly supervised text attention network for generating text proposals in scene images.__ _Rong, Li and MengYi, En and JianQiang, Li and HaiBin, Zhang_. __ICDAR__, 2017 [\[paper\]](https://ieeexplore.ieee.org/abstract/document/8269992/) 189 | 3. __Wordsup: Exploiting word annotations for character based text detection.__ _Hu, Han and Zhang, Chengquan and Luo, Yuxuan and Wang, Yuzhuo and Han, Junyu and Ding, Errui_. __ICCV__, 2018 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2017/papers/Hu_WordSup_Exploiting_Word_ICCV_2017_paper.pdf) 190 | 4. __Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning.__ _Sun, Yipeng and Liu, Jiaming and Liu, Wei and Han, Junyu and Ding, Errui and Liu, Jingtuo_. __ICCV__, 2019 [\[paper\]](http://openaccess.thecvf.com/content_ICCV_2019/papers/Sun_Chinese_Street_View_Text_Large-Scale_Chinese_Text_Reading_With_Partially_ICCV_2019_paper.pdf) 191 | 192 | ##### 2.4.3 Deblurring 193 | 1. __Convolutional neural networks for direct text deblurring.__ _Hradis, Michal and Kotera, Jan and Zemcik, Pavel and Sroubek, Filip_. __BMVC__, 2015 [\[paper\]](http://www.fit.vutbr.cz/research/pubs/habil.php.en?file=%2Fpub%2F10922%2Fhradis15CNNdeblurring.pdf&id=10922) [\[code\]](http://www.fit.vutbr.cz/~ihradis/CNN-Deblur/) 194 | 2. __A blind deconvolution model for scene text detection and recognition in video.__ _Khare, Vijeta and Shivakumara, Palaiahnakote and Raveendran, Paramesran and Blumenstein, Michael_. __PR__, 2016 [\[paper\]](https://www.sciencedirect.com/science/article/pii/S003132031600011X) 195 | 196 | ##### 2.4.4 Context Information 197 | 1. __Could scene context be beneficial for scene text detection?__ _Zhu, Anna and Gao, Renwu and Uchida, Seiichi_. __PR__, 2016 [\[paper\]](https://www.sciencedirect.com/science/article/pii/S0031320316300449) 198 | 199 | ##### 2.4.5 Adversarial Attack 200 | 1. __Adaptive Adversarial Attack on Scene Text Recognition.__ _Yuan, Xiaoyong and He, Pan and Li, Xiaolin Andy_. 2018 [\[paper\]](https://arxiv.org/abs/1807.03326) 201 | 202 | 203 | ##### 2.4.6 Evaluation 204 | 1. __Tightness-Aware Evaluation Protocol for Scene Text Detection.__ _Yuliang Liu, Lianwen Jin, Zecheng Xie, Canjie Luo, Shuaitao Zhang, Lele Xie_. __CVPR__ 2019 [\[paper\]](https://arxiv.org/abs/1904.00813) 205 | 206 | 207 | ### III. Datasets 208 | 209 | | Dataset (Year) | Image Num (train/val/test) | Text Num (train/test) | Orientation| Language| Characteristics | Detec/Recog Task | 210 | |:------:|:------:|:------:|:------:|:------:|:------:|:------:| 211 | |End2End|====|====|====|====|====|====| 212 | | [HierText (2022)](https://github.com/google-research-datasets/hiertext) | 11639 (8281/1724/1634) | 103.8 per image | All | Latin | Word/line/paragraph annotations | ✓/✓ | 213 | | [TextOCR (2021)](https://textvqa.org/textocr/) | 28134 (21778/3124/3232) | 32.1 per image | All | English | - | ✓/✓ | 214 | | [IntelOCR (2021)](https://textvqa.org/textocr/) | 207790 (191059/16731/0) | 10.0 per image | All | English | - | ✓/✓ | 215 | | [ICDAR03 (2003)](http://www.iapr-tc11.org/mediawiki/index.php?title=ICDAR_2003_Robust_Reading_Competitions) | 509 (258/0/251) | 2276 (1110/1156) | Horizontal | En | - | ✓/✓ | 216 | | [ICDAR13 Scene Text(2013)](http://dagdata.cvc.uab.es/icdar2013competition/) | 462 (229/0/233) | - (848/1095) | Horizontal | En | - | ✓/✓ | 217 | | [ICDAR15 Incidental Text(2015)](http://rrc.cvc.uab.es/?ch=4&com=introduction) | 1500 (1000/0/500) | - (-/-) | Multi-Oriented | En | Blur, Small, Defocused | ✓/✓ | 218 | | [ICDAR17 / RCTW (2017)](http://rctw.vlrlab.net/dataset/) | 12263 (8034/0/4229) | - (-/-) | Multi-Oriented | Cn | - | ✓/✓ | 219 | | [Total-Text (2017)](https://github.com/cs-chan/Total-Text-Dataset) | 1555 (1255/0/300) | - (-/-) | Multi-Oriented, Curved | En, Cn | Irregular polygon label | ✓/✓ | 220 | | [SVT (2010)](http://www.iapr-tc11.org/mediawiki/index.php?title=The_Street_View_Text_Dataset) | 350 (100/0/250) | 904 (257/647) | Horizontal| En| - | ✓/✓ | 221 | | [KAIST (2010)](http://www.iapr-tc11.org/mediawiki/index.php?title=KAIST_Scene_Text_Database) | 3000 (-/-/-) | 5000 (-/-) | Horizontal| En, Ko| Distorted | ✓/✓ | 222 | | [NEOCR (2011)](http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset) | 659 (-/-/-) | 5238 (-/-) | Multi-oriented| 8 langs| - | ✓/✓ | 223 | | [CUTE (2014)](http://cs-chan.com/downloads_CUTE80_dataset.html) or [here](https://github.com/Jyouhou/CUTE80) | 80 (-/0/80) | - (-/-) | Curved | En | - | ✓/✓ | 224 | | [CTW (2017)](https://ctwdataset.github.io) | 32K ( 25K/0/6K) | 1M ( 812K/205K) | Multi-Oriented | Cn | Fine-grained annotation | ✓/✓ | 225 | | [CASIA-10K (2018)](https://github.com/Jyouhou/SceneTextPapers/blob/master/datasets/CASIA-10K.md) | 10K (7K/0/3K) | - (-/-) | Multi-Oriented | Cn | | ✓/✓ | 226 | |Detection Only|====|====|====|====|====|====| 227 | | [OSTD (2011)](http://media-lab.ccny.cuny.edu/wordpress/cyi/www/project_scenetextdetection.html) | 89 (-/-/-) | 218 (-/-) | Multi-oriented| En| - | ✓/- | 228 | | [MSRA-TD500 (2012)](http://www.iapr-tc11.org/mediawiki/index.php/MSRA_Text_Detection_500_Database_(MSRA-TD500)) | 500 (300/0/200) | 1719 (1068/651) | Multi-Oriented | En, Cn | Long text | ✓/- | 229 | | [HUST-TR400 (2014)](http://mclab.eic.hust.edu.cn/UpLoadFiles/dataset/HUST-TR400.zip) | 400 (400/0/-) | - (-/-) | Multi-Oriented | En, Cn | Long text | ✓/- | 230 | | [ICDAR17 / RRC-MLT (2017)](http://rrc.cvc.uab.es/?ch=8) | 18000 (9000/9000) | - (-/-/-) | Multi-Oriented | 9 langs | - | ✓/- | 231 | | [CTW1500 (2017)](https://github.com/Yuliang-Liu/Curve-Text-Detector) | 1500 (1000/0/500) | - (-/-) | Multi-Oriented, Curved | En | Bounding box with _14_ vertexes | ✓/- | 232 | |Recognition Only|====|====|====|====|====|====| 233 | | [Char74k (2009)](http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/) | 74107 (-/-/-) | 74107 (-/-) | Horizontal| En, Kannada | Character label | -/✓ | 234 | | [IIIT 5K-Word (2012)](http://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K.html) | 5000 (-/-/-) | 5000 (2000/3000) | Horizontal| -| cropped | -/✓ | 235 | | [SVHN (2010)](http://www.iapr-tc11.org/mediawiki/index.php?title=The_Street_View_House_Numbers_(SVHN)_Dataset) | - (-/-/-) | 600000 (-/-) | Horizontal| -| House number digits | -/✓ | 236 | | [SVTP (2013)](https://github.com/Jyouhou/SceneTextPapers/blob/master/datasets/svt-p.zip) | 639 (-/-/639) | - (-/-) | | En | Distorted | -/✓ | 237 | 238 | 239 | 240 | 241 | 242 | -------------------------------------------------------------------------------- /datasets/.DS_Store: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Jyouhou/SceneTextPapers/8daf1f1937640ba9c10b5eef9fd323aaebdc4d44/datasets/.DS_Store -------------------------------------------------------------------------------- /datasets/CASIA-10K.md: -------------------------------------------------------------------------------- 1 | # README for dataset CASIA-10K 2 | 3 | ## Paper 4 | This dataset is from the paper: __Multi-Oriented and Multi-Lingual Scene Text Detection With Direct Regression.__ _He, Wenhao and Zhang, Xu-Yao and Yin, Fei and Liu, Cheng-Lin._ 2018 [paper](https://ieeexplore.ieee.org/document/8410577/) 5 | 6 | ## Download 7 | - (in Chinese) through[百度网盘/Baidu Pan](https://pan.baidu.com/s/1oDh7xjj7mFtpPdTxTya1gg) 8 | - [Google Drive](https://drive.google.com/open?id=14pjUQsuNsYpZ4VIKwYZShfVgM4OUUy9c) 9 | -------------------------------------------------------------------------------- /datasets/_vnote.json: -------------------------------------------------------------------------------- 1 | { 2 | "created_time": "2020-01-15T04:25:51Z", 3 | "files": [ 4 | { 5 | "attachment_folder": "", 6 | "attachments": [ 7 | ], 8 | "created_time": "2020-01-15T04:25:51Z", 9 | "modified_time": "2020-01-15T04:25:51Z", 10 | "name": "CASIA-10K.md", 11 | "tags": [ 12 | ] 13 | } 14 | ], 15 | "sub_directories": [ 16 | ], 17 | "version": "1" 18 | } 19 | -------------------------------------------------------------------------------- /datasets/svt-p.zip: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Jyouhou/SceneTextPapers/8daf1f1937640ba9c10b5eef9fd323aaebdc4d44/datasets/svt-p.zip --------------------------------------------------------------------------------