Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

作者: S. Fang, H. Xie, Y. Wang, Z. Mao and Y. Zhang

A paddle implementation for ABINet (CVPR 2021, Oral).

1. 简介

ABINet使用一个视觉模型和一个显示语言模型来识别场景文字，并且可以端到端地训练。语言模型（BCN）模拟了完形填空式的双向语言模型。另外，该语言模型使用了迭代式的文本修正策略。具体细节可查看abinet.ipynb.
本项目基于PaddleOCR复现，利用其中丰富的OCR相关工具大大减小了项目复现的难度。复现过程中代码参考了ABINet中的实现，提高了本repo复现论文的效率。在此表示感谢。

2.数据集和复现精度

Evaluation datasets, LMDB datasets can be downloaded from BaiduNetdisk(passwd:1dbv), GoogleDrive.

1. ICDAR 2013 (IC13)
2. ICDAR 2015 (IC15)
3. IIIT5K Words (IIIT)
4. Street View Text (SVT)
5. Street View Text-Perspective (SVTP)
6. CUTE80 (CUTE)

paddle版本模型使用的权重是基于ABINet提供的权重转化而来。下载链接（提取码：b83l）

IC13	SVT	IIIT	IC15	SVTP	CUTE	AVG
97.0	93.4	96.4	85.9	89.5	89.2	92.7

3.1 准备环境

框架：
- PaddlePaddle == 2.2.1
- PaddleOCR == 2.4

克隆本项目：

git clone https://github.com/Huntersdeng/abinet-paddle.git
cd abinet-paddle

安装第三方库：
```
pip install -r requirements.txt
```

3.2 快速开始

模型验证:（需要首先下载数据集，并在配置文件中修改数据集的路径）

  python tools/eval.py -c configs/rec/rec_r45_abinet.yml -o Global.pretrained_model='your model path'

模型推断并可视化结果:（需要在配置文件中将“infer_img”字段修改为预测图片的路径）
```
  python tools/infer_rec.py -c configs/rec/rec_r45_abinet.yml -o Global.pretrained_model='your model path'
```

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
configs		configs
figs		figs
modules		modules
ppocr		ppocr
tools		tools
README.md		README.md
abinet.ipynb		abinet.ipynb
charset_36.txt		charset_36.txt
demo.py		demo.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

目录

1. 简介

2.数据集和复现精度

3.1 准备环境

3.2 快速开始

About

Releases

Packages

Languages

Huntersdeng/abinet-paddle

Folders and files

Latest commit

History

Repository files navigation

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

目录

1. 简介

2.数据集和复现精度

3.1 准备环境

3.2 快速开始

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages