
MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

MORAN is a network with a rectification mechanism for general scene text recognition. The paper (accepted to appear in Pattern Recognition, 2019) is available on arXiv, and the online version is available now.

Here is a brief introduction in Chinese.

Improvements of MORAN v2:

A more stable rectification network for one-stage training
Replace the VGG backbone with ResNet
Use a bidirectional decoder (a trick borrowed from ASTER)

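| Version | IIIT5K | SVT | IC03 | IC13 | SVT-P | CUTE80 | IC15 (1811) | IC15 (2077) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MORAN v1 (curriculum training)* | 91.2 | 88.3 | 95.0 | 92.4 | 76.1 | 77.4 | 74.7 | 68.8 |
| MORAN v2 (one-stage training) | 93.4 | 88.3 | 94.2 | 93.2 | 79.7 | 81.9 | 77.8 | 73.9 |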

*The results of v1 were reported in our paper. If this project is helpful for your research, please cite our Pattern Recognition paper.

Requirements

Use pip to install the following libraries.

    pip install -r requirements.txt

Data Preparation

Please convert your own dataset to LMDB format by using the tool provided by @Baoguang Shi.

You can also download the training (NIPS 2014, CVPR 2016) and testing datasets prepared by us.

Training and testing datasets in LMDB format (about 20 GB), password: l8em

The raw pictures of testing datasets can be found here.
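If you convert your own dataset, the sketch below shows one way the LMDB records could be laid out. It assumes the crnn-style convention of 1-based keys `image-%09d` and `label-%09d` plus a `num-samples` counter; check this against the conversion tool and the dataset reader in this repo before relying on it. The helper names `build_lmdb_cache` and `write_lmdb` are hypothetical and not part of this project.

```python
import os


def build_lmdb_cache(samples):
    """Map (image_bytes, label) pairs to LMDB key/value pairs.

    Assumed layout (crnn-style convention, not verified against this repo):
    1-based, zero-padded keys 'image-%09d' and 'label-%09d', plus a
    'num-samples' entry holding the total count.
    """
    cache = {}
    count = 0
    for image_bytes, label in samples:
        count += 1
        cache[("image-%09d" % count).encode()] = image_bytes
        cache[("label-%09d" % count).encode()] = label.encode("utf-8")
    cache[b"num-samples"] = str(count).encode()
    return cache


def write_lmdb(output_path, cache, map_size=1 << 30):
    """Write the cache to an LMDB environment (needs `pip install lmdb`)."""
    import lmdb  # third-party; imported lazily so the rest runs without it

    os.makedirs(output_path, exist_ok=True)
    env = lmdb.open(output_path, map_size=map_size)
    # The transaction commits when the `with` block exits without error.
    with env.begin(write=True) as txn:
        for key, value in cache.items():
            txn.put(key, value)
    env.close()
```

A typical call would read each image file as raw bytes, pair it with its transcription, and pass the resulting cache to `write_lmdb` together with the output folder. Increase `map_size` for datasets larger than 1 GiB.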

Training and Testing

Modify the path to dataset folder in train_MORAN.sh:

	--train_nips path_to_dataset \
	--train_cvpr path_to_dataset \
	--valroot path_to_dataset \

Then start training (decrease the learning rate manually as needed for your task):

	sh train_MORAN.sh
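When deciding which learning rate to set by hand for each restart, a simple step-decay plan can help. The sketch below is only an illustration: the function `decayed_lr` and the values for `base_lr`, `step_size` and `gamma` are hypothetical and are not taken from this project's training script.

```python
def decayed_lr(base_lr, epoch, step_size=2, gamma=0.1):
    """Step decay: multiply base_lr by gamma once every step_size epochs.

    All default values are illustrative assumptions, not project settings.
    """
    return base_lr * (gamma ** (epoch // step_size))


if __name__ == "__main__":
    # Print a plan of learning rates to use when restarting training.
    for epoch in range(6):
        print(epoch, decayed_lr(1.0, epoch))
```

How the chosen value is passed to training depends on the options in train_MORAN.sh, so check that script for the actual setting.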

Demo

Download the model parameter file demo.pth.

BaiduYun (password: l8em)
Google Drive
OneDrive

Put it into the root folder of the repository. Then run demo.py for more visualizations.

	python demo.py

Citation

@article{cluo2019moran,
  author  = {Canjie Luo and Lianwen Jin and Zenghui Sun},
  title   = {MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition},
  journal = {Pattern Recognition}, 
  volume  = {}, 
  number  = {}, 
  pages   = {},
  year    = {2019}, 
}

Acknowledgment

The repo is developed based on @Jieru Mei's crnn.pytorch and @marvis' ocr_attention. Thanks for your contribution.

Attention

The project is only free for academic research purposes.
