Characters Recognition

A Chinese character recognition repository based on convolutional recurrent neural networks (CRNN). (Scan the QR code below to join the WeChat group.)

Performance

Recognize characters in pictures

Dev Environments

WIN 10 or Ubuntu 16.04
PyTorch 1.2.0 (may fix CTC loss) with CUDA 10.0 🔥
yaml
easydict
tensorboardX

Data

Synthetic Chinese String Dataset

Download the dataset
Edit DATASET:ROOT in lib/config/360CC_config.yaml to point to your image path

    DATASET:
      ROOT: 'to/your/images/path'

Download the labels (password: eaqb)
Put char_std_5990.txt in lib/dataset/txt/
And put train.txt and test.txt in lib/dataset/txt/

e.g. test.txt

    20456343_4045240981.jpg 89 201 241 178 19 94 19 22 26 656
    20457281_3395886438.jpg 120 1061 2 376 78 249 272 272 120 1061
    ...
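The index-based label format above can be read with a few lines of Python. This is a minimal sketch, assuming each line holds an image file name followed by space-separated integer indices, and that each index refers to a position in the character list stored in char_std_5990.txt. Whether those indices are 0- or 1-based is not stated here, so the `offset` parameter is an assumption to check against the repository's dataset code. `parse_index_line` and `decode_indices` are hypothetical helpers, not part of the repository.

```python
from typing import List, Tuple


def parse_index_line(line: str) -> Tuple[str, List[int]]:
    """Split 'name.jpg 89 201 ...' into the file name and its label indices."""
    parts = line.split()
    if len(parts) < 2:
        raise ValueError(f"expected a file name and at least one index: {line!r}")
    return parts[0], [int(tok) for tok in parts[1:]]


def decode_indices(indices: List[int], alphabet: List[str], offset: int = 0) -> str:
    """Map label indices to characters; `offset` is an assumption (0- or 1-based)."""
    return "".join(alphabet[i - offset] for i in indices)


name, indices = parse_index_line(
    "20456343_4045240981.jpg 89 201 241 178 19 94 19 22 26 656"
)
print(name, len(indices))

# Toy alphabet for illustration only; the real one would come from char_std_5990.txt.
toy_alphabet = ["a", "b", "c"]
print(decode_indices([2, 0, 1], toy_alphabet))
```

Every line in the example has the same number of indices, which matches the fixed-length training mentioned in this README.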

Or your own data

Edit DATASET:ROOT in lib/config/OWN_config.yaml to point to your image path

    DATASET:
      ROOT: 'to/your/images/path'

And put your train_own.txt and test_own.txt in lib/dataset/txt/

e.g. test_own.txt

    20456343_4045240981.jpg 你好啊！祖国！
    20457281_3395886438.jpg 晚安啊！世界！
    ...
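For the text-label format, a loader has to separate the file name from the label and turn each character into an integer class. The sketch below assumes the file name contains no spaces and splits on the first whitespace only, so a label that itself contains spaces survives. The alphabet here is built from the labels themselves, and index 0 is left free for the CTC blank. Both are illustrative assumptions (PyTorch's `CTCLoss` defaults to `blank=0`, but the repository's own convention may differ). `parse_text_line` and `build_char_to_index` are hypothetical names.

```python
from typing import Dict, List, Tuple


def parse_text_line(line: str) -> Tuple[str, str]:
    """Split 'name.jpg some text' on the first run of whitespace only."""
    parts = line.strip().split(maxsplit=1)
    if len(parts) != 2:
        raise ValueError(f"expected a file name and a label: {line!r}")
    return parts[0], parts[1]


def build_char_to_index(labels: List[str]) -> Dict[str, int]:
    """Assign indices starting at 1, leaving 0 for the CTC blank (an assumption)."""
    chars = sorted({ch for text in labels for ch in text})
    return {ch: i for i, ch in enumerate(chars, start=1)}


lines = [
    "20456343_4045240981.jpg 你好啊！祖国！",
    "20457281_3395886438.jpg 晚安啊！世界！",
]
samples = [parse_text_line(line) for line in lines]
char_to_index = build_char_to_index([text for _, text in samples])
encoded = [[char_to_index[ch] for ch in text] for _, text in samples]
print(samples[0][0], encoded[0])
```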

Note: fixed-length training is supported. To train on variable-length labels, modify the dataloader.
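One way to approach variable-length labels is to flatten the label sequences of a batch into a single list and keep a separate list of their lengths. PyTorch's `CTCLoss` accepts targets in this concatenated form together with `target_lengths`. The sketch below shows only that bookkeeping in plain Python. `collate_labels` is a hypothetical helper, not the repository's dataloader, and the images in a batch would still need a common width (for example by resizing or padding), which is not shown.

```python
from typing import List, Tuple


def collate_labels(labels: List[List[int]]) -> Tuple[List[int], List[int]]:
    """Flatten variable-length label sequences and record each sequence length."""
    flat = [idx for seq in labels for idx in seq]
    lengths = [len(seq) for seq in labels]
    return flat, lengths


# Three samples with 3, 1 and 4 characters respectively (made-up indices).
batch = [[5, 2, 9], [7], [3, 3, 8, 1]]
targets, target_lengths = collate_labels(batch)
print(targets)
print(target_lengths)
```

In a PyTorch dataloader, logic like this would typically live in a custom `collate_fn`, with both lists converted to integer tensors before being passed to the loss.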

Train

   [run] python train.py --cfg lib/config/360CC_config.yaml
or [run] python train.py --cfg lib/config/OWN_config.yaml

#### loss curve

   [run] cd output/360CC/crnn/xxxx-xx-xx-xx-xx/
   [run] tensorboard --logdir log

loss overview (first epoch)

Demo

   [run] python demo.py --image_path images/test.png --checkpoint output/checkpoints/mixed_second_finetune_acc_97P7.pth

huohuaqi/CRNN_Chinese_Characters_Rec
