Skip to content

Commit

Permalink
first commit
Browse files Browse the repository at this point in the history
  • Loading branch information
abdoelsayed2016 committed Sep 24, 2020
0 parents commit 65f57cb
Show file tree
Hide file tree
Showing 12 changed files with 60 additions and 0 deletions.
Binary file added Application_Form/Application_Form_for_HKR.doc
Binary file not shown.
60 changes: 60 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,60 @@
# HKR_Dataset
The SCUT-HCCDoc Dataset for Russian and Kazakh database (with about 95% of Russian and 5% of
Kazakh words/sentences respectively) for offline handwriting recognition.
The dataset can be downloaded through the following link:

- [Cloud](https://cloud.mail.ru/public/25xw/2YPdtaFAF/)

Note: The HKR Dataset can only be used for non-commercial research purpose.
For researchers who wants to use the HKR database, please first fill
in this [Application Form](Application_Form/Application_Form_for_HKR.doc)
and send it via email to us ([[email protected]](mailto:[email protected]),[[email protected]](mailto:[email protected])).

## Description
The database is written in Cyrillic and shares the same 33 characters. Besides these characters, the Kazakh alphabet also contains 9 additional
specific characters. This dataset is a collection of forms. The sources of all the forms in the datasets were generated by LATEX which subsequently was filled out by
persons with their handwriting. The database consists of more than 1400 filled forms. There are approximately 63000 sentences, more than 715699 symbols produced by approximately 200 diferent writers.
We utilized three different datasets described as following:
* Handwritten samples (Forms) of keywords in Kazakh and Russian (Areas, Cities , Village , etc.)
* Handwritten Kazakh and Russian alphabet in cyrillic
* Handwritten samples (Forms) of poems in Russian

The following are some sample of forms from HKR dataset:

![sample of form](images/sample1.png)

The following are some word images after segmented the forms:

![](images/0_9_16.jpg)
![word1](images/0_9_623.jpg)
![word2](images/0_10_23.jpg)
![word3](images/0_10_30.jpg)
![word4](images/0_10_615_.jpg)
![word5](images/0_11_55.jpg)
![word6](images/0_13_55.jpg)
![word7](images/0_13_614.jpg)


For example, the following image shows the number of character in the HKR dataset.

![distribution](images/sample2.png)

## Citation and Contact
Please consider to cite our paper when you use our dataset:
```
@article{nurseitov2020hkr,
title={Hkr for handwritten kazakh \& russian database},
author={Nurseitov, Daniyar and Bostanbekov, Kairat and Kurmankhojayev, Daniyar and Alimova, Anel and Abdallah, Abdelrahman},
journal={arXiv preprint arXiv:2007.03579},
year={2020}
}
```
For any quetions about the dataset please contact the authors by sending email toProf. Daniyar Nurseitov
([[email protected]](mailto:[email protected])), Dr. Kairat Bostanbekov
([[email protected]](mailto:[email protected]))






Binary file added images/0_10_23.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_10_30.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_10_615_.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_11_55.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_13_55.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_13_614.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_9_16.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/0_9_623.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/sample1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/sample2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 65f57cb

Please sign in to comment.