Skip to content

IAM Dataset

Compare
Choose a tag to compare
@vittoriopippi vittoriopippi released this 02 Nov 14:39
· 55 commits to main since this release

The IAM database contains 13,353 images of handwritten lines of text created by 657 writers. The texts those writers transcribed are from the Lancaster-Oslo/Bergen Corpus of British English. It includes contributions from 657 writers making a total of 1,539 handwritten pages comprising of 115,320 words and is categorized as part of modern collection. The database is labeled at the sentence, line, and word levels.

Terms of usage

The IAM Handwriting Database is publicly accessible and freely available for non-commercial research purposes. If you are using data from the IAM Handwriting Database, we request you to register, so we are aware of who is using our data. If you are publishing scientific work based on the IAM Handwriting Database, we request you to include a reference to the paper.

@article{marti2002iam,
  title={The IAM-database: an English sentence database for offline handwriting recognition},
  author={Marti, U-V},
  journal={International journal on document analysis and recognition},
  volume={5},
  pages={39--46},
  year={2002},
  publisher={Springer}
}

Evaluation settings

In this release there is the data necessary to evaluate the models with the standard procedure described in VATr++: Choose Your Words Wisely for Handwritten Text Generation