Skip to content

adikamunyao/dataClan

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PenPal: Providing a solution for digitizing and securely storing handwritten notes

Introduction

Handwriting is a crucial method of storing knowledge for the future, yet it is easily destroyed or lost. It is critical to save handwritten papers digitally so that they may be quickly found and kept secure. However, because everyone's handwriting is unique, this is a difficult task. This difficulty can be solved with the aid of PenPal. Handwritten text recognition (HTR) is a method of using a computer to read and comprehend handwritten words. To do this, the computer must be able to separate words and read them in the correct order. Scientists are continually looking for new and better ways to achieve this, such as employing various neural network combinations. A new sort of neural network known as a "vision transformer" has recently been utilized to evaluate photos. They have performed just as well as the greatest neural networks when it comes to picture recognition. The scientists will investigate the use of vision transformers to detect handwritten text in this study.

penpal

Metric of Success

  • Character Error Rate: this is a metric of the performance of an automatic speech recognition (ASR) system. (CER) will be a useful metric to evaluate the performance of our ASR system. It will also help assess the accuracy of the system in converting spoken words to text and identify areas for improvement. By aiming for a lower CER, we can improve the quality of the ASR output and enhance the overall performance of our project.

  • Word Error Rate: this is the ratio of errors in a transcript to the total words spoken. Our project involves speech-to-text transcription, the Word Error Rate (WER) will be a useful metric to evaluate the accuracy of the transcription. It provides a ratio of errors in the transcript to the total words spoken, and a lower WER indicates better accuracy in recognizing speech. By monitoring the WER, we can assess the performance of our speech-to-text system, identify areas for improvement, and adjust the system's parameters to reduce errors and improve the quality of the transcription.

Aims

  • Develop a machine learning model that can recognize different styles of handwriting, so that it can accurately convert handwritten notes into digital format.
  • Create a database to store digitized notes in a secure and organized manner.
  • Implement encryption and other security measures to protect the privacy of the notes.

Challenges & Limitations

There are a few big challenges when it comes to teaching a computer to read handwritten text.

  • One of the biggest challenges is creating a big enough dataset of labeled handwritten text to train the machine learning model. If there isn't enough data, then transfer learning might need to be used.
  • Another key challenge is teaching the computer to recognize words that it hasn't been specifically trained on. This is important if the computer is going to be able to read text in different languages. Currently, most of the available datasets are in English, so this is a big challenge. The computer will also only be able to recognize letters and numbers from the Latin alphabet.

Background

  • Handwritten Text Recognition.

Handwritten text recognition comes in two types: offline and online. We can use offline recognition for handwritten text recognition tasks where we only have access to a static image of the writing. For online recognition, we need a special electronic pen that records the order of the written strokes, so we can use it for tasks that require this level of detail, such as signature verification or handwriting analysis. However, online recognition is less common due to the requirement for specialized equipment.

  • Neural Networks.

Neural networks extract information from handwriting images. They have an input, hidden, and output layer. Deep learning networks have more than three layers and recognize complex patterns. Each node in a layer is connected to all nodes in the next layer, called a fully connected layer. The value of a node is determined by multiplying the input with the weight, passing through an activation function. During training, our goal is to minimize a loss function.

  • Convolutional Neural Networks

A CNN is a neural network for images and audio. It has an input layer, convolutional layers that identify important features, pooling layers that simplify data, and an output layer. Early layers find simple features and later layers find complex shapes to recognize the object in the image. We can use it for image and audio recognition tasks.

Data source

Iam.

EMNIST.

Data Description

  • The EMNIST dataset contains a set of handwritten characters including both upper and lower case alphabets, digits, and special characters. Each character is represented as a grayscale image of size 28x28. The displayed data samples provide a visual representation of what the handwritten characters look like in the dataset. By examining the data samples, we can see that there are variations in handwriting styles, thickness, and slant of the characters. This diversity in handwriting makes the task of character recognition more challenging.

  • The IAM forms dataset contains handwritten text from various sources such as letters, faxes, and application forms. The displayed samples show different types of handwritten text with varying styles and layouts. The dataset presents a challenge for handwriting recognition models due to the complexity and variability of the handwritten text.

modeel

Minimal Viable Product (MVP)

deploy

Conclusions

  • Learning Rate and Batch Size: Reducing the learning rate and batch size may have helped the model converge more effectively during training, leading to better optimization and improved accuracy.
  • Machine learning models can accurately recognize different styles of handwriting, making them a suitable technology for character recognition in the PenPal system.
  • Increased Number of Epochs and Layers: The addition of more epochs and layers to the model has likely provided more opportunities.

Recommendations

  • Penpal Matching Platform: Develop a web application or mobile app that matches penpals based on their interests, hobbies, or language preferences.
  • Language Exchange Platform: Create a language exchange platform that facilitates language learning through pen paling.
  • Virtual Penpal Experience: Develop a virtual penpal experience that simulates the process of exchanging letters with a pen pal.
  • Penpal Blogging Platform: Create a blogging platform that allows users to write and share blog posts about their penpal experiences, cultural exchanges, or language learning journey.
  • Penpal Tracking App: Develop a mobile app that helps users keep track of their penpal relationships, letters exchanged, and milestones reached.

About

capstone project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published