Script Identification for Indian Language Scene Text

This repository contains scripts and models for identifying scripts in text using different machine learning approaches. The project is structured into three main sections, each implementing a different method for script identification: CLIP, CRNN, and ViT (Vision Transformer).

Overview

This repository provides implementations for three different script identification methods:

CLIP: A contrastive language-image pre-training model for script identification.
CRNN: A Convolutional Recurrent Neural Network-based approach for recognizing scripts in text images.
ViT: A Vision Transformer-based model for script identification tasks. Each method has its own folder with specific scripts for training, testing, and inference, as well as web app deployments (via FastAPI for CLIP and CRNN). All models are compatible with Python environments, and each method has its own dependencies listed in the respective requirements.txt file

Installation

To get started, clone the repository and install the necessary dependencies for the respective method you wish to use:

Clone the repositoty

git clone https://github.com/Bhashini-IITJ/ScriptIdentification
cd ScriptIdentification

Install dependencies for the desired model (e.g., CLIP, CRNN, or ViT):

Installtion for CRNN

Installtion for CLIP

Installtion for ViT

Usage

Usage of each model can be found in their respective directory.

CRNN Usage

CLIP Usage

ViT Usage

Acknowledgements

We would like to express our gratitude to the authors and contributors of the following repositories for their valuable contributions to this project:

CLIP: We acknowledge OpenAI for the CLIP model, which provided the foundation for our script identification approach using contrastive language-image pre-training.
CRNN: Thanks to the creators of the CRNN model, which served as a core component for the script identification system based on Convolutional Recurrent Neural Networks.
ViT: Special thanks to the developers behind the ViT model, whose implementation of Vision Transformers greatly influenced our approach to script identification.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
CRNN		CRNN
Vit		Vit
clip		clip
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Script Identification for Indian Language Scene Text

Overview

Installation

Clone the repositoty

Install dependencies for the desired model (e.g., CLIP, CRNN, or ViT):

Installtion for CRNN

Installtion for CLIP

Installtion for ViT

Usage

CRNN Usage

CLIP Usage

ViT Usage

Acknowledgements

About

Releases 1

Packages

Contributors 4

Languages

License

Bhashini-IITJ/ScriptIdentification

Folders and files

Latest commit

History

Repository files navigation

Script Identification for Indian Language Scene Text

Overview

Installation

Clone the repositoty

Install dependencies for the desired model (e.g., CLIP, CRNN, or ViT):

Usage

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Languages