slt-demo

Sign Language Translation - Gradio web app for Video to Text

This demo is a proof of concept for an American Sign Language (ASL) translation system. The backend uses YOLOv8, MediaPipe, and T5, trained on the YouTube-ASL dataset by the CV team at UWB. See T5_for_SLT and PoseEstimation.
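To make the data flow concrete, here is a minimal sketch of a three-stage video-to-text pipeline. It is not the code in this repository: the class name, the stage names, and the assignment of each model to a stage (YOLOv8 for locating the signer, MediaPipe for keypoints, T5 for text generation) are assumptions made for illustration, and the stages are injected as plain callables so the sketch runs without any model weights.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical placeholder types; the real project may represent these differently.
Frame = object            # one decoded video frame
Keypoints = List[float]   # flattened keypoints for one frame


@dataclass
class SLTPipeline:
    """Chains three stages; each stage is supplied by the caller."""

    detect_signer: Callable[[Frame], Frame]           # assumed: YOLOv8-style crop
    extract_keypoints: Callable[[Frame], Keypoints]   # assumed: MediaPipe-style landmarks
    translate: Callable[[Sequence[Keypoints]], str]   # assumed: T5-style text generation

    def __call__(self, frames: Sequence[Frame]) -> str:
        # Crop every frame, turn each crop into keypoints, then translate the sequence.
        crops = [self.detect_signer(frame) for frame in frames]
        features = [self.extract_keypoints(crop) for crop in crops]
        return self.translate(features)


if __name__ == "__main__":
    # Dummy stages stand in for the real models.
    pipeline = SLTPipeline(
        detect_signer=lambda frame: frame,
        extract_keypoints=lambda crop: [0.0, 0.0],
        translate=lambda feats: f"<{len(feats)} frames translated>",
    )
    print(pipeline(["frame0", "frame1", "frame2"]))
```

Passing the stages in as callables keeps the sketch independent of any particular model library; in the actual demo the corresponding logic lives in backend.py and predict_pose.py.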

Find the demo on Hugging Face Spaces

How to run locally

Clone repository

git clone https://github.com/JSALT2024/slt-demo.git

Install dependencies

pip install -r requirements.txt

Run the demo

python sltGradio.py

Open http://127.0.0.1:7860/ in your browser.
You are good to go! Upload any video or use your webcam 📷
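If the page does not load, it can help to check from a script whether anything is answering on the local address. The sketch below uses only the Python standard library; the function name `is_demo_running` is a hypothetical helper, not part of this repository, and it only tests that an HTTP server responds, not that the models have loaded.

```python
import urllib.error
import urllib.request


def is_demo_running(url: str = "http://127.0.0.1:7860/", timeout: float = 2.0) -> bool:
    """Return True if an HTTP server answers at `url` with a non-error status."""
    # Ignore system proxy settings, since the target is a local address.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("Demo reachable:", is_demo_running())
```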

Code structure

slt-demo/
├── backend.py
├── predict_pose.py
├── sltGradio.py
├── checkpoints/
│   ├── pose/
│   │   └── [pose model files]
│   └── t5-v1_1-base/
│       └── [T5 model files]
├── configs/
│   └── predict_config_demo.yaml
├── dataset/
│   └── generic_sl_dataset.py
├── model/
│   ├── configuration_t5.py
│   └── modeling_t5.py
├── utils/
│   └── translation.py
└── video/
    └── [example videos]
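Because the demo depends on model files and a config at fixed locations, a quick check before launching can save a failed start. The following sketch lists which of the paths from the tree above are absent. The helper name `missing_paths` is hypothetical, and whether the app needs every one of these paths at start-up is an assumption.

```python
from pathlib import Path
from typing import List

# Paths copied from the tree above; that all of them are required
# at start-up is an assumption.
EXPECTED_PATHS = [
    "checkpoints/pose",
    "checkpoints/t5-v1_1-base",
    "configs/predict_config_demo.yaml",
]


def missing_paths(repo_root: str = ".") -> List[str]:
    """Return the expected paths that do not exist under `repo_root`."""
    root = Path(repo_root)
    return [p for p in EXPECTED_PATHS if not (root / p).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All expected files are in place.")
```

Run it from the repository root, or pass the path to your clone as `repo_root`.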
