Voice-Activity-Detection

Voice Activity Detection (VAD) is a critical component in many speech processing applications. It involves distinguishing between segments of audio that contain speech and those that contain non-speech (silence, background noise, etc.). This project aims to evaluate and compare the performance of several state-of-the-art VAD models on a diverse set of languages.

Compares the performance of various VAD models across different languages. The models evaluated include:

pyannote.audio
SpeechBrain
FunASR
Silero

It includes the full implementation of their inference.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
evaluation		evaluation
helper		helper
testing		testing
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice-Activity-Detection

About

Releases

Packages

Languages

sOR-o/Voice-Activity-Detection

Folders and files

Latest commit

History

Repository files navigation

Voice-Activity-Detection

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages