Keras-oke

Exploring a pipeline for generating karaoke videos via Automatic Speech Recongnition inference

pip install spleeter

for aeneas pipeline:

sudo apt-get install libespeak-dev
pip install numpy
pip install aeneas

for whisper pipeline(deprecated): OpenAI whisper

pip install git+https://github.com/openai/whisper.git

for correct whisper dependency versions, make sure you have CUDA >= 11.1 and run

pip install torchvision==0.11.0+cu113 torchaudio==0.10.0 -f https://download.pytorch.org/whl/torch_stable.html

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Aeneas		Aeneas
kerasoke		kerasoke
.code-workspace		.code-workspace
.gitignore		.gitignore
AllAlongTheWatchtower.mp3		AllAlongTheWatchtower.mp3
Anaconda.mp3		Anaconda.mp3
BabyGotBack.mp3		BabyGotBack.mp3
README.md		README.md
Snake.mp3		Snake.mp3
aeneas_pipeline.ipynb		aeneas_pipeline.ipynb
whisper_pipeline.ipynb		whisper_pipeline.ipynb