A toolset to quickly make custom datasets for TTS #2323

rioharper · 2023-02-04T23:28:08Z

rioharper
Feb 4, 2023

Hi!

A while back, I made a tutorial for how to create datasets using your own voice and shared it here but the last few weeks I have automated that process greatly with the following:

Audio processing to remove non speech elements, overlapping speech, and isolate the same speaker across multiple files
Transcription of audio using OpenAI's whisper, CTC segmentation to clip the audio into usable chunks, and use text normalization to create a voice dataset in the same format as LJSpeech

I have been using it for a few datasets, and while its not perfect its brought down time spent creating custom datasets by 90% or more.

The github link is https://github.com/rioharper/VocalForge

Let me know what you think!

DrewThomasson · 2024-10-17T01:21:54Z

DrewThomasson
Oct 17, 2024

Amazing! I was looking for something like this! :)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A toolset to quickly make custom datasets for TTS #2323

{{title}}

Replies: 1 comment

{{title}}

Select a reply

A toolset to quickly make custom datasets for TTS #2323

rioharper Feb 4, 2023

Replies: 1 comment

DrewThomasson Oct 17, 2024

rioharper
Feb 4, 2023

DrewThomasson
Oct 17, 2024