Open source speech translate stacks #130

ILG2021 · 2023-04-09T11:11:25Z

ILG2021
Apr 9, 2023

Nowadays I am using open source models to realize a speech to speech translator. Because I only have a 1070ti, I have to use ctranslator models. I use faster-whisper(really amazing fast) as the ASR, nllb-200-3.3b-ct2 as the text translator and gTTS for the tts. I found nllb-200 is not very precise so I have to change to deepl api. For the tts, I have tried conqui tts, their models are scattered and not easy to use, so I use gTTS directly. This is the stacks I used now. fast whisper + deepl api + gTTS

For the stacks, can anyone give me some suggestion? Thank you very much.

phineas-pta · 2023-04-12T09:46:57Z

phineas-pta
Apr 12, 2023

you can take a look at this: https://github.com/ardha27/AI-Waifu-Vtuber

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Open source speech translate stacks #130

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Open source speech translate stacks #130

ILG2021 Apr 9, 2023

Replies: 1 comment

phineas-pta Apr 12, 2023

ILG2021
Apr 9, 2023

phineas-pta
Apr 12, 2023