
Is it really as fast as advertised? #115

Open
ghost opened this issue Jul 10, 2023 · 2 comments

Comments


ghost commented Jul 10, 2023

Truth be told, I fine-tuned the original Tortoise TTS autoregressive model on an ASMR voice to get breathy responses, and I loved the outputs. To make the pipeline production-ready, I thought of switching to tortoise-tts-fast. I installed Python 3.8 in a new conda environment and installed the torch versions mentioned in some of the open/closed issues.

However, I haven't noticed any improvement in generation time. I am running it on an A10G, and VRAM usage unnecessarily climbs to 21 GB. Outputs are not great with the 'ultra_fast' preset; I need at least the 'fast' or 'very_fast' preset. Am I doing something wrong? Everything works without errors, and I am running it through the Streamlit app.py file. Would anyone like to comment?

@Turbine1991

Are you using the .sh script or calling it programmatically? (i.e., loading the components once versus reloading them on each call)

@seanfcastillo

I've learned you can seriously affect the speed by tweaking the advanced and tuning options. Adjusting the number of autoregressive samples, as well as the amount of text processed in each batch, made a big difference; you can then apply the quality extras on top, like cond_free, to make it sound good.
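A small sketch of how those knobs could be layered over a preset before being passed to the synthesis call. The parameter names (`num_autoregressive_samples`, `diffusion_iterations`, `cond_free`) match keyword arguments Tortoise's `tts` accepts, but the preset values below are illustrative assumptions, not the library's exact numbers.

```python
# Quality/speed presets roughly mirroring Tortoise's built-ins
# (values here are illustrative, not the library's exact defaults).
PRESETS = {
    "ultra_fast": {"num_autoregressive_samples": 16, "diffusion_iterations": 30},
    "fast":       {"num_autoregressive_samples": 96, "diffusion_iterations": 80},
}

def build_kwargs(preset, **overrides):
    """Start from a preset, then override individual knobs (e.g. cond_free)."""
    kwargs = dict(PRESETS[preset])
    kwargs.update(overrides)
    return kwargs

# Fewer autoregressive samples trade quality for speed; cond_free is a
# quality extra layered on top.
settings = build_kwargs("fast", num_autoregressive_samples=32, cond_free=True)
# These would then be forwarded to the synthesis call, e.g.:
#   tts.tts(text, voice_samples=samples, **settings)   # assumed API surface
```

Lowering `num_autoregressive_samples` is usually the biggest single lever on generation time, since each sample is a full autoregressive pass.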
