You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The SS task model is significantly slower than the SE model. Why is this the case?
When compared with the same mossformer-16k.
Also, for the SE task, mossformer-GAN is far superior to mossformer-se. However, it is much slower. Is there any way to have a model that is both fast and accurate?
(The better here means that GAN maintains the voice style without much distortion, but mossformer-se is very distorted.)
The text was updated successfully, but these errors were encountered:
Hi, thank you for your feedback. SS model is time-domain basis, the window size is quite small. That means it needs to process more frames compared to the time-frequency dimain se model.
MossformerGAN is processing on both magnitude and phases, and trained using GAN. So it focuses more on perception, while mossformer-se is masking basis, and trained using mse losses
The SS task model is significantly slower than the SE model. Why is this the case?
When compared with the same mossformer-16k.
Also, for the SE task, mossformer-GAN is far superior to mossformer-se. However, it is much slower. Is there any way to have a model that is both fast and accurate?
(The better here means that GAN maintains the voice style without much distortion, but mossformer-se is very distorted.)
The text was updated successfully, but these errors were encountered: