Questions about generation quality #33

Open
xizaoqu opened this issue Sep 24, 2024 · 1 comment

Comments

xizaoqu commented Sep 24, 2024

Hi, thanks for presenting this interesting paper.

Table 2 shows that Show-o achieves impressive generation ability (better than SDv1.5) with a much smaller training scale.

Could you provide some insights about why the discrete diffusion process can be even better than continuous diffusion with many denoising steps?

Thanks.

Sierkinhane (Collaborator) commented

Hello, as the LAION data is currently not accessible, a fair comparison is a bit hard to make. However, what we want to demonstrate in our experiments is that, with relatively sufficient data, discrete diffusion can perform well. I think one of the most important factors is the quality of the image-text pairs.
