You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
the timestep in both training and inference are from 1000 to 0. so the default value is workable for inference.
to determine the hyper param, we will quantilize the difference of attention outputs for adjacent diffusion timesteps. Based on the difference, we can then finetune the threshold and gap by visualizing the results.
对于PAB的threshold、gap应该如何确定合适的超参数,需要对比不同step的att数值变化么,有什么经验?以及如果使用full attention是否仍然适用?
The text was updated successfully, but these errors were encountered: