You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello @XiangLi1999 and @ari-holtzman,
I'm trying to follow the implementation of the typical decoding baselines in the code base. Here a typical argument is passed, but the generate method does not accept any such argument but only an optional argument typical_p. At the same time I am unable to find any call to model_kwargs['"typical"] anywhere so it seems no warping of the logits to do typical sampling is ever performed.
Am I missing something trivial?
The text was updated successfully, but these errors were encountered:
Thanks for pointing this out! I think this is my bad, the right thing is to change "typical=0.95" to "typical_p=0.95". I will fix this code release and double check my experiments. This might just be something I dropped when migrating my code base from dev to release.
Hello @XiangLi1999 and @ari-holtzman,
I'm trying to follow the implementation of the typical decoding baselines in the code base.
Here a
typical
argument is passed, but the generate method does not accept any such argument but only an optional argumenttypical_p
. At the same time I am unable to find any call tomodel_kwargs['"typical"]
anywhere so it seems no warping of the logits to do typical sampling is ever performed.Am I missing something trivial?
The text was updated successfully, but these errors were encountered: