You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi Acme team, I think JAX DQN might set the evaluation epsilon to the exploration epsilon if deterministic evaluation is requested (eps=0.0, here). Replacing this with self._config.eval_epsilon is not None fixed it for me - does this also occur on your end?
Thanks so much for checking!
The text was updated successfully, but these errors were encountered:
Hi Acme team, I think JAX DQN might set the evaluation epsilon to the exploration epsilon if deterministic evaluation is requested (eps=0.0, here). Replacing this with
self._config.eval_epsilon is not None
fixed it for me - does this also occur on your end?Thanks so much for checking!
The text was updated successfully, but these errors were encountered: