Stale hidden states #278

aklein1995 · 2021-06-09T14:09:34Z

Hi!

I was taking a look at your code and wondering if you tackle the stale hidden states after each rollout. As I have seen, the code is used in order to be stateful at episode level, and then, when done is found, the hidden states are reset. However, from one rollout to another, the output hidden state of the last rollout is copied in order to be the input hidden state of the current rollout, although the actor-critic network parameters (including GRU) have already been updated.

Is there any reason why you do not recalculate the last rollouts hidden state taking into account the new network weights?
Thank you in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stale hidden states #278

Stale hidden states #278

aklein1995 commented Jun 9, 2021

Stale hidden states #278

Stale hidden states #278

Comments

aklein1995 commented Jun 9, 2021