batch info None in target_q of ddpg #1241

JoshuaSchenk · 2025-02-03T13:17:49Z

The target_q function of the DDPG policy sets the info field of the batch to a list of None values, one per sampled index. Why does this happen?

def target_q(self, buffer: ReplayBuffer, indices: np.ndarray) -> torch.Tensor:
    obs_next_batch = Batch(
        obs=buffer[indices].obs_next,
        info=[None] * len(indices),  # WHY?
    )  # obs_next: s_{t+n}
    return self.critic_old(obs_next_batch.obs, self(obs_next_batch, model="actor_old").act)
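
To make the question concrete, here is a minimal, library-free sketch of the same pattern. Everything in it (`ObsBatch`, `actor`, `critic`, the toy arithmetic) is a hypothetical stand-in and not Tianshou's actual classes or API. It only illustrates one possible reading: if the actor's forward pass reads nothing but `obs`, then `info` is just a placeholder with one entry per sampled index and has no effect on the computed target. Whether that is the real reason in the library is an assumption.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any, Sequence


@dataclass
class ObsBatch:
    """Hypothetical stand-in for a batch with observations and per-sample info."""

    obs: list[float]
    info: list[Any]


def actor(batch: ObsBatch) -> list[float]:
    # Hypothetical actor: reads only batch.obs and never touches batch.info.
    return [2.0 * o for o in batch.obs]


def critic(obs: Sequence[float], act: Sequence[float]) -> list[float]:
    # Hypothetical critic: Q(s, a) = s + a, chosen purely for illustration.
    return [o + a for o, a in zip(obs, act)]


def target_q(obs_next: Sequence[float], indices: Sequence[int]) -> list[float]:
    # Mirrors the shape of the snippet above: build a batch from the sampled
    # next observations and fill info with one None per sampled index.
    batch = ObsBatch(
        obs=[obs_next[i] for i in indices],
        info=[None] * len(indices),  # placeholder, same length as obs
    )
    return critic(batch.obs, actor(batch))


stored_obs_next = [0.5, 1.0, 1.5, 2.0]
q_values = target_q(stored_obs_next, indices=[0, 2])
print(q_values)
```

In this toy version the placeholder keeps `obs` and `info` the same length, and replacing the `None` entries with any other values would leave `q_values` unchanged, because neither stand-in function reads `info`.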
