Skip to content

0.4.5

Compare
Choose a tag to compare
@Trinkle23897 Trinkle23897 released this 28 Nov 15:14
· 545 commits to master since this release
3592f45

Bug Fix

  1. Fix tqdm issue (#481)
  2. Fix atari wrapper to be deterministic (#467)
  3. Add writer.flush() in TensorboardLogger to ensure real-time logging result (#485)

Enhancement

  1. Implements set_env_attr and get_env_attr for vector environments (#478)
  2. Implement BCQPolicy and offline_bcq example (#480)
  3. Enable test_collector=None in 3 trainers to turn off testing during training (#485)
  4. Fix an inconsistency in the implementation of Discrete CRR. Now it uses Critic class for its critic, following conventions in other actor-critic policies (#485)
  5. Update several offline policies to use ActorCritic class for its optimizer to eliminate randomness caused by parameter sharing between actor and critic (#485)
  6. Move Atari offline RL examples to examples/offline and tests to test/offline (#485)