0.2.6

@Trinkle23897 released this 19 Aug 07:21
· 652 commits to master since this release
a9f9940

API Change

  1. Replay buffer now allows stack_num = 1 (#165)
  2. Add policy.update to enable post-processing, and remove collector.sample (#180)
  3. Remove collector.close and rename VectorEnv to DummyVectorEnv (#179)

Enhancement

  1. Enable async simulation for all vector envs (#179)
  2. Improve PER (#159): use segment tree and enable all Q-learning algorithms to use PER
  3. Unify single-env and multi-env handling in collector (#157)
  4. Make the replay buffer pickle-compatible and improve buffer.get (#182): fixes #84 and makes the buffer more efficient
  5. Add ShmemVectorEnv implementation (#174)
  6. Add Dueling DQN implementation (#170)
  7. Add profile workflow (#143)
  8. Add BipedalWalkerHardcore-v3 SAC example (#177) (well-trained after about 1 hour)
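
For readers unfamiliar with why a segment tree helps PER (item 2 above), here is a minimal, generic sketch of a sum tree used for priority-proportional sampling. It is not the code from #159: the class name SumTree and its methods are hypothetical, chosen only for illustration. The idea is that both updating one priority and locating the item that covers a random point of the cumulative priority mass take O(log n) instead of O(n).

```python
import numpy as np


class SumTree:
    """Hypothetical sum segment tree over `size` priorities (illustrative only)."""

    def __init__(self, size):
        # Round the leaf count up to a power of two so the tree is perfect.
        self.size = size
        self.bound = 1
        while self.bound < size:
            self.bound *= 2
        # Node i has children 2i and 2i + 1; leaves occupy [bound, 2 * bound).
        self.tree = np.zeros(2 * self.bound)

    def update(self, index, priority):
        """Set one priority and refresh the partial sums on the path to the root."""
        i = index + self.bound
        self.tree[i] = priority
        i //= 2
        while i >= 1:
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]
            i //= 2

    def total(self):
        """Sum of all priorities (stored at the root)."""
        return self.tree[1]

    def find_prefix(self, value):
        """Return the first index whose cumulative priority exceeds `value`."""
        i = 1
        while i < self.bound:
            left = 2 * i
            if value < self.tree[left]:
                i = left
            else:
                value -= self.tree[left]
                i = left + 1
        # Guard against landing on an unused padding leaf.
        return min(i - self.bound, self.size - 1)

    def sample(self, rng):
        """Draw one index with probability proportional to its priority."""
        return self.find_prefix(rng.uniform(0.0, self.total()))


tree = SumTree(4)
for idx, prio in enumerate([1.0, 2.0, 3.0, 4.0]):
    tree.update(idx, prio)
drawn = tree.sample(np.random.default_rng(0))  # an index in 0..3
```

With priorities 1, 2, 3, 4 the cumulative ranges are [0, 1), [1, 3), [3, 6), [6, 10), so index 3 is drawn roughly 40% of the time.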

Bug fix

  1. Fix multi-dimensional action handling reported in #162 (#160)

Note: 0.3 is coming soon!