0.3.0

Trinkle23897 released this 26 Sep 08:39

· 634 commits to master since this release

Since at this point, the code has largely changed from v0.2.0, we release version 0.3 from now on.

API Change

add policy.updating and clarify collecting state and updating state in training (#224)
change train_fn(epoch) to train_fn(epoch, env_step) and test_fn(epoch) to test_fn(epoch, env_step) (#229)
remove out-of-the-date API: collector.sample, collector.render, collector.seed, VectorEnv (#210)

Bug Fix

fix a bug in DDQN: target_q could not be sampled from np.random.rand (#224)
fix a bug in DQN atari net: it should add a ReLU before the last layer (#224)
fix a bug in collector timing (#224)
fix a bug in the converter of Batch: deepcopy a Batch in to_numpy and to_torch (#213)
ensure buffer.rew has a type of float (#229)

Enhancement

Anaconda support: conda install -c conda-forge tianshou (#228)
add PSRL (#202)
add SAC discrete (#216)
add type check in unit test (#200)
format code and update function signatures (#213)
add pydocstyle and doc8 check (#210)
several documentation fix (#210)

Assets 4