rlberry-v0.4.0
Release of version 0.4.0 of rlberry.
New in 0.4.0
PR #273
- Change the default behavior of plot_writer_data so that if seaborn has version >= 0.12.0 then a 90% percentile interval is used instead of sd.
PR #269
- Add rlberry.envs.PipelineEnv a way to define pipeline of wrappers in a simple way.
PR #262
- PPO can now handle continuous actions.
-
Implementation of Munchausen DQN in rlberry.agents.torch.MDQNAgent.
-
Comparison of MDQN with DQN agent in the long tests.
- Compress the pickles used to save the trained agents.
PR #235
- Implementation of rlberry.envs.SpringCartPole environment, an RL environment featuring two cartpoles linked by a spring.
-
Improve logging, the logging level can now be changed with rlberry.utils.logging.set_level().
-
Introduce smoothing in curves done with plot_writer_data when only one seed is used.
PR #223
- Moved PPO from experimental to torch agents. Tested and benchmarked.