Hyperparameter tuning + TRPO

araffin released this 28 May 09:19

· 102 commits to master since this release

added hyperparameter tuning using optuna
a2c for continuous actions
upgrade stable-baselines (v2.5.1)
add support for trpo + mpi training
fixed frame stack loading

now more than 100 trained agents.

Assets 2