Release v1.1.1
Benchmark
The benchmark results of IQL and NFQ have been added to d3rlpy-benchmarks. Plus, the results of the more random seeds up to 10 have been added to all algorithms. The benchmark results are more reliable now.
Documentation
- More descriptions have been added to
Finetuning
tutorial page. Offline Policy Selection
tutorial page has been added
Enhancements
cloudpickle
andGPUUtil
dependencies have been removed.- gaussian likelihood computation for MOPO becomes more mathematically right (thanks @tominku )