Releases: rlberry-py/rlberry
Releases · rlberry-py/rlberry
rlberry-v0.2
Improving interface and tools for parallel execution (#50)
AgentStats
renamed toAgentManager
.AgentManager
can handle agents that cannot be pickled.Agent
interface requireseval()
method instead ofpolicy()
to handle more general agents (e.g. reward-free, POMDPs etc).- Multi-processing and multi-threading are now done with
ProcessPoolExecutor
andThreadPoolExecutor
(allowing nested processes for example). Processes are created withspawn
(jax does not work withfork
, see #51).
New experimental features (see #51, #62)
- JAX implementation of DQN and replay buffer using reverb.
rlberry.network
: server and client interfaces to exchange messages via sockets.RemoteAgentManager
to train agents in a remote server and gather the results locally (usingrlberry.network
).
Logging and rendering:
- Data logging with a new
DefaultWriter
and improved evaluation and plot methods inrlberry.manager.evaluation
. - Fix rendering bug with OpenGL (bf606b4).
Bug fixes.
rlberry-v0.1
feat(project): pre-release test