Skip to content

Latest commit

 

History

History
4 lines (4 loc) · 568 Bytes

developping-algorithms.md

File metadata and controls

4 lines (4 loc) · 568 Bytes

Developping Algorithms

Although users may do whatever they like to design and try their algorithms. We recommend wrapping a new algorithm as an Agent class as the example agents. To be compatible with the toolkit, the agent should have the following functions:

  • step: Given the current state, predict the next action.
  • eval_step: Similar to step, but for evaluation purpose. Reinforcement learning algorithms will usually add some noise for better exploration in training. In evaluation, no noise will be added to make predictions.