Skip to content

Latest commit

 

History

History
13 lines (9 loc) · 297 Bytes

File metadata and controls

13 lines (9 loc) · 297 Bytes

gradient-ascent-cartpoleEnv

Open AI Cartpole environment gradient ascent

Implementation of gradient ascent for policy learning in DRL

Includes

  • Cross entropy model
  • Adoptive noise scaling

Other methods for gradient ascent (TO DO)

  • Steepest ascent (Hill climbing)
  • Stimulated annealing