Basic RL

Based on the CSE 276F lectures, using only agrad, NumPy, and Gymnasium.

Todo

  • reinforce (agrad)
  • PPO
  • TRPO
  • Dyna-Q
  • TD-MPC
  • DDPG
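
As a starting point for the first item, REINFORCE weights each action's log-probability gradient by the discounted return from that step onward. Below is a minimal sketch of that return computation in plain Python. The helper name `discounted_returns` and the default `gamma` are assumptions made for illustration and do not come from this repository.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    `rewards` is the list of per-step rewards from a single episode.
    The name and the default discount are illustrative assumptions.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

For a three-step episode with a reward of 1 per step, `discounted_returns([1.0, 1.0, 1.0], gamma=0.5)` gives `[1.75, 1.5, 1.0]`.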