Official Implementations of "A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees" [Experiments in Section 6]
This repository is the official implementation of the UOpt-RPGPD algorithm in A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees.
- You can see the implementation in UOpt-RPGPD-ablation.ipynb.
# make sure you are in Uniform-PAC-Primal-Dual-CMDP/
poetry install
Run all the cells in UOpt-RPGPD-ablation.ipynb.