Skip to content
This repository has been archived by the owner on Oct 16, 2022. It is now read-only.

Adding a sample_action method for ActorCritic #4

Open
lemikhovalex opened this issue Feb 14, 2021 · 0 comments
Open

Adding a sample_action method for ActorCritic #4

lemikhovalex opened this issue Feb 14, 2021 · 0 comments

Comments

@lemikhovalex
Copy link

lemikhovalex commented Feb 14, 2021

Hello! I've been learning how to code RL form your repo. I've replace duplicating code lines from
def train
def update_policy

to agent's method self.sample_action(). And it seems that agent now solves Cart-Pole problem x2 slower(num of episodes). And it happes everytime. I have no idea what happens with torch and havn't found anything on Internet.
Can you pls help me?

https://github.com/lemikhovalex/pytorch-rl
5_tr - Proximal Policy Optimization (PPO) [CartPole]-Copy1.ipynb

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant