Stability issues #4

Open
rubenftech opened this issue Apr 16, 2024 · 1 comment

@rubenftech

Hi,

Thank you for the amazing work!

While experimenting with your code, despite running the training multiple times, we're observing stability issues. Here is an example of one of the rew_total graphs:
[image: rew_total training curve]

Is this behavior expected, or does it indicate an underlying problem? Is the maximum total reward reached here (around 350) the same as what you got? Additionally, if you could share the graphs from one of your runs, it would help us track down the issue and understand the expected behavior.

Thanks!

@YandongJi
Collaborator

YandongJi commented Apr 16, 2024

Thanks for bringing up the issue! We never actually tried training for 500k; we usually go to 50k at most. Around 50k, your curve looks very similar to mine. The reward scales should be tuned better to make the graph look more stable, and I can try tuning them in the next few days. Could you also evaluate the policy? It should usually perform OK. FYI, this work uses the same reward scale, and it looks like they get similar results.
