Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] How to verify who is the winner of a game? #3265

Open
gzrjzcx opened this issue Apr 6, 2024 · 3 comments
Open

[Question] How to verify who is the winner of a game? #3265

gzrjzcx opened this issue Apr 6, 2024 · 3 comments

Comments

@gzrjzcx
Copy link

gzrjzcx commented Apr 6, 2024

Question

Like the Tennis env e.g., the return infos only inlcude the lives, is this the valid live for a RPG game? Im confused about the meanning of the returned value.

Also, Im interested in how to check who is the winner for the sport game? There is only DONE flag to check the end of the game.

Thanks.

@pseudo-rnd-thoughts
Copy link
Contributor

An agent only wants to maximise its rewards, for symmetric competitive games then it is normally just the sum of rewards, if positive, then your agent wins, otherwise, the opponent wins

@gzrjzcx
Copy link
Author

gzrjzcx commented Apr 7, 2024

Thanks so much.

BTW, Im testing the PPO algo with Tennis env, I found the rewards increased to -1 then stop to rise. Is it means that my agent lose the game always? It looks like the policy is converted to a local optimal strategy. However, the logs show that the env always stop at 99999 step. Im curious that is there any maximum step limitation for the env?

Also, Is there any ways to evaluate the trained model? Or rendering the trained frame to figure out the real performance of the agent?

image
image

@pseudo-rnd-thoughts
Copy link
Contributor

From the looks of it, then the optimal solution is a positive value
https://arxiv.org/pdf/1710.02298.pdf see page 11
I would look at some rendering of the agent playing the environment to understand what is happening

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants