Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CFR agent #203

Open
DoxakisCh opened this issue Jan 29, 2021 · 4 comments
Open

CFR agent #203

DoxakisCh opened this issue Jan 29, 2021 · 4 comments
Assignees

Comments

@DoxakisCh
Copy link

Hi!

I have a question about the cfr agent that you provide in this repo. I read that it is an vanilla cfr agent, but from what I understand i think that this implementation is a chance sampling cfr. Can you please clarify this?

Thank you in advance!!!

@daochenzha
Copy link
Member

@DoxakisCh Thanks for the question! Yes, you are correct. The implementation is chance sampling CFR. The deck is randomly dealt for each new game for traversal. We will modify the document shortly to resolve any confusion.

@DoxakisCh
Copy link
Author

Thank you very much!

I also think that i found a bug in your texas holdem no limit environment and because of that we cant use CFR agents. It has to do with action 3 (raise half pot) and maybe 4 (raise full pot). When the cfr agent performs one of these actions the bet is raised only by 1 because self.dealer.pot in line 97 at file rlcard/rlcard/games/nolimitholdem/round.py remains always 3. Performing CFR iterations is still taking much time but not as before.

@ruzhwei
Copy link
Collaborator

ruzhwei commented Feb 9, 2021

Hi! @DoxakisCh Sorry I can't reproduce the dealer.pot bug. Would you please give more detailed information?

@DoxakisCh
Copy link
Author

Hello!

I cant exactly understand the bug myself but if you want you can add print(self.env.game.dealer.pot) at line 73 of you cfr agent and run an example of no-limit-texas-holdem game. This command will print the pot size after every action. You will notice that all numbers from 3-100 are present which cannot be happening with the available raise actions (raise half and full pot). I am sorry if a cant help you more but I am sure that there is a bug there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants