-
Notifications
You must be signed in to change notification settings - Fork 655
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CFR agent #203
Comments
@DoxakisCh Thanks for the question! Yes, you are correct. The implementation is chance sampling CFR. The deck is randomly dealt for each new game for traversal. We will modify the document shortly to resolve any confusion. |
Thank you very much! I also think that i found a bug in your texas holdem no limit environment and because of that we cant use CFR agents. It has to do with action 3 (raise half pot) and maybe 4 (raise full pot). When the cfr agent performs one of these actions the bet is raised only by 1 because self.dealer.pot in line 97 at file rlcard/rlcard/games/nolimitholdem/round.py remains always 3. Performing CFR iterations is still taking much time but not as before. |
Hi! @DoxakisCh Sorry I can't reproduce the dealer.pot bug. Would you please give more detailed information? |
Hello! I cant exactly understand the bug myself but if you want you can add print(self.env.game.dealer.pot) at line 73 of you cfr agent and run an example of no-limit-texas-holdem game. This command will print the pot size after every action. You will notice that all numbers from 3-100 are present which cannot be happening with the available raise actions (raise half and full pot). I am sorry if a cant help you more but I am sure that there is a bug there. |
Hi!
I have a question about the cfr agent that you provide in this repo. I read that it is an vanilla cfr agent, but from what I understand i think that this implementation is a chance sampling cfr. Can you please clarify this?
Thank you in advance!!!
The text was updated successfully, but these errors were encountered: