Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add a circle detection as penalty to agent and add mps support to the MLP #36

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

Sean652039
Copy link

在训练过程中对蛇增加了一个惩罚。如果蛇的身子包住了空格子,将会根据包围住的格子大小受到惩罚。该策略目的在于使蛇在一定程度上折叠自己的身子。(图片显示在某一局蛇的长度达到了103)

A penalty was added to the snake during training. If the snake's body wrapped around empty cells, it would be penalized according to the size of the cells. The purpose of the strategy is to make the snake fold its body to some extent. (Picture shows a snake reaching 103 in one game)

IMG_2434

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant