add a circle detection as penalty to agent and add mps support to the MLP #36
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
在训练过程中对蛇增加了一个惩罚。如果蛇的身子包住了空格子,将会根据包围住的格子大小受到惩罚。该策略目的在于使蛇在一定程度上折叠自己的身子。(图片显示在某一局蛇的长度达到了103)
A penalty was added to the snake during training. If the snake's body wrapped around empty cells, it would be penalized according to the size of the cells. The purpose of the strategy is to make the snake fold its body to some extent. (Picture shows a snake reaching 103 in one game)