Reinforcement Learning

What is this?

This is the joint Master's thesis by Thor Bagge and Kent Grigo. You can read our thesis here.

We combine reinforcement-learning algorithms and search algorithms with a neural network to make a strong player for Connect Four.

Connect Four is like an extended Tic-Tac-Toe where you have a board with six rows and seven columns. The board is vertical, meaning that you can only insert a disc at the bottom and then let them stack. The end goal is to get four disks in a vertical, horizontal, or diagonal line.

This project contains the reinforcement-learning algorithms:

Temporal learning (TD-lambda)
SARSA

combined with the noise-injection methods:

Epsilon greedy
Softmax

and the following search algorithms:

Minimax
Monte-Carlo Tree Search (MCTS)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
build		build
dist		dist
libs		libs
nbproject		nbproject
src		src
temporary networks		temporary networks
.gitignore		.gitignore
Master's Thesis - Thor Bagge and Kent Grigo.pdf		Master's Thesis - Thor Bagge and Kent Grigo.pdf
README.md		README.md
build.xml		build.xml
manifest.mf		manifest.mf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reinforcement Learning

What is this?

About

Uh oh!

Releases

Packages

Languages

KentGrigo/Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning

What is this?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages