Skip to content

Master's thesis: "Getting to Know the Captain’s Mistress with Reinforcement Learning" by Thor Bagge and Kent Grigo

Notifications You must be signed in to change notification settings

KentGrigo/Reinforcement-Learning

Repository files navigation

Reinforcement Learning

What is this?

This is the joint Master's thesis by Thor Bagge and Kent Grigo. You can read our thesis here.

We combine reinforcement-learning algorithms and search algorithms with a neural network to make a strong player for Connect Four.

Connect Four is like an extended Tic-Tac-Toe where you have a board with six rows and seven columns. The board is vertical, meaning that you can only insert a disc at the bottom and then let them stack. The end goal is to get four disks in a vertical, horizontal, or diagonal line.

This project contains the reinforcement-learning algorithms:

  • Temporal learning (TD-lambda)
  • SARSA

combined with the noise-injection methods:

  • Epsilon greedy
  • Softmax

and the following search algorithms:

  • Minimax
  • Monte-Carlo Tree Search (MCTS)

About

Master's thesis: "Getting to Know the Captain’s Mistress with Reinforcement Learning" by Thor Bagge and Kent Grigo

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages