Reinforcement Learning Grid Walker

Reinforcement learning toy model of a walker on a two-dimensional grid. The task is to avoid the mines (negative rewards, penalties) while reaching the treats (positive rewards and fixed points).

The learning algorithm used is the canonical Q-learning.

The user can play with different hyperparameters such as the learning rate, reward forecast discount, exploration-exploitation trade-off on grids of varying size and different reward/penalty densities.

The panel of settings:

The grid and the learned policy can be visualized.

This project was generated with Angular 7, and visualization uses roughjs.

Play with me here, or locally:

Build dependencies with:

npm install

then run with

ng serve

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
e2e		e2e
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
angular.json		angular.json
package-lock.json		package-lock.json
package.json		package.json
rl-grid-vis.png		rl-grid-vis.png
rl-settings.png		rl-settings.png
tsconfig.json		tsconfig.json
tslint.json		tslint.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Grid Walker

About

Releases

Packages

Languages

License

vnherdeiro/rl-grid-walker

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Grid Walker

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages