- This repository contains Python implementation of basic RL algorithms, including
- Value Iteration
- Q-Learning
- Environments are provided by Gymnasium
- Gymnasium is a maintained fork of OpenAI’s Gym library
- Recent official message
- All development of Gym has been moved to Gymnasium, a new package in the Farama Foundation that's maintained by the same team of developers who have maintained Gym for the past 18 months.
- If you're already using the latest release of Gym (v0.26.2), then you can switch to v0.27.0 of Gymnasium by simply replacing
import gym
withimport gymnasium as gym
with no additional steps. - Gym will not be receiving any future updates or bug fixes, and no further changes will be made to the core API in Gymnasium.
- A
$4 \times 12$ grid world - Positions represented as flattened index
- For example, the starting point (3, 0) is represented as
$3 \times 12 + 0 = 36$
- For example, the starting point (3, 0) is represented as
- 0: Move up
- 1: Move right
- 2: Move down
- 3: Move left
- run
pip install requirements.txt
- run
python main.py
- the
main.ipynb
Jupyter Notebook file contains a brief walkthrough of the current implementation
- Currently, I'm actively working on modifying this project to make it more readable and modularized
- The main purpose of creating this repository is to have fun playing around with RL
- More specifically, I wanted to
- run experiment new RL algorithms on simple game-like environments
- refresh my memory of the basic RL algorithms
- hopefully I can figure out some interesting variations of the existing RL algorithms and contribute to this field
- share my implementation with the community to get feedback
- maybe help people who are also interested in RL and just started to explore this intriguing field
- Researchers from DeepMind published an article about the relationship between neuroscience and AI in early 2020: Dopamine and temporal difference learning: A fruitful relationship between neuroscience and AI
- I learn RL mainly from
- the book Reinforcement Learning, Second Edition by Rich Sutton and Andrew Barto
- courses in the Reinforcement Learning Specialization offered by University of Alberta on Coursera
- research papers on Google Scholar, ACM DL, Google AI Blog, and Workshops