Deep Q-Learning

This project is an implementation of a Q-learning algorithm for solving unity or gym environments.

Required packages:

(gym if solving gym environment)
numpy
python (version 3.6)
pytorch
unityagents if solving unity environment

Required files:

The unity exe file has to be inside this folder; here is an environment called "Bananas.exe" included.

Bananas environment:

The included Bananas environement is an environment, in which the agent moves inside a square. Inside the square, there are yellow bananas giving a reward of 1 and blue bananas giving a reward of -1. The state space consists of 37 dimensions and 4 actions can be taken (move forward, move backwards, turn left, turn right). The environment is considered solved, when an average score of 13.0 over 100 episodes is reached.

Starting the program:

hyperparameters and settings for the algorithm can be changed in the file "Hyperparameter.py". -> for viewing a trained agent set "LOAD" to True,"FILENAME_FOR_LOADING" to the name of the files of the model weights(without "_model_local.pth" or "_model_target.pth"), "EPS_START" to 0.01 and for unity "ENV_TRAIN" to False. -> for training a new agent set "LOAD" to False, "EPS_START" to 1.00 and for unity "ENV_TRAIN" to True and if you want to save the model weights, set "Save" to True and "FILENAME_FOR_SAVING" to the name for the weight files
use the file "Main.py". -> comment out the method you want to start (either gym or unity). -> run the file "Main.py".
other gym or unity environments can be used as well. Note that you cannot load a trained version for new environments until one is saved.
changes in all other files are not recommended.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
Banana_Data		Banana_Data
Neural_networks		Neural_networks
__pycache__		__pycache__
Agents.py		Agents.py
BANANA_Scores.png		BANANA_Scores.png
BANANA_Scores_duel.png		BANANA_Scores_duel.png
BANANA_Scores_pexpr.png		BANANA_Scores_pexpr.png
BANANA_pexpr_Scores_duel.png		BANANA_pexpr_Scores_duel.png
Banana.exe		Banana.exe
Environment.py		Environment.py
Hyperparameter.py		Hyperparameter.py
Main.py		Main.py
Neural_networks.py		Neural_networks.py
Profile.py		Profile.py
QLearning.py		QLearning.py
README.md		README.md
Replay_buffer.py		Replay_buffer.py
Report.md		Report.md
Sum_tree.py		Sum_tree.py
Test.py		Test.py
UnityPlayer.dll		UnityPlayer.dll
delete.txt		delete.txt
unity-environment.log		unity-environment.log

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Q-Learning

Required packages:

Required files:

Bananas environment:

Starting the program:

About

Releases

Packages

Languages

jpruente92/RL-Deep-Q-Learning

Folders and files

Latest commit

History

Repository files navigation

Deep Q-Learning

Required packages:

Required files:

Bananas environment:

Starting the program:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages