In this project we train a ball with Q-learning to stay on a platform.
In reinforcement learning, agents learn to perform actions in an environment in order to maximize a reward.
The key difference between reinforcement learning and supervised or unsupervised learning is the presence of two things:
An environment
An agent
Q-learning is a reinforcement learning algorithm that seeks to find the best action to take given the current state.
Q-learning is based on a Q-function, which says that the maximum return from state "s" and action "a" is the sum of the immediate reward r and the (discounted) maximum return from the next state "s'":

Q(s, a) = r + γ · max_a' Q(s', a')
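As a concrete illustration, below is the classic tabular Q-learning update built on this equation. The state and action space sizes, learning rate, and discount factor are illustrative assumptions, not values from this project.

```python
import numpy as np

n_states, n_actions = 10, 2   # illustrative sizes, not this project's actual spaces
alpha, gamma = 0.1, 0.99      # learning rate and discount factor (assumed values)

Q = np.zeros((n_states, n_actions))

def q_update(s, a, r, s_next):
    # Bellman update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])
```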
Deep Q-learning makes use of neural networks. The Deep Q-Network (DQN) algorithm was developed by DeepMind in 2015. It enhances Q-learning, a classic reinforcement learning algorithm, with deep neural networks and a technique called experience replay.
At each time step of data collection, the transitions are added to a circular buffer called the replay buffer. Then during training, instead of using just the latest transition to compute the loss and its gradient, we compute them using a mini-batch of transitions sampled from the replay buffer.
This is called experience replay. It makes the network updates more stable and has the following benefits (a code sketch follows the list):
Better data efficiency, since each transition can be used in many updates.
Better stability, since the transitions in a batch are uncorrelated.
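A minimal sketch of such a circular replay buffer in Python (the capacity and batch size are arbitrary assumptions):

```python
import random
from collections import deque

class ReplayBuffer:
    def __init__(self, capacity=10000):
        # A deque with maxlen behaves as a circular buffer:
        # once full, the oldest transition is discarded
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size=32):
        # uniform random mini-batch of stored transitions for one update
        return random.sample(self.buffer, batch_size)
```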
For input we use the platform's X rotation, the ball's Z position, and the ball's X velocity.
The outputs are two Q-values (quality values) that estimate how good tilting the platform to the left or to the right is.
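Given those three observations and two actions, the Q-network can be a small fully connected network. Below is a hedged PyTorch sketch; the layer sizes and the left/right action mapping are assumptions, not the project's actual architecture.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    def __init__(self):
        super().__init__()
        # 3 inputs: platform X rotation, ball Z position, ball X velocity
        # 2 outputs: Q-values for tilting the platform left or right
        self.net = nn.Sequential(
            nn.Linear(3, 64),   # hidden sizes are illustrative assumptions
            nn.ReLU(),
            nn.Linear(64, 64),
            nn.ReLU(),
            nn.Linear(64, 2),
        )

    def forward(self, obs):
        return self.net(obs)

# Greedy action selection: pick the action with the highest predicted Q-value
q_net = QNetwork()
obs = torch.tensor([[0.05, 0.0, -0.3]])   # made-up observation values
action = q_net(obs).argmax(dim=1).item()  # 0 = left, 1 = right (assumed mapping)
```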