This is a tool to explore the implentation of the Double Deep Q Learning Algorithm. It is very useful for looking at reward hacking and perceptron optimization.
The environemnt is a very simple 2D physics engine written in java.
The machine learning is written in python and uses Tensorflow.