DQN with 20000 steps Reference Playing Atari with Deep Reinforcement Learning Parameters Default parameters as per Stable Baselines Performance logs Renders Random Modelled