This project is based on the implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' (IJCAI 2018), https://arxiv.org/abs/1804.11258
To set up the conda environment (dependencies are listed in environment.yml), run:
conda create -n rl-project python=3.6
conda env update -n rl-project -f environment.yml
conda activate rl-project
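After activating the environment, it can help to confirm that the interpreter matches the Python 3.6 pin from the create command. The following is a minimal sketch and not part of the original project; the helper name `matches_pinned_python` is hypothetical.

```python
import sys

# Version pinned by `conda create -n rl-project python=3.6`.
PINNED = (3, 6)


def matches_pinned_python(version_info=None, pinned=PINNED):
    """Return True if the major.minor version equals the pinned one.

    Hypothetical helper for illustration; not part of the original project.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); patch releases are allowed to differ.
    return tuple(version_info[:2]) == tuple(pinned)


if __name__ == "__main__":
    found = "{}.{}".format(*sys.version_info[:2])
    if matches_pinned_python():
        print("Python " + found + " matches the pinned 3.6 environment.")
    else:
        print("Warning: running Python " + found + ", expected 3.6.")
```

Run inside the activated rl-project environment, the script prints a warning if a different interpreter is being picked up, for example when the environment was not activated in the current shell.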