Open
Description
I'm currently working on a project (branching from this) where I created a new gym environment adding policies for autonomous driving in a highway. I defined my own reward weights for each of the actions the agent could take, so I would like to evaluate them, in order to get an optimal value for the reward function and, then, perform a DQN. My question is if your tool supports or not custom gym environments and, if so, how could I exploit it. I tried your notebook in colab but there's a problem with the python version, so I installed it on linux but I don't get how to implement it in my code.
Thanks in advance for your help.
Metadata
Metadata
Assignees
Labels
No labels