Home

Jump to bottom

Jacob edited this page Feb 9, 2019 · 6 revisions

Welcome to the trashbots-RL Roadmap!

Reinforcement Learning Models to Try:

Encoding current bot as a coordinate instead of location in array.
Policy Gradient
Actor Critic Update

Environment Variants to Try:

Different distributions for trash i.e. coastal etc.
More than one item of trash per location
Impassable Terrain

Other Things to Try:

Return trash to deposit area

.

.

.

continuous state space???