Home

Jump to bottom

BjoernMichele edited this page Feb 23, 2019 · 6 revisions

Welcome to the trashbots-RL Roadmap!

Reinforcement Learning Models to Try:

Encoding current bot as a coordinate instead of location in array.
Policy Gradient
Actor Critic Update

Environment Variants to Try:

Different distributions for trash i.e. coastal etc.
More than one item of trash per location
Impassable Terrain

Other Things to Try:

Return trash to deposit area

. . . .

continuous state space???

Current To-Do:

Debugging (Evalfunctions: Action Counter / Memory, Trashsource closeness)
Visualization
Report