Question about visualize the change of reward when training #7

huangjiancong1 · 2019-06-23T07:37:14Z

Why we use test() to see the reward and testing_sample_step?

Can I use the train() to see how the reward change when training?
It seems that the last perf is the reward.

Because we want to compare with the openai,

The text was updated successfully, but these errors were encountered:

matthieu637 · 2019-06-24T02:42:07Z

The test() function disable exploration for testing the performance of deterministic policies.
0.0.monitor.csv contains training performance (with exploration)
0.1.monitor.csv contains testing performance (without exploration)

huangjiancong1 changed the title ~~Question about to see the change of reward when trainning~~ Question about visualize the change of reward when training Jun 23, 2019

matthieu637 added the wontfix label Sep 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about visualize the change of reward when training #7

Question about visualize the change of reward when training #7

huangjiancong1 commented Jun 23, 2019 •

edited

Loading

matthieu637 commented Jun 24, 2019

Question about visualize the change of reward when training #7

Question about visualize the change of reward when training #7

Comments

huangjiancong1 commented Jun 23, 2019 • edited Loading

matthieu637 commented Jun 24, 2019

huangjiancong1 commented Jun 23, 2019 •

edited

Loading