The design of rewarder #4

qingyue2014 · 2019-08-26T12:46:47Z

In your paper, the rewarder network is modeled as a simple feed-forward neural network. When I tried to understand it through this code, I found that it is modeled as an LSTM instead. The reward value comes from the LSTM network's prediction at each step. Why?
