You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After I read your code carefully, I cannot figure out how you define your action space. For example , how many actions you define and how to represent each action? Waiting for your answers.
Sincerely
The text was updated successfully, but these errors were encountered:
I am also waiting for answers on this.
In deep Q learning, the target Qt is calculate as target_q_t = (1. - terminal) * self.discount_r * max_q_t_plus_1 + reward, which means the action space is one dimension? If it is two dimension or above, numpy matric can't conduct multiply and add method as this in the code.
After I read your code carefully, I cannot figure out how you define your action space. For example , how many actions you define and how to represent each action? Waiting for your answers.
Sincerely
The text was updated successfully, but these errors were encountered: