Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how did you define your action space? #33

Open
OsgoodWu opened this issue Nov 21, 2017 · 1 comment
Open

how did you define your action space? #33

OsgoodWu opened this issue Nov 21, 2017 · 1 comment

Comments

@OsgoodWu
Copy link

After I read your code carefully, I cannot figure out how you define your action space. For example , how many actions you define and how to represent each action? Waiting for your answers.

Sincerely

@danielkaifeng
Copy link

I am also waiting for answers on this.
In deep Q learning, the target Qt is calculate as target_q_t = (1. - terminal) * self.discount_r * max_q_t_plus_1 + reward, which means the action space is one dimension? If it is two dimension or above, numpy matric can't conduct multiply and add method as this in the code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants