observation_space does not match reset() observation, though I confirmed they are identical. #921
Comments
you should check the type (and dtype)...
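A minimal sketch of why identical shapes can still fail the check, assuming the observation space is declared as float32 while `reset()` returns an array with NumPy's default float64 dtype. The array size and the variable names are illustrative, not taken from the reporter's environment.

```python
import numpy as np

# Assumption: dtype declared on the observation space.
space_dtype = np.dtype("float32")

# Assumption: reset() builds its observation with NumPy defaults,
# so the array is float64 even though the shape is what the space expects.
obs = np.zeros((4,))
print(obs.shape, obs.dtype)

# Safe casting from float64 down to float32 is not allowed,
# so a dtype check of this kind fails despite matching shapes.
print(np.can_cast(obs.dtype, space_dtype))

# Casting the observation to the space's dtype makes the check pass.
obs32 = obs.astype(space_dtype)
print(np.can_cast(obs32.dtype, space_dtype))
```

Comparing `obs.dtype` against the space's dtype, in addition to the shapes, is usually enough to spot this kind of mismatch.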
@araffin Thanks! It worked! The cause was this line in gym's source code: np.can_cast(x.dtype, self.dtype), which returned False for dtype('float32') and dtype('float64'). Could you please give some hints on these questions 🙏:
Thanks a lot araffin!
If you want to use recurrent policy, you don't need to change anything (Stable-Baselines-Team/stable-baselines3-contrib#53), you also should not need to do framestacking (but feedforward PPO + framestacking is usually competitive, see link to SB3 contrib with recurrent PPO benchmark).
this is correct, you should flatten to 1D (mainly to avoid broadcasting issues and misinterpretation of the input).
not sure what you mean, but sounds more like tech support/consulting which we don't do (see links in the issue template for alternatives)
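On the flattening advice above, here is a small sketch of what flattening to 1D looks like and of the kind of broadcasting surprise it helps avoid. The shapes and values are hypothetical and only serve as an illustration.

```python
import numpy as np

# Hypothetical 2D observation, e.g. 3 time steps x 2 features.
obs_2d = np.arange(6, dtype=np.float32).reshape(3, 2)

# Flatten to a 1D vector before returning it from the environment.
obs_1d = obs_2d.flatten()

# Broadcasting pitfall: a column vector minus a 1D vector
# silently produces a matrix instead of an element-wise difference.
col = np.ones((3, 1), dtype=np.float32)
vec = np.ones(3, dtype=np.float32)
diff = col - vec

print(obs_1d.shape, diff.shape)
```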
Thanks for the advice! 👍
The console reports "The observation returned by the reset() method does not match the given observation space", even though I printed the shapes of the observation from reset() and of observation_space, and they are identical. Minimal code is below:
### Checklist
Beyond that, I have two more questions: