Assertion assert_equal_shape failed for MultiDiscrete action space #21
Comments
Hello,
@UweGensheimer Thank you for your suggestion. Flattening the output is kind of complicated, so instead I tried to specify the size of my
Thanks for reporting this! I can confirm that this is a proper bug. It has to do with the way variates are pre/post-processed. I'll have a closer look at it as soon as I have time, which is either tonight or tomorrow.
First of all, thank you for developing this package; I really like the modular design. I am a bit new to RL and the JAX ecosystem, so my question may be a bit naive. I am currently doing a baseline study with my customized `gym` environment and `VanillaPG`, but I encountered the bug shown below and could not figure it out. My understanding is that it is complaining that the shape of `log_pi` should not be `(4,)`. But I do have a `MultiDiscrete` action space, and its corresponding `log_pi` should be something like `(4,)` or `(1, 4)`. I also attached the output of calling `coax.Policy.example_data(env)` and my policy function definition below, in case that helps explain the situation.
So my questions are:
I would appreciate any feedback. Thank you!
Error message
Example data
Policy function
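For anyone following the shape question above, here is a minimal sketch of how per-component log-probabilities for a `MultiDiscrete`-style action can be computed and then reduced to one joint value per sample. This is not coax's implementation and may not match how it pre/post-processes variates. It uses plain NumPy, the helper names `log_softmax` and `multidiscrete_log_prob` are hypothetical, and it assumes every component has the same number of categories (3 here), which a real `MultiDiscrete` space does not require.

```python
import numpy as np


def log_softmax(logits, axis=-1):
    # Numerically stable log-softmax along the given axis.
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def multidiscrete_log_prob(logits, actions):
    """Hypothetical helper, for illustration only.

    logits:  float array, shape (batch, n_components, n_categories)
    actions: int array,   shape (batch, n_components)
    """
    logp_all = log_softmax(logits, axis=-1)
    # Pick the log-probability of the chosen category in each component.
    per_component = np.take_along_axis(
        logp_all, actions[..., None], axis=-1
    ).squeeze(-1)
    # If the components are treated as independent, the joint
    # log-probability is the sum over components.
    joint = per_component.sum(axis=-1)
    return per_component, joint


rng = np.random.default_rng(0)
logits = rng.normal(size=(1, 4, 3))   # batch of 1, 4 components, 3 categories
actions = np.array([[0, 2, 1, 1]])    # one category index per component

per_component, joint = multidiscrete_log_prob(logits, actions)
print(per_component.shape)  # (1, 4)
print(joint.shape)          # (1,)
```

The point of the sketch is only that a per-component `log_pi` of shape `(1, 4)` and a per-sample value of shape `(1,)` are both reasonable intermediate quantities, so a shape assertion can fail if one part of the code expects one and receives the other.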