-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature request] Implement SAC-Discrete #157
Comments
Cheers for the nice comments :). We are (still) working on getting v1.0 out, i.e. mainly bug testing and reviewing of the code. After the release we can discuss adding new algorithms or improvements to existing algorithms. On a quick glimpse this seems simple enough that it could be added with not much extra code. |
Hello, Thanks for the suggestion =) In principle I would be for that addition. We mostly need to discuss the advantage of it vs DQN and variants (QR-DQN, ...) in term of performance and runtime and see how much effort it requires and complexity it adds. @Miffyli maybe a good candidate for stable-baselines3 "contrib" (same as #83 ) |
Thank you for the response. According to the paper, SAC-Discrete is evaluated with 100k environment steps because they are most interested in sample efficiency, not final performance. Its results at 100k steps were not bad, but it failed to solve some simple tasks like Pong. Once v1.0 is released, I can contribute to implementing QR-DQN and IQN, in addition to SAC-Discrete. Thanks :) |
The contrib repo is here ;) https://github.com/Stable-Baselines-Team/stable-baselines3-contrib make sure to read the contributing guide carefully first ;). (we don't advertise it yet as we want to check the process and not get too many request for now) |
I was asked to post it here, @partiallytyped, regarding the following comment. PartiallyTyped posted an academic paper link for a SAC algorithm that takes a discrete input. I think PartiallyTyped is already aware since the main github link was mentioned on the paper page, there is a source code example for it. The author publicised his code. Hope this helps, |
I would now close this one as it rather belongs the contrib repo.
Academic, yes, but not peer-reviewed... |
@araffin How about the following paper? |
Hi, thank you for your great work!!
I'm interested in contributing to Stable-Baselines3.
I want to implement SAC-Discrete(paper, my implementation).
Can we discuss before implementing??
The text was updated successfully, but these errors were encountered: