SIU
Reinforcement Learning PhD | Building autonomous decision making systems | Prev @microsoft @deepseek-ai
- National University of Singapore
- Singapore
- UTC +08:00
- benjamin-eecs.github.io
- @Benjamin_eecs
- in/bo-liu-eecs
- https://huggingface.co/Benjamin-eecs
- https://benjamin-eecs.medium.com/
Pinned
- deepseek-ai/DeepSeek-V2 (Public)
  DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- deepseek-ai/DeepSeek-VL (Public)
  DeepSeek-VL: Towards Real-World Vision-Language Understanding
- metaopt/torchopt (Public)
  TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
- sail-sg/envpool (Public)
  C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
- waterhorse1/Natural-language-RL (Public)
  Natural Language Reinforcement Learning