🔭 I’m a CS PhD student at Rutgers
🌱 I’m doing LLM research in agentic RL and test-time scaling for LLM agents.
-
LLM Researcher
- New Jersey
-
23:22
(UTC -04:00) - https://dongyuanjushi.github.io/
- @KaiMei_2000
- in/kai-mei-849969248
Pinned Loading
-
slime
slime PublicForked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.
Python
-
-
cua
cua PublicForked from trycua/cua
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




