🤡
Now PhD student at Fudan NLP Group of Fudan University. Previously got Bechler's degree at Nanjing University.
-
Fudan University
-
01:57
(UTC +08:00) - https://woooodyy.github.io/
- https://scholar.google.com.hk/citations?user=zSVLkqAAAAAJ&hl=zh-CN
- @Be1ong1
Pinned Loading
-
AgentGym-RL
AgentGym-RL PublicCode and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
-
LLM-Agent-Paper-List
LLM-Agent-Paper-List PublicThe paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
-
LLM-Reverse-Curriculum-RL
LLM-Reverse-Curriculum-RL PublicImplementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
-
MathCritique
MathCritique PublicImplementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.