WooooDyy 🤡

Pinned

  1. AgentGym-RL (Public)

    Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

    Python · 394 stars · 39 forks

  2. AgentGym (Public)

    Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

    Python · 602 stars · 83 forks

  3. LLM-Agent-Paper-List (Public)

    The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

    7.9k stars · 479 forks

  4. LLM-Reverse-Curriculum-RL (Public)

    Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

    Python · 110 stars · 9 forks

  5. MathCritique (Public)

    Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

    Python · 56 stars · 1 fork

  6. BMMR (Public)

    Python · 14 stars