AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents
ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.
ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.
ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.
Submitted to ACL 2026. First author. Unified evaluation platform for tool-learning agents.
Submitted to ACL 2026. First author. Unified evaluation platform for tool-learning agents.
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.
ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.
ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.
ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.