AgentCPM-Explore
🏆 **项目负责人** · 我主导了一个面向长程深度探索的开源 4B 智能体模型 [](https://github.com/OpenBMB/AgentCPM)
•
1 分钟阅读时长
🏆 **项目负责人** · 我主导了一个面向长程深度探索的开源 4B 智能体模型 [](https://github.com/OpenBMB/AgentCPM)
ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
EMNLP 2025 Demo. GUI agents with reinforcement fine-tuning. 1,200+ GitHub Stars.