Reflective Reinforcement Tool Learning
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
Submitted to ACL 2026. Reflective reinforcement learning for tool learning.
Submitted to ACL 2026. Corresponding author. Diversity-aware training for test-time scaling.
Submitted to ACL 2026. Corresponding author. Diversity-aware training for test-time scaling.
Submitted to ACL 2026. Corresponding author. Learnable dynamic agentic memory with atomic operations.
Submitted to ACL 2026. Corresponding author. Learnable dynamic agentic memory with atomic operations.
ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.
ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.
ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.
ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.