Article

Reflective Reinforcement Tool Learning

Submitted to ACL 2026. Reflective reinforcement learning for tool learning.

avatar
Haotian Chen

Reflective Reinforcement Tool Learning

Submitted to ACL 2026. Reflective reinforcement learning for tool learning.

avatar
Haotian Chen

Diversity-aware Training for Test-time Scaling

Submitted to ACL 2026. Corresponding author. Diversity-aware training for test-time scaling.

bohan-lyu

Diversity-aware Training for Test-time Scaling

Submitted to ACL 2026. Corresponding author. Diversity-aware training for test-time scaling.

bohan-lyu

AtomMem: Learnable Dynamic Agentic Memory with Atomic Memory Operation

Submitted to ACL 2026. Corresponding author. Learnable dynamic agentic memory with atomic operations.

yupeng-huo

AtomMem: Learnable Dynamic Agentic Memory with Atomic Memory Operation

Submitted to ACL 2026. Corresponding author. Learnable dynamic agentic memory with atomic operations.

yupeng-huo

MiniCPM4: Ultra-Efficient LLMs on End Devices

ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.

minicpm-team

MiniCPM4: Ultra-Efficient LLMs on End Devices

ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.

minicpm-team

Collaborative Evolving Strategy for Automatic Data-Centric Development

ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.

xu-yang

Collaborative Evolving Strategy for Automatic Data-Centric Development

ArXiv 2024. Co-first author. Collaborative evolving strategy for data-centric development. 11,400+ GitHub Stars.

xu-yang