Learning from Human Demonstrations Enables Adaptive GUI Agents

Pre-submission to ARR. Corresponding author. Adaptive GUI agents learning from human demonstrations.

yuepeng-fu

Test-Time Exploration in Unknown Environments

Submitted to KDD 2026. Corresponding author. Test-time exploration in unknown environments.

wentong-chen

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Submitted to KDD 2026. Corresponding author. Diagnosing step-level process quality in tool-using agents.

shengda-fan

AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents

ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.

Haotian Chen

ToLeaP: Rethinking Development of Tool Learning with Large Language Models

Submitted to ACL 2026. First author. Unified evaluation platform for tool-learning agents.

Haotian Chen
