AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
Sep 1, 2025·,,,,,
,,,,·
0 min read
Zhong Zhang
Equal contribution
,Yaxi Lu
Equal contribution
,Yikun Fu
Yupeng Huo
Shenzhi Yang
Yesai Wu
Han Si
Xin Cong
Haotian Chen
Yankai Lin
et al.
Zhiyuan Liu
Maosong Sun
Abstract
We build a multimodal LLM-based GUI agent with visual interface perception, task understanding, and reasoning & planning for autonomous control of mobile and desktop platforms.
Type
Publication
In EMNLP 2025 Demo

Authors
Haotian Chen
(he/him)
Assistant Researcher
Haotian Chen is an Assistant Researcher at the School of Artificial Intelligence, Shanghai Jiao Tong University, working with Prof. Junchi Yan at RethinkLab. His research goal is to understand and develop AI for automating tasks that require extensive time, effort, and creative thinking. He works on automating data-driven scientific research, contributing to both alleviating the burden on humans and revolutionizing human productivity. His research focuses on Autonomous Agents, Large Language Models, and AI4Research. He received his PhD in Data Science from Fudan University and completed postdoctoral research at Tsinghua University (THUNLP), where he worked with Prof. Zhiyuan Liu and Prof. Maosong Sun. He was also a research intern at Microsoft Research Asia, where the RD-Agent project he co-developed was featured in the Microsoft Build 2025 Global Keynote.