AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
Sep 1, 2025
Zhong Zhang (equal contribution), Yaxi Lu (equal contribution), Yikun Fu, Yupeng Huo, Shenzhi Yang, Yesai Wu, Han Si, Xin Cong, Haotian Chen, Yankai Lin, et al., Zhiyuan Liu, Maosong Sun

Abstract
We build a multimodal LLM-based GUI agent with visual interface perception, task understanding, and reasoning & planning for autonomous control of mobile and desktop platforms.
Type
Publication
In EMNLP 2025 Demo

Authors
Haotian Chen
(he/him)
Research Assistant Professor
I am a Research Assistant Professor at the School of Artificial Intelligence, Shanghai Jiao Tong University, where I work with Prof. Junchi Yan at RethinkLab. I study how to build AI systems that can automate long-horizon, effort-intensive, and creativity-demanding tasks such as research, engineering, and development. My current work focuses on autonomous agents, large language models, and AI4Research. Before joining SJTU, I received my PhD in Data Science from Fudan University, advised by Prof. Xiangdong Zhou, and completed postdoctoral research at Tsinghua University (THUNLP), working with Prof. Zhiyuan Liu and Prof. Maosong Sun. I was also a research intern at the Machine Learning Research Group of Microsoft Research Asia, mentored by Xiao Yang, and at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, working with Prof. Yang Yu.