AgentCPM-GUI

8月 1, 2025 · 1 分钟阅读时长
projects

📱 角色:GUI 智能体训练数据建设负责人

基于多模态大语言模型的 GUI 智能体,具备视觉界面感知、任务理解与推理规划能力,可自主操控手机和桌面平台。

项目影响:

  • 📦 首月下载量 3,000+,登上 Hugging Face 热榜
  • 🐦 Twitter 曝光量 150,000+,YouTube 播放量 1,300+
  • 📖 微信文章阅读量 7,000+
Haotian Chen
Authors
Research Assistant Professor
I am a Research Assistant Professor at the School of Artificial Intelligence, Shanghai Jiao Tong University, where I work with Prof. Junchi Yan at RethinkLab. I study how to build AI systems that can automate long-horizon, effort-intensive, and creativity-demanding tasks such as research, engineering, and development. My current work focuses on autonomous agents, large language models, and AI4Research. Before joining SJTU, I received my PhD in Data Science from Fudan University, advised by Prof. Xiangdong Zhou, and completed postdoctoral research at Tsinghua University (THUNLP), working with Prof. Zhiyuan Liu and Prof. Maosong Sun. I was also a research intern at the Machine Learning Research Group of Microsoft Research Asia, mentored by Xiao Yang, and at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, working with Prof. Yang Yu.