Haotian Chen

(he/him)

Research Assistant Professor

Biography

I am a Research Assistant Professor at the School of Artificial Intelligence, Shanghai Jiao Tong University, where I work with Prof. Junchi Yan at RethinkLab. I study how to build AI systems that can automate long-horizon, effort-intensive, and creativity-demanding tasks such as research, engineering, and development. My current work focuses on autonomous agents, large language models, and AI4Research. Before joining SJTU, I received my PhD in Data Science from Fudan University, advised by Prof. Xiangdong Zhou, and completed postdoctoral research at Tsinghua University (THUNLP), working with Prof. Zhiyuan Liu and Prof. Maosong Sun. I was also a research intern at the Machine Learning Research Group of Microsoft Research Asia, mentored by Xiao Yang, and at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, working with Prof. Yang Yu.

Education

PhD in Data Science

September 2018 – June 2024

Fudan University

BEng in EE

September 2014 – June 2018

Dalian University of Technology

Interests

Autonomous Agents, Large Language Models, AI4Research, Agent Reinforcement Learning, Tool Learning

Experience

Research Assistant Professor

Shanghai Jiao Tong University

May 2026 – Present

School of Artificial Intelligence, RethinkLab. I work with Prof. Junchi Yan on autonomous agents, LLMs, and AI4Research.

Postdoctoral Researcher

Tsinghua University

July 2024 – April 2026

Department of Computer Science and Technology, THUNLP. I worked with Prof. Zhiyuan Liu and Prof. Maosong Sun on autonomous agents, LLMs, and tool-use reinforcement learning for agents. Honor: Tsinghua “Shuimu Scholar” Program.

Research Intern

Microsoft Research Asia

November 2023 – June 2024

Machine Learning Research Group. I worked with Xiao Yang and Jiang Bian on autonomous agents and LLMs, and served as a core contributor to RD-Agent. Honor: Microsoft “Star of Tomorrow”.

Research Intern

Tsinghua University (IIIS)

June 2021 – April 2023

Institute for Interdisciplinary Information Sciences. I worked with Prof. Yang Yu on LLM safety auditing and AI decision-bias analysis.

Education

PhD in Data Science

Fudan University

September 2018 – June 2024

School of Computer Science and Technology. Advisor: Prof. Xiangdong Zhou. I worked on NLP, data mining, and information extraction, and received honors including Outstanding Student, Outstanding Graduate, and Academic Excellence Scholarship.

BEng in EE

Dalian University of Technology

September 2014 – June 2018

School of Electrical Engineering. Advisors: Prof. Qianjin Yue and Prof. Mingfeng He. I ranked 1/74 in the program and graduated as a Provincial Outstanding Graduate.

🔬 Research

I study how to build AI systems that can automate long-horizon work in research, engineering, and development. A central goal of my work is to make data-driven scientific research more scalable, reliable, and productive. I currently focus on three closely connected directions:

🧪 AI4Research — I build AI research assistants for literature retrieval, experiment design, data analysis, and hypothesis generation and verification.

🤖 Autonomous Agents — I develop agent models and systems with environment perception, memory, planning, and tool-use capabilities, with LLMs as the core intelligence.

🧠 Foundation Models — I explore how to train and adapt general-purpose models for language understanding, knowledge representation, multimodal fusion, and downstream generalization.

📄 Featured Publications

Autonomous Agents

AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents

ArXiv 2026. First author. Open-source 4B agent model achieving SOTA on GAIA & HLE, surpassing GPT-5 and Claude-4.5-Sonnet.

Haotian Chen

• Feb 1, 2026

GUI Agents

AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning

EMNLP 2025 Demo. GUI agents with reinforcement fine-tuning. 1,200+ GitHub Stars.

Zhong Zhang

• Sep 1, 2025

Foundation Models

MiniCPM4: Ultra-Efficient LLMs on End Devices

ArXiv 2025. Team contribution (led MCP agent capabilities). 8,300+ GitHub Stars.

MiniCPM Team

• Jun 1, 2025

AI4Research

Towards Data-Centric Automatic Research and Development

ArXiv 2024. First author. RD-Agent for automatic R&D. Featured in Microsoft Build 2025 Keynote. 11,400+ GitHub Stars.

Haotian Chen

• Jun 1, 2024

📚 All Publications

Haotian Chen*, Boye Niu*, Ke Zhang*, Yaxi Lu, Xin Cong, Zhong Zhang, Yankai Lin, Zhiyuan Liu, Maosong Sun (2026). Reflective Reinforcement Tool Learning. ACL 2026 (submitted). (*Equal contribution.)

Haotian Chen*, Zijun Song*, Boye Niu*, Ke Zhang*, Litu Ou*, Yaxi Lu, Zhong Zhang, Xin Cong, Yankai Lin, Zhiyuan Liu, Maosong Sun (2026). ToLeaP: Rethinking Development of Tool Learning with Large Language Models. ACL 2026 (submitted). (*Equal contribution.)

Zhong Zhang*, Yaxi Lu*, Yikun Fu, Yupeng Huo, Shenzhi Yang, Yesai Wu, Han Si, Xin Cong, Haotian Chen, Yankai Lin, et al., Zhiyuan Liu, Maosong Sun (2025). AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning. In EMNLP 2025 Demo. (*Equal contribution.)

MiniCPM Team, et al. (2025). MiniCPM4: Ultra-Efficient LLMs on End Devices. ArXiv 2025.

Xu Yang*, Haotian Chen*, Wenjun Feng*, Haoxue Wang, Zeqi Ye, Xinjie Shen, Xiao Yang, Shizhao Sun, Weiqing Liu, Jiang Bian (2024). Collaborative Evolving Strategy for Automatic Data-Centric Development. ArXiv 2024. (*Equal contribution.)

Haotian Chen*, Xinjie Shen*, Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Jiang Bian (2024). Towards Data-Centric Automatic Research and Development. ArXiv 2024. (*Equal contribution.)

Haotian Chen, Houjing Guo, Bingsheng Chen, Xiangdong Zhou (2024). OODREB: Benchmarking State-of-the-Art Methods for Out-Of-Distribution Generalization on Relation Extraction. In WWW 2024.

Haotian Chen*, Xinjie Shen*, Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Jiang Bian (2024). RD2Bench: Toward Data-Centric Automatic R&D. ICLR 2024 Workshop. (*Equal contribution.)

Haotian Chen, Lingwei Zhang, Yiran Liu, Yang Yu (2024). Rethinking the Development of Large Language Models from the Causal Perspective: A Legal Text Prediction Case Study. In AAAI 2024.

Haotian Chen, Han Zhang, Houjing Guo, Shuchang Yi, Bingsheng Chen, Xiangdong Zhou (2023). SALAS: Supervised Aspect Learning Improves Abstractive Multi-Document Summarization through Aspect Information Loss. In ECML-PKDD 2023.

Haotian Chen, Bingsheng Chen, Xiangdong Zhou (2023). Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction. In ACL 2023.

Haotian Chen, Han Zhang, Houjing Guo, Shuchang Yi, Bingsheng Chen, Xiangdong Zhou (2023). Recovering Missing Key Information: An Aspect-guided Generator for Abstractive Multi-document Summarization. In DASFAA 2023.

💻 Open-Source Projects

My open-source work centers on LLM-powered agents, and the projects fit together as one pipeline: Data (AgentCPM-GUI) → Algorithm (AgentRL, AgentCPM-Explore, MiniCPM4-MCP) → Execution (RD-Agent) → Evaluation (ToLeaP).

AI4Research

RD-Agent

🔬 **Core Founding Member & Primary Coder** · I helped build an automated R&D agent for data science and finance …

Mar 30, 2026

Autonomous Agents

AgentCPM-Explore

🏆 **Project Lead** · I led an open-source 4B agent model for long-horizon deep exploration …

Mar 20, 2026

Reinforcement Learning

AgentRL

🏆 **Project Lead** · Fully asynchronous agent RL training infrastructure for the AgentCPM model family · `100+ Tools` · `20+ Benchmarks` · `Full-cycle Visualization`

Jan 1, 2026

Benchmark

ToLeaP

📏 **Project Lead** · One-click evaluation platform for tool-learning agents …

Jan 1, 2026

Tool Learning

MiniCPM4-MCP

🏆 **Project Lead** · Edge-scale (8B) agent LLM mastering MCP tools · [GitHub](https://github.com/OpenBMB/MiniCPM) …

Oct 1, 2025

GUI Agents

AgentCPM-GUI

📱 **Training Data Lead** · Multimodal LLM-based GUI agent for mobile & desktop …

Aug 1, 2025

🎓 Academic Service

🧑‍🏫 Student Supervision — I co-supervise undergraduate, master’s, and PhD students on LLM agents and data mining.

📝 Conference Reviewing — I regularly review for venues including NeurIPS, ICLR, EMNLP, KDD, WWW, and COLING.

📰 Journal Reviewing — I have also reviewed for journals such as TKDE, Science China, and AI Open.

🎤 Invited Talks — Recent invited talks include Autonomous Agents and Tool Learning with LLMs (RLChina 2025) and LLM-Driven Autonomous Agents (Huawei OpenHarmony AI Agent TSG).

💰 Research Grants — I currently serve as PI for the China Postdoctoral Science Foundation General Grant and the National Postdoctoral Researcher Program Category C.

📬 Contact

Feel free to reach out if you would like to discuss research ideas, collaborations, or open-source projects.

✉️ Email: haotian.chen@163.com / ht1ian.chen@gmail.com

🐙 GitHub: github.com/Hytn

✖️ X: @HytnChen

🎓 Google Scholar: scholar profile
