RD2Bench: Toward Data-Centric Automatic R&D

May 1, 2024
Haotian Chen (equal contribution), Xinjie Shen (equal contribution), Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Jiang Bian
Abstract
We propose RD2Bench, a Real-world Data-centric automatic R&D Benchmark. RD2Bench benchmarks all operations in data-centric automatic R&D as a whole. The results reveal that although the benchmark is challenging for GPT-4, LLMs possess promising potential.
Type
Publication
ICLR 2024 Workshop: How Far Are We From AGI
Authors
Haotian Chen
Research Assistant Professor
I am a Research Assistant Professor at the School of Artificial Intelligence, Shanghai Jiao Tong University, where I work with Prof. Junchi Yan at RethinkLab. I study how to build AI systems that can automate long-horizon, effort-intensive, and creativity-demanding tasks such as research, engineering, and development. My current work focuses on autonomous agents, large language models, and AI4Research. Before joining SJTU, I received my PhD in Data Science from Fudan University, advised by Prof. Xiangdong Zhou, and completed postdoctoral research at Tsinghua University (THUNLP), working with Prof. Zhiyuan Liu and Prof. Maosong Sun. I was also a research intern at the Machine Learning Research Group of Microsoft Research Asia, mentored by Xiao Yang, and at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, working with Prof. Yang Yu.