Haotian Chen

Haotian Chen

Researcher in Artificial Intelligence

Fudan University

Biography

My research goal is to understand and develop AI for automating the tasks (e.g., engineering, research, development, etc.) that require extensive time, effort, and sometimes even the creative thinking of humans. I work on automating the data-driven scientific research for contributing to both alleviating the burden on humans and revolutionizing human productivity. My research experience focuses on automating the Real-World Data-Driven Research and Development (R&D) cycle in the financial domain, analyzing and correcting the decision rules of deep models (e.g., language models) in their real-world applications such as legal judgment prediction (LJP) and knowledge extraction (e.g., relation extraction), and improving the performance, robustness, and out-of-distribution generalization ability of deep models in these applications.

Interests
  • Large Language Model Agent
  • Computational Linguistics
  • Information Extraction
  • Trustworthy or Explainable Artificial Intelligence
Education
  • PhD in Data Science, 2024

    Fudan University

  • BEng in Ocean Technology, 2018

    Dalian University of Technology

Experience

 
 
 
 
 
Microsoft Research Asia - Machine Learning Group
Internship
November 2023 – Present Beijing

Responsibilities include:

  • Contribute to developing an open-source code framework aiming to apply LLM (e.g., GPT-4) agent to perform automatic research and development (R&D) in real-world scenarios.
  • Identify research questions through the implementation and then formalize questions and expose the bottleneck of current LLMs.
  • Demonstrate the promising future of LLM-driven automatic R&D and propose the effective technical methods (knowledge-augmented evolving strategies).
 
 
 
 
 
Tsinghua University - Institute for Interdisciplinary Information Sciences
Research Assistant
Tsinghua University - Institute for Interdisciplinary Information Sciences
June 2021 – April 2023 Beijing

Responsibilities include:

  • Investigate the decision rules of transformer-based AI judgers by feature attribution and find that they make legal judgment predictions according to irrelevant information in the given case descriptions.
  • Causally analyze the AI judgers’ nature and argue that the lack of human knowledge, the imbalance of training data, and the incomprehensiveness of testing methods impede them from learning causal relationships.
  • Propose two methods to infuse knowledge into data and model’s architecture, respectively.
  • Further propose 8 kinds of legal-specific attacks to complete the testing methods.
  • Experimentally demonstrate that knowledge improves models’ robustness and performance.

Projects

*
AI for Finance
A project of using LLM to facilitate or automate quantitative finance. The project’s research value is to automate the whole research and development (R&D) cycle. I’m responsible for empowering LLM to automatically implement various factors in engineering, and I also design the overall research plan and conduct the ground-breaking research. Details are shown in the hyperlink of the project’s name.
AI for Finance
Information Extraction and Management on Textual Data
I’m responsible for leading both the engineering and research components of the project, focusing on information extraction and management on textual data (e.g., technical reports, scientific documents, notes of technicians, etc.) in the terabyte scale.
Information Extraction and Management on Textual Data

Contact

Please feel free to contact me if you have any questions.