HuskyDoge

Benhao Huang

MAITRIX
TRAIS
SJTU-XAI
Pittsburgh, PA, USA

About Me

Hello! Husky here! I'm an incoming student in the CMU MSML program. Throughout my research journey, I've explored a variety of topics. Currently, my primary focus is on generative world modeling, where I have been fortunate to work closely with Professor Zhiting Hu and Professor Yilun Du on advancing the capabilities of world models. Previously, I collaborated with my amazing advisor Jiaqi W. Ma at the TRAIS Lab, focusing on dataset curation using LLM agents. I also gained valuable research experience in AI interpretability in Professor Quanshi Zhang's XAI Lab.

Within world modeling, I'm currently most interested in the following directions:

  • How can we enable world models to operate effectively in long-sequence scenarios? This includes both looking ahead (simulating long trajectories) and looking back (designing effective memory mechanisms).
  • How can we make world models real-time interactive? This is especially crucial for real-world applications, and I find both the mathematical and ML systems perspectives fascinating.
  • How can we make world models more physically grounded? I'm excited to explore data-driven approaches, RLHF, and other emerging methods.

I am now actively seeking internship opportunities in related fields. If you share similar research interests, I would be delighted to connect. I am always open to collaboration and eager to expand my expertise through impactful projects.

News

  • 2025/07 Mirage (Game Engine based on PAN) is now available online! 🎉
  • 2025/05 DCA-Bench has been accepted to KDD 2025 DB-Track as an oral paper! See you in Toronto 🎉

Education

Aug 2025 - Feb 2027 (expected)
M.S. in Machine Learning
Carnegie Mellon University
Sept 2021 - July 2025
B.S.E. in Computer Science
Shanghai Jiao Tong University
Sept 2018 - June 2021
High School
Zhejiang Ruian High School

Interests

World Model, Reasoning and Planning
Data-centric AI, AI Automation
Efficient ML, Long Sequence Modeling

Selected Works

Refer to the Research page for the complete list.

PAN: Towards General World Model with Natural Language Actions and Video States
Benhao Huang, Pandora Team, Zhiting Hu, Eric P. Xing
Jul 1st 2025
#World Model
#Image to Video
#Diffusion

A step towards a General World Model (GWM) that can simulate complex video scenarios with natural language actions.

新智元
paper-v1
Github
Mirage
DCA-Bench: A Benchmark for Dataset Curation Agents
Benhao Huang, Yingzhuo Yu, Jin Huang, Xingjian Zhang, Jiaqi W. Ma
May 1st 2025
KDD-2025 DB Track (Oral), ICML-2025 Data World
#LLM Agent
#Benchmark
#Data-centric AI

A benchmark that evaluates how well LLM agents detect issues in datasets hosted on popular platforms.

paper
Github
🤗 HuggingFace
slides
poster

Awards & Scholarships

National Scholarship
2024
Rui Yuan Hong Shan Scholarship (Top 2%, SJTU)
2023
Shao Qiu Scholarship (Top 4%, SJTU)
2022
Meritorious Winner of MCM/ICM
2022
Last Updated on Aug 24th 2025 Powered by greatest-gatsby-academic-template.