Research

2025

PAN: Towards General World Model with Natural Language Actions and Video States
Benhao Huang, 
Pandora Team, 
Zhiting Hu, 
Eric P. Xing
Jul 1st 2025
#World Model
#Image to Video
#Diffusion

A step towards a General World Model (GWM) that can simulate complex video scenarios with natural language actions.

新智元
paper-v1
Github
online-demo of Mirage
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
Qiyue Gao, 
Xinyu Pi, 
Kevin Liu, 
Junrong Chen, 
Ruolan Yang, 
Xinqi Huang, 
Xinyu Fang, 
Lu Sun, 
Gautham Kishore, 
Bo Ai, 
Stone Tao, 
Mengyang Liu, 
Jiaxi Yang, 
Chao-Jung Lai, 
Chuanyang Jin, 
Jiannan Xiang, 
Benhao Huang, 
Zeming Chen, 
David Danks, 
Hao Su, 
Tianming Shu, 
Ziqiao Ma, 
Lianhui Qin, 
Zhiting Hu
Jun 1st 2025
ICLR 2025 Workshop World Models / ACL 2025 Findings
#World Model
#Benchmark

This paper evaluates whether modern Vision-Language Models (VLMs) like GPT-4o and Gemini can act as internal world models (WMs)—systems that understand and predict the world.

paper
🤗HuggingFace
website
DCA-Bench: A Benchmark for Dataset Curation Agents
Benhao Huang, 
Yingzhuo Yu, 
Jin Huang, 
Xingjian Zhang, 
Jiaqi W. Ma
May 1st 2025
KDD-2025 DB Track (Oral), ICML-2025 Data World
#LLM Agent
#Benchmark
#Data-centric AI

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms.

paper
Github
🤗HuggingFace
slides
poster

2024

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
Lu Chen, 
Shaofeng Li, 
Benhao Huang, 
Fan Yang, 
Zheng Li, 
Jie Li, 
Luo Yuan
Dec 10th 2024
AAAI 2025
#Adversarial Perturbations
#AI Safety

paper
Defining and extracting generalizable interaction primitives from DNNs
Lu Chen, 
Siyu Lou, 
Benhao Huang, 
Quanshi Zhang
Sep 13th 2024
ICLR-2024
#LLM
#AI Interpretability

Given different DNNs trained for the same task, developed a new method to extract interactions that are shared by these DNNs. Experiments show that the extracted interactions can better reflect common ...

paper
GitHub
Last Updated on Aug 10th 2025 Powered by greatest-gatsby-academic-template.