Research

2025

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Jiannan Xiang,

Yi Gu,

Zihan Liu,

Zeyu Feng,

Qiyue Gao,

Yiyan Hu,

Benhao Huang,

Guangyi Liu,

Yichi Yang,

Kun Zhou,

Davit Abrahamyan,

Arif Ahmad,

Ganesh Bannur,

Junrong Chen,

Kimi Chen,

Mingkai Deng,

Ruobing Han,

Xinqi Huang,

Haoqiang Kang,

Zheqi Li,

Enze Ma,

Hector Ren,

Yashowardhan Shinde,

Rohan Shingre,

Ramsundar Tanikella,

Kaiming Tao,

Dequan Yang,

Xinle Yu,

Cong Zeng,

Binglin Zhou,

Hector Liu,

Zhiting Hu,

Eric P. Xing

Nov 13th 2025

#World Model

#Video Generation

#Long-term Consistency

PAN brings imagination to life — fusing language, action, and vision to simulate the world's evolution with stunning realism and consistency.

technical report

Blog

Mask Tokens as Prophet: Fine-Grained Cache Eviction for Efficient dLLM Inference

Jianuo Huang,

Yaojie Zhang,

Yicun Yang,

Benhao Huang,

Biqing Qi,

Dongrui Liu,

Linfeng Zhang

Oct 10th 2025

#Diffusion

#LLM

#Efficiency

MaskKV exploits mask-token attention signals to evict low-utility KV pairs in diffusion LLMs, shrinking cache budgets while preserving long-context accuracy and increasing throughput.

paper

code

FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching

Jiacheng Liu,

Peiliang Cai,

Qinming Zhou,

Yuqi Lin,

Deyang Kong,

Benhao Huang,

Yupei Pan,

Haowen Xu,

Chang Zou,

Junshu Tang,

Shikang Zheng,

Linfeng Zhang

Oct 9th 2025

#Diffusion

#Efficiency

FreqCa accelerates diffusion models by analyzing the frequency dynamics of features across timesteps, reusing low-frequency components and interpolating high-frequency ones while substantially reducing memory usage.

paper

Flow Equivariant World Models: Structured Dynamics Outside the Field of View

Hansen Lillemark*,

Benhao Huang*,

Fangneng Zhan,

Yilun Du,

T. Anderson Keller

Sep 23rd 2025

NeurReps Workshop, SpaVLE Workshop @ NeurIPS 2025

#World Model

#Equivariant Models

#Long-term Consistency

On world modeling of partially observable dynamics in the environment.

paper

code

A Survey of Data Attribution: Methods, Applications, and Evaluation in the Era of Generative AI

Junwei Deng,

Yuzheng Hu,

Pingbang Hu,

Ting-Wei Li,

Shixuan Liu,

Jiachen T Wang,

Dan Ley,

Qirun Dai,

Benhao Huang,

Jin Huang,

Cathy Jiao,

Hoang Anh Just,

Yijun Pan,

Jingyan Shen,

Yiwen Tu,

Weiyi Wang,

Xinhe Wang,

Shichang Zhang,

Shiyuan Zhang,

Ruoxi Jia,

Himabindu Lakkaraju,

Hao Peng,

Weijing Tang,

Chenyan Xiong,

Jieyu Zhao,

Hanghang Tong,

Han Zhao,

Jiaqi W Ma

Sep 6th 2025

A comprehensive survey on data attribution methods, applications, and evaluation protocols that address how training data influences the behaviors of generative AI systems.

paper

Mirage: Generative World Engine

Dynamics Lab

Jul 1st 2025

#World Model

#Game Generation

The World's First AI-Native UGC Game Engine Powered by a Real-Time World Model

新智元

Blog

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Qiyue Gao,

Xinyu Pi,

Kevin Liu,

Junrong Chen,

Ruolan Yang,

Xinqi Huang,

Xinyu Fang,

Lu Sun,

Gautham Kishore,

Bo Ai,

Stone Tao,

Mengyang Liu,

Jiaxi Yang,

Chao-Jung Lai,

Chuanyang Jin,

Jiannan Xiang,

Benhao Huang,

Zeming Chen,

David Danks,

Hao Su,

Tianming Shu,

Ziqiao Ma,

Lianhui Qin,

Zhiting Hu

Jun 1st 2025

ICLR 2025 Workshop World Models / ACL 2025 Findings

#World Model

#Benchmark

This paper evaluates whether modern Vision-Language Models (VLMs) like GPT-4o and Gemini can act as internal world models (WMs)—systems that understand and predict the world.

paper

🤗HuggingFace

website

DCA-Bench: A Benchmark for Dataset Curation Agents

Benhao Huang,

Yingzhuo Yu,

Jin Huang,

Xingjian Zhang,

Jiaqi W. Ma

May 1st 2025

KDD-2025 DB Track (Oral), ICML-2025 Data World

#LLM Agent

#Benchmark

#Data-centric AI

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms.

2024

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations

Lu Chen,

Shaofeng Li,

Benhao Huang,

Fan Yang,

Zheng Li,

Jie Li,

Luo Yuan

Dec 10th 2024

AAAI 2025

#Adversarial Perturbations

#AI Safety

paper

Defining and extracting generalizable interaction primitives from DNNs

Lu Chen,

Siyu Lou,

Benhao Huang,

Quanshi Zhang

Sep 13th 2024

ICLR-2024

#LLM

#AI Interpretability

Given different DNNs trained for the same task, we develop a new method to extract the interactions shared by these DNNs. Experiments show that the extracted interactions can better reflect common ...

paper

GitHub
