Research
2025
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
Qiyue Gao,
Xinyu Pi,
Kevin Liu,
Junrong Chen,
Ruolan Yang,
Xinqi Huang,
Xinyu Fang,
Lu Sun,
Gautham Kishore,
Bo Ai,
Stone Tao,
Mengyang Liu,
Jiaxi Yang,
Chao-Jung Lai,
Chuanyang Jin,
Jiannan Xiang,
Benhao Huang,
Zeming Chen,
David Danks,
Hao Su,
Tianming Shu,
Ziqiao Ma,
Lianhui Qin,
Zhiting Hu
This paper evaluates whether modern Vision-Language Models (VLMs) like GPT-4o and Gemini can act as internal world models (WMs)—systems that understand and predict the world.
2024
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
Lu Chen,
Shaofeng Li,
Benhao Huang,
Fan Yang,
Zheng Li,
Jie Li,
Luo Yuan
Defining and extracting generalizable interaction primitives from DNNs
Lu Chen,
Siyu Lou,
Benhao Huang,
Quanshi Zhang
Given different DNNs trained for the same task, developed a new method to extract interactions that are shared by these DNNs. Experiments show that the extracted interactions can better reflect common ...
Last Updated on Aug 10th 2025 Powered by greatest-gatsby-academic-template.