Dian Zheng 郑典

Building wonderful Content Creator with MLLMs and diffusion

Incoming Ph.D. student, MMLab@CUHK, Prof. Hongsheng Li

Email: zd1423606603@gmail.com | WeChat: zd1423606603 提示图片


[Google Scholar][Github][X (Twitter)]

Biography

I'm passionately obsessed with novelty, but at the same time, ideas are cheap, show me your code.

I am an incoming PHD student at CUHK, MMLab, advised by Prof. Hongsheng Li. Before that, I obtain my master degree at SYSU in 2025, advised by Prof. Wei-Shi Zheng and B.E. degree at DLUT in 2022.

🛠️ Employment:
[2025/07—Present] Research Intern, Meituan, Mentor: Manyuan Zhang.
[2024/12—2025/06] Research Intern, Shanghai AI Laboratory, honored to be advised by Prof. Ziwei Liu.
[2024/05—2024/11] Research Intern, Alibaba, Mentor: Cao Li.

I am always open to research collaborations about generative model, robotics. Feel free to contact me.

News

Publications (* denotes equal contribution, and denotes the corresponding author)

First, Co-First, Last Author

Architecture Decoupling Is Not All You Need For Unified Multimodal Model
Dian Zheng, Manyuan Zhang, Hongyu Li, Kai Zou, Hongbo Liu, Ziyu Guo, Kaituo Feng, Yexin Liu, Ying Luo, Yan Feng, Peng Pei, Xunliang Cai, Hongsheng Li
Arxiv, 2025
[ArXiv] [Code] [Project Page] [Social Media [机器之心]]

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Dian Zheng*, Ziqi Huang*, Hongbo Liu, Kai Zou, Yinan He, Fan Zhang, Yuanhan Zhang, Jingwen He, Wei-Shi Zheng, Yu Qiao, Ziwei Liu
Arxiv, 2025
[ArXiv] [Code] [Project Page] [Social Media [机器之心]]

Panorama Generation From NFoV Image Done Right
Dian Zheng, Cheng Zhang, Xiao-Ming Wu, Cao Li, Chengfei Lv, Jian-Fang Hu, Wei-Shi Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 🏆️ Highlight
[Paper][Code][Project Page]

Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
Yu Zhou*, Dian Zheng*, Qijie Mo, Ren-Jie Lu, Kun-Yu Lin, Wei-Shi Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 🏆️ Highlight
[Paper]

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv, Yangqi Long, Congzhentao Huang, Cao Li, Chengfei Lv, Hao Ren, Dian Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
[Paper]

Diffusion Model for Volume based Stereo Matching
Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-Shi Zheng
International Journal of Computer Vision (IJCV), 2025
[Paper][Code]

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Dian Zheng, Xiao-Ming Wu, Shuzhou Yang, Jian Zhang, Jian-Fang Hu, Wei-Shi Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper][Code]

Others

EditThinker: Unlocking Iterative Reasoning for Any Image Editor
Hongyu Li, Manyuan Zhang, Dian Zheng, Ziyu Guo, Yimeng Jia, Kaituo Feng, Hao Yu, Yexin Liu, Yan Feng, Peng Pei, Xunliang Cai, Linjiang Huang, Hongsheng Li, Si Liu
Arxiv, 2025
[Paper] [Code] [Project Page]

OneThinker: All-in-one Reasoning Model for Image and Video
Kaituo Feng, Manyuan Zhang, Hongyu Li, Kaixuan Fan, Shuang Chen, Yilei Jiang, Dian Zheng, Peiwen Sun, Yiyuan Zhang, Haoze Sun, Yan Feng, Peng Pei, Xunliang Cai, Xiangyu Yue
Arxiv, 2025
[Paper] [Code] [Social Media [量子位]]

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
Yexin Liu, Manyuan Zhang, Yueze Wang, Hongyu Li, Dian Zheng, Weiming Zhang, Changsheng Lu, Yan Feng, Peng Pei, Xunliang Cai, Harry Yang
Arxiv, 2025
[Paper] [Code]

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
Kai Zou*, Ziqi Huang*, Yuhao Dong*, Shulin Tian, Dian Zheng, Hongbo Liu, Jingwen He, Bin Liu, Yu Qiao, Ziwei Liu
Arxiv, 2025
[Paper][Code][Project Page]

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
Hongbo Liu*, Jingwen He*, Yi Jin, Dian Zheng, Yuhao Dong, Fan Zhang, Ziqi Huang, Yinan He, Yangguang Li, Weichao Chen, Yu Qiao, Wanli Ouyang, Shengjie Zhao, Ziwei Liu
Neural Information Processing Systems (NeurIPS), 2025
[Paper] [Code] [Project Page] [Social Media [量子位]]

An Economic Framework for 6-DoF Grasp Detection
Xiao-Ming Wu*, Jia-Feng Cai*, Jian-Jian Jiang, Dian Zheng, Yi-Lin Wei, Wei-Shi Zheng
European Conference on Computer Vision (ECCV), 2024
[Paper][Code]

Dexterous Grasp Transformer
Guo-Hao Xu*, Yi-Lin Wei*, Dian Zheng, Xiao-Ming Wu, Wei-Shi Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper][Code]

Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
Xiao-Ming Wu, Dian Zheng, Zuhao Liu, Wei-Shi Zheng
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
[Paper][Code]

Generating Anomalies for Video Anomaly Detection with Prompt-based Feature Mapping
Zuhao Liu, Xiao-Ming Wu, Dian Zheng, Kun-Yu Lin, Wei-Shi Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper]

Underwater Stereo Matching Via Unsupervised Appearance And Feature Adaptation Networks
Zhong Wei, Yazhi Yuan, Xinchen Ye, Dian Zheng, Rui Xu
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2022
[Paper]

Academic Service

Conference Reviewer:
CVPR: 2025, 2026
NeurIPS: 2024 (Top Reviewer), 2025
ICLR: 2025, 2026
AAAI: 2026
ACM MM: 2025
ICML: 2025
ICME: 2025, 2026
Journal Reviewer:
TIP
TCSVT
PR
CMV

Awards

Outstanding Master's Thesis in Sun Yat-sen University (中山大学优秀硕士学位论文), 2025.
Outstanding Graduate in Sun Yat-Sen University (中山大学优秀毕业生), 2025.
Xiaomi Grand Prize Scholarship of Sun Yat-Sen University (中山大学小米特等奖学金), 2024.
NeurIPS2024 Top Reviewer, 2024.
First Prize, Academic Scholarship of Sun Yat-Sen University for Graduate Student (中山大学硕士研究生一等奖助金), 2022, 2023, 2024.
Outstanding Bachelor's Thesis in Dalian University of Technology (大连理工大学大学优秀本科学位论文), 2022.
Outstanding Graduate in Dalian University of Technology (大连理工大学优秀毕业生), 2022.