I am now an Associate Professor with the Hong Kong University of Science and Technology (HKUST). Previously, I worked as an Associate Professor at Sun Yat-sen University, and an applied research scientist at Tencent, solving real-world problems using computer vision and machine learning techniques. Prior to Tencent, I worked for Amazon in Palo Alto, California, where I developed deep models for better visual search experience. Before that, I worked as a research scientist in Tencent AI Lab. The techniques I have developed/involved have been shipped to several products in Tencent such as WeChat, QQ, Tencent Video, Tencent Yuanbao, Tencent Cloud, and myapp. I received the Ph.D. degree from Imperial College London, UK, 2016, under the supervision of Prof. Tae-Kyun Kim, and working closely with Dr. Bjorn Stenger, M.E. degree from Institute of Automation, Chinese Academy of Sciences, China, 2012, under the supervision of Prof. Weiming Hu, and B.E. degree from Huazhong University of Science and Technology, China, 2009.

I have published over 90 peer-reviewed papers in top-tier conferences and journals, like ICML, NeurIPS, CVPR, ICCV, ECCV, SIGGRAPH, AAAI, ACL, ACMMM, ICLR, TPAMI, AI, IJCV. My work is selected into the CVPR 2019 Best Paper Finalist and I was awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter). I have served as Associate Editor for IEEE Transactions on Image Processing, Neurocomputing, IET Computer Vision, Guest Editor for CVIU, Area Chair for NeurIPS 2025, ACM MM 2025, ICML 2025, IJCAI 2025, IJCNN 2025, BMVC 2024, and Senior Program Committee member for AAAI and IJCAI, regular reviewer for top conferences and journals like TPAMI, IJCV, CVPR, ICML, ICCV. I have been elected among Top 2% Scientists worldwide (2023 & 2024) by Stanford/Elsevier. [Curriculum Vitae] [HKUST Profile]

Multiple PHD opennings in efficient large model, AIGC (image and video generation), content restoration and enhancement are available. Please check the Join Us page for details.

Research

I conduct research on creative AI. Specifically, my current research focuses on several topics, such as image/video generation and restoration/enhancement. My research is supported by the following sponsors/agencies.


Updates

  • 2025/06:   MaterialMVP and MOERL are accepted by ICCV 2025.
  • 2025/06:   Our group secured funding from Tencent (news), ByteDance, Huawei, Wiener Intelligence, Video Rebirth, Qingdao Municipal Bureau of Science and Technology.
  • 2025/06:   Paper to appear in TKDE.
  • 2025/05:   Papers accepted by TVCG and ACL 2025.
  • 2025/05:   Serve as Area Chair for BMVC 2025.
  • 2025/04:   Two papers are accepted by SIGGRAPH 2025.
  • 2025/04:   Serve as COI Coordinator for SIGGRAPH Asia 2025.
  • 2025/04:   I will serve as Area Chair for NeurIPS 2025.
  • 2025/03:   Content restoration works to appear in TPAMI and PR.
  • 2025/03:   I serve as Workshop Chair for CVM 2025.
  • 2025/03:   I will serve as Area Chair for ACM Multimedia 2025.
  • 2025/02:   Three papers accepted by CVPR 2025.
  • 2025/01:   I will serve as Associate Editor for IEEE Transactions on Image Processing.

Show more

Recent Work [More]

UNIC: Unified In-Context Video Editing,
Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo,
arXiv:2506.04216.
[arXiv] [Project Page]
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation,
Zhe Kong, Feng Gao, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Xunliang Cai, Guanying Chen, Wenhan Luo,
arXiv:2505.22647.
[arXiv] [Project Page] [Code] GitHub stars
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion,
Zebin He, Mingxin Yang, Shuhui Yang, Yixuan Tang, Tao Wang, Kaihao Zhang, Guanying Chen, Yuhong Liu, Jie Jiang, Chunchao Guo, Wenhan Luo,
Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025.
[arXiv] [Project Page] [Code] GitHub stars
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration,
Tao Wang, Peiwen Xia, Bo Li, Peng-Tao Jiang, Zhe Kong, Kaihao Zhang, Tong Lu, Wenhan Luo,
Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025.
DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution,
Zhe Kong, Le Li, Yong Zhang, Feng Gao, Shaoshu Yang, Tao Wang, Kaihao Zhang, Zhuoliang Kang, Xiaoming Wei, Guanying Chen, Wenhan Luo,
ACM SIGGRAPH, 2025.
[PDF] [Project Page]
MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration,
Zhi Jin, Yuwei Qiu, Kaihao Zhang, Hongdong Li, Wenhan Luo,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), to appear.
[arXiv] [Code]
StyleMaster: Stylize Your Video with Artistic Generation and Translation,
Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo,
Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2025.
[arXiv] [Github] [Project Page] GitHub stars
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling,
Jingyun Xue, Hongfa Wang, Qi Tian, Yue Ma, Andong Wang, Zhiyuan Zhao, Shaobo Min, Wenzhe Zhao, Kaihao Zhang, Heung-Yeung Shum, Wei Liu, Mengyang Liu, Wenhan Luo,
International Conference on Learning Representations (ICLR), 2025.
[PDF] [Project Page] [API in Tencent Cloud]

Experience

I have studied/interned/worked in the following affiliations.