I am now an Associate Professor with the Hong Kong University of Science and Technology (HKUST), leading the Creative, Controllable & Cognitive Computing Group (C4G) in HKUST. Previously, I worked as an Associate Professor at Sun Yat-sen University, and an applied research scientist at Tencent, solving real-world problems using computer vision and machine learning techniques. Prior to Tencent, I worked for Amazon in Palo Alto, California, where I developed deep models for better visual search experience. Before that, I worked as a research scientist in Tencent AI Lab. The techniques I have developed/involved have been shipped to several products in Tencent such as WeChat, QQ, Tencent Video, Tencent Yuanbao, Tencent Cloud, and myapp. I received the Ph.D. degree from Imperial College London, UK, 2016, under the supervision of Prof. Tae-Kyun Kim, and working closely with Dr. Bjorn Stenger, M.E. degree from Institute of Automation, Chinese Academy of Sciences, China, 2012, under the supervision of Prof. Weiming Hu, and B.E. degree from Huazhong University of Science and Technology, China, 2009.

I have published over 100 peer-reviewed papers in top-tier conferences and journals, like ICML, NeurIPS, CVPR, ICCV, ECCV, SIGGRAPH, ACL, ICLR, TPAMI, AI, IJCV. My work is selected into the CVPR 2019 Best Paper Finalist and I was awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter). I have served as Senior Area Editor for IEEE Signal Processing Letters, Associate Editor for IEEE Transactions on Image Processing, Neurocomputing, IET Computer Vision, (Senior) Area Chair for ICLR, NeurIPS, ACM MM, ICML, IJCAI, BMVC, and Senior Program Committee member for AAAI and IJCAI, regular reviewer for top conferences and journals like TPAMI, IJCV, CVPR, ICML, ICCV. I have been elected among Top 2% Scientists worldwide (2023, 2024, 2025) by Stanford/Elsevier.

Vacancies: our group begin to review the application for 27 spring/fall PhD students. The openings are funding-dependent on a rolling basis. If you are interested, please email me your CV and research interests. Priority will be given to candidates with outstanding academic background and strong publication record, those who are competitive applicants for the Hong Kong PhD Fellowship Scheme (HKPFS), as well as individuals who are willing to work as a Research Assistant (RA) or intern prior to their PhD studies.

Research

I conduct research on creative AI. Specifically, my current research focuses on several topics, such as image/video generation and restoration/enhancement. My research is supported by the following sponsors.

Updates

2026/05: Received IEEE Signal Processing Society Outstanding Editorial Board Member Award for my service to IEEE Signal Processing Letters as Senior Area Editor.
2026/05: DiNa-LRM is accepted by ICML 2026.
2026/03: I will give a talk in Chinese Congress on Image and Graphics 2026 (中国图象图形大会): 低质量视觉处理与质量评价论坛.
2026/03: I will serve as Senior Area Chair for NeurIPS 2026 and Area Chair for ACCV 2026.
2026/02: I will serve as Area Chair for ACM Multimedia 2026.
2026/02: Received an Outstanding Senior Program Committee Service Award (35 out of 1728 SPC) from AAAI 2026 Organization.
2026/02: Recent acceptance: 2 ICLR + 6 CVPR.
2025/11: I will serve as Area Chair for ICML 2026.
2025/10: I will give a talk in SIGGRAPH Asia 2025: The Asiagraphics Workshop on Intelligent Graphics.
2025/09: We are organizing a workshop in AAAI 2026: Consistency in Video Generative Models: from Clip to Wild. Welcome to participate to win the top prize of up to 200,000 CNY.
2025/08: I will serve as Area Chair for ICLR 2026.
2025/07: I will serve as Senior Program Committe (SPC) member for AAAI 2026.

2025/07: Our group webpage is now live.
2025/07: I will give a talk in ChinaSI 2025 (中国空间智能大会).
2025/06: Appointed as Senior Area Editor (S-AE) for IEEE Signal Processing Letters.
2025/06: MaterialMVP and MOERL are accepted by ICCV 2025.
2025/06: Our group secured funding from Tencent (news), ByteDance, Huawei, Wiener Intelligence, Video Rebirth, Qingdao Municipal Bureau of Science and Technology.
2025/06: Paper to appear in TKDE.
2025/05: Papers accepted by TVCG and ACL 2025.
2025/05: Serve as Area Chair for BMVC 2025.
2025/04: Two papers are accepted by SIGGRAPH 2025.
2025/04: Serve as COI Coordinator for SIGGRAPH Asia 2025.
2025/04: I will serve as Area Chair for NeurIPS 2025.
2025/03: Content restoration works to appear in TPAMI and PR.
2025/03: I serve as Workshop Chair for CVM 2025.
2025/03: I will serve as Area Chair for ACM Multimedia 2025.
2025/02: Three papers accepted by CVPR 2025.
2025/01: I will serve as Associate Editor for IEEE Transactions on Image Processing.
2025/01: Two papers accepted by ICLR 2025.
2025/01: Uni-MoE accepted by TPAMI.
2024/12: Invited to serve as Area Chair for IJCAI 2025.
2024/12: Invited to serve as Area Chair for ICML 2025.
~~2024/11: Invited to serve as Senior Program Committe member (SPC) for IJCAI 2025.~~
2024/10: Awarded the CCF-Tencent Rhino-Bird Faculty Fund Excellence Award, see the news.
2024/09: Two Papers accepted by NeurIPS 2024.
2024/09: Elected among Stanford/Elsevier Top 2% Scientists List 2024.
2024/09: Paper to appear in TKDE.
2024/09: Paper to appear in IJCV.
2024/08: Paper to appear in TCSVT.
2024/07: Paper accepted by ACM MM 2024.
2024/07: Four papers accepted by ECCV 2024.
2024/06: Invited to serve as Senior Program Committe member (SPC) for AAAI 2025.
2024/05: Invited to serve as Area Chair for BMVC 2024.
2024/05: Paper accepted by ICML 2024.
2024/04: Our team win two championship awards in the two tracks of NTIRE 2024 challenges (Bracketing Image Restoration and Enhancement Challenge - Track 1 & 2) in conjunction with CVPR2024, see the award.
2024/04: Paper to appear in IEEE TGRS.
2024/03: Together with Samsung, we win the 2rd place in the competition of Few-shot RAW Image Denoising @ MIPI2024 in conjunction with CVPR2024, see the award.
2024/03: Paper to appear in TCSVT.
2024/03: Paper to appear in IJCV.
2024/02: Two papers accepted by CVPR 2024.
2024/02: Paper accepted by COLING 2024 (multi-modal in-context learning).
2024/02: I will be joining the Hong Kong University of Science and Technology (HKUST) as Associate Professor in 2024 spring/summer.
2024/01: Paper accepted by ICLR 2024.
2023/12: Paper to appear in TNNLS (blind face restoration).
2023/12: Invited to serve as Senior Program Committe member (SPC) for IJCAI 2024.
2023/11: Paper to appear in TCSVT (under-display camera face image restoration).
2023/10: Elected among Top 2% Scientists worldwide 2023 by Stanford University.
2023/09: Paper accepted by NeurIPS 2023 (punctuation-level attacks fooling text models).
2023/09: Paper to appear in TCSVT (image deblurring benchmark).
2023/09: Paper to appear in Pattern Recognition (image dehazing).
2023/07: I was granted the CCF-Tencent Rhino-Bird Faculty Research Fund, see the news.
2023/07: Paper to apppear in TCSVT (all-in-one weather-degraded image restoration).
2023/07: Four papers accepted by ICCV 2023.
2023/07: Invited to serve as a member of Senior Program Committe (SPC) for AAAI 2024.
2023/06: Paper accepted by IROS 2023 (hand interaction tracking).
2023/02: Paper accepted by CVPR 2023 (reflection removal against adversarial attacks).
2023/02: Elevation to IEEE Senior Member.
2023/01: Paper accepted by TNNLS (presentation attack detection).
2022/12: Invited to serve as a member of Senior Program Committe (SPC) for IJCAI 2023.
2022/11: Paper accepted by AAAI2023 (low-light image enhancement).
2022/11: TMM paper accepted (multi-modal retrieval).
2022/10: TMM paper accepted (image dehazing).
2022/09: Awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter), see the news.
2022/08: Paper accepted by WACV2023 (few-shot object counting).
2022/07: I joined Sun Yat-sen University as an associate professor.
2022/06: Paper accepted by ACM MM 2022 (multi-object tracking).
2022/05: TPAMI paper accepted (face hallucination).
2022/05: Paper accepted by IJCV (image deblurring survey).
2022/04: Paper accepted by IJCV (image deraining).
2022/03: Paper accepted by CVPR2022 (aesthetic text logo synthesis).
2022/01: TPAMI paper accepted (video deraining).
2021/09: TIP paper accepted (image deraining).
2021/08: Invited to serve as a Senior PC member for AAAI 2022.
2021/08: TIP paper accepted (image desnow).
2021/07: Paper accepted by ICCV2021 (image SR benchmarking).
2021/07: One paper (image deblur & SR) to appear in IEEE Transactions on Image Processing.
2021/06: TMM paper accepted (action recognition).
2021/05: Our work of active visual tracking is accepted by ICML2021.
2021/05: One paper of image dehazing to appear in IEEE Transactions on Image Processing.
2021/04: Our work of human image synthesis is accepted to appear in TPAMI.
2021/03: One paper to appear in IEEE Transactions on Geoscience and Remote Sensing.
2021/02: One paper to appear in IEEE Transactions on Multimedia.
2020/12: The paper “Multiple Object Tracking: A Literature Review” is accepted by Artificial Intelligence.
2020/12: Invited to serve as a Senior PC member for IJCAI 2021.
2020/11: One paper of pedestrian detection to appear in IEEE Transactions on Image Processing.
2020/09: An invited talk is given in SUSTech, hosted by Prof. Xiaoying Tang.
2020/09: One paper of optical flow estimation to appear in IEEE Transactions on Image Processing.
2020/07: One paper to appear in ACM MM 2020 (Oral).
2020/07: One paper to appear in ECCV2020.
2020/06: One paper of multiple object tracking to appear in Pattern Recognition.
2020/05: We are organizing a special issue of action recognition and detection on CVIU. Submission deadline is Sep 15th. See the CFP if interested
2020/02: Two papers (one oral + one poster) to appear in CVPR2020.
2019/10: One paper to appear in TPAMI, entitled “AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking”.
2019/09: Code and dataset of our ICCV2019 paper for motion imitation, appearance transfer and novel view synthesis are released. Check the project page here.
2019/09: One paper to appear in IJCV.
2019/07: One paper to appear in ICCV2019.
2019/07: The code of our ACL2019 paper is released. Check it here.
2019/07: The code of our ICLR2019 paper is released. Check it here.
2019/05: Join Amazon in California as a research scientist.
2019/06: Our CVPR paper entitled “Learning to Compose Dynamic Tree Structures for Visual Context” is selected as one of the best paper finalists (50 out of the 1294 accepted papers in CVPR2019).
2019/05: One paper of video grounding to appear in ACL2019 as a long paper, and oral presentation.
2019/05: Serve as program committee member of the workshop of Vision Meets Drones 2019: A Challenge in conjunction with ICCV2019.
2019/03: Four papers (2 orals + 2 posters) to appear in CVPR2019.
2019/02: Serve as program committee member of the 4th BMTT MOT Challenge Workshop and the 2nd Workshop and Challenge on Target Re-identification and Multi-Target Multi-Camera Tracking in conjunction with CVPR2019.
2019/02: The code of our CVPR 2018 paper is released. Check it here.
2019/01: Our work of “End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning” is accepted by TPAMI.
2018/12: One paper to appear in ICLR2019. Congratulations to Fangwei Zhong.
2018/11: One paper to appear in NIPS2018 workshop on Deep Reinforcement Learning.
2018/11: One paper to appear in AAAI2019. Congratulations to Kaihao Zhang.
2018/10: Welcome Tianrui Liu (Imperial College London) on board as a research intern.
2018/09: The code of our ICML2018 paper is released. Check it here.
2018/09: The code of our ECCV2018 paper “Bi-Real Net” is released. Check it here.
2018/08: Our work of video deblur is accepted by IEEE Transactions on Image Processing. Congratulations to Kaihao Zhang.
2018/08: One paper of multiple object tracking is accepted by IEEE Transactions on Image Processing.
2018/07: The dataset for our CVPR2018 paper, Sky Scene is released. See our project page for details.
2018/07: One paper to appear in ECCV2018. Congratulations to Zechun Liu.
2018/05: Our work of End-to-end Active Object Tracking via Reinforcement Learning is accepted by ICML2018. The camera-ready version will come soon.
2018/05: Serve as a member of the advisory committee of the workshop of Vision Meets Drone: A Challenge (VisDrone2018, for short) in conjunction with ECCV2018.
2018/04: Welcome Jia Wan (NWPU) on board as an intern.
2018/02: One paper to appear in CVPR2018. Congratulations to Wei Xiong.
2018/01: Welcome our intern Zechun Liu (HKUST) on board.
2017/10: Welcome Yiming Chen (Imperial College London) on board as an intern in Tencent AI Lab.
2017/08: Welcome Kaihao Zhang on board as intern in Tencent AI Lab. Kaihao is from Australian National University and will work close with me for about six months.
2017/06: Welcome Weiyue Su and Wei Xiong on board as intern in Tencent AI Lab. Weiyue is from South China University of Technology. Wei is from Wuhan University.
2017/05: Serve as a program committee member of the First Joint BMPP-PETS Workshop on Tracking and Surveillance in conjunction with CVPR2017.
2017/04: One paper to appear in CVPR2017.
2016/07: Join Tencent AI Lab as a research scientist.
2016/06: Pass viva exam and obtained the Ph.D. degree, examined by Tao (Tony) Xiang from Queen Mary University of London.
2016/05: Serve as a program committee member of the workshop Benchmarking Multi-Target Tracking: MOTChallenge in conjunction with ECCV2016.
2015/02: Start internship in Microsoft Research Asia with Dr. David Wipf (from Feb 2015 to June 2015).

Show less

Recent Work [More]

	MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training, Lianyu Pang, Tianlin Pan, Cheng Da, Changqian Yu, Huan Yang, Kun Gai, Song Guo, Wenhan Luo, preprint. [arXiv] [Project Page] [Code]
	Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence, Zebin He, Mingxin Yang, Shuhui Yang, Hanxiao Sun, Xintong Han, Chunchao Guo, Wenhan Luo, arXiv:2605.19727. [arXiv] [Project Page] [Code]
	Attention Hijacking: Response Manipulation Across Queries in Vision-Language Models, Zhiqiang Wang, Dongrui Liu, Yan Li, Zonghao Ying, Wei Xue, Wenhan Luo, Yike Guo, arXiv:2605.17310. [arXiv] [Project Page] [Code]
	Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models, Yicheng Ji, Zhizhou Zhong, Jun Zhang, Qin Yang, XiTai Jin, Ying Qin, Wenhan Luo, Shuiyang Mao, Wei Liu, Huan Li, arXiv:2605.09681. [arXiv] [Project Page] [Code]
	Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling, Gongye Liu, Bo Yang, Yida Zhi, Zhizhou Zhong, Lei Ke, Didan Deng, Han Gao, Yongxiang Huang, Kaihao Zhang, Hongbo Fu, Wenhan Luo, International Conference on Machine Learning (ICML), 2026. [arXiv] [Code]
	Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models, Zixuan Ye, Quande Liu, Cong Wei, Yuanxing Zhang, Xintao Wang, Pengfei Wan, Kun Gai, Wenhan Luo, Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2026. [arXiv] [Project Page]
	AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement, Zhizhou Zhong, Yicheng Ji, Zhe Kong, Yiying Liu, Jiarui Wang, Jiasun Feng, Lupeng Liu, Xiangyi Wang, Yanjia Li, Yuqing She, Ying Qin, Huan Li, Shuiyang Mao, Wei Liu, Wenhan Luo, arXiv:2511.23475. [arXiv] [Project Page] [Code] [Gradio] [Hugging Face Model]
	UNIC: Unified In-Context Video Editing, Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo, International Conference on Learning Representations (ICLR), 2026. [arXiv] [Project Page]
	Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation, Zhe Kong, Feng Gao, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Xunliang Cai, Guanying Chen, Wenhan Luo, Neural Information Processing Systems (NeurIPS), 2025. [arXiv] [Project Page] [Code] [Hugging Face Model] [Gradio]
	Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency, Yiyang Cai, Zhengkai Jiang, Yulong Liu, Chunyang Jiang, Wei Xue, Yike Guo, Wenhan Luo, Neural Information Processing Systems (NeurIPS), 2025. [PDF] [Project Page] [Code]
	InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing, Shaoshu Yang, Zhe Kong, Feng Gao, Meng Cheng, Xiangyu Liu, Yong Zhang, Zhuoliang Kang, Wenhan Luo, Xunliang Cai, Ran He, Xiaoming Wei, arXiv:2508.14033.* [arXiv] [Project Page] [Code]
	MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion, Zebin He, Mingxin Yang, Shuhui Yang, Yixuan Tang, Tao Wang, Kaihao Zhang, Guanying Chen, Yuhong Liu, Jie Jiang, Chunchao Guo, Wenhan Luo, Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025. (Highlight) [arXiv] [Project Page] [Code]
	MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration, Tao Wang, Peiwen Xia, Bo Li, Peng-Tao Jiang, Zhe Kong, Kaihao Zhang, Tong Lu, Wenhan Luo, Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025.
	DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution, Zhe Kong, Le Li, Yong Zhang, Feng Gao, Shaoshu Yang, Tao Wang, Kaihao Zhang, Zhuoliang Kang, Xiaoming Wei, Guanying Chen, Wenhan Luo, ACM SIGGRAPH, 2025. [PDF] [Project Page] [Code]
	MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration, Zhi Jin, Yuwei Qiu, Kaihao Zhang, Hongdong Li, Wenhan Luo, IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 47, pp. 5990–6005, 2025. [arXiv] [Code]
	StyleMaster: Stylize Your Video with Artistic Generation and Translation, Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo, Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2025. [arXiv] [Github] [Project Page]
	Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling, Jingyun Xue, Hongfa Wang, Qi Tian, Yue Ma, Andong Wang, Zhiyuan Zhao, Shaobo Min, Wenzhe Zhao, Kaihao Zhang, Heung-Yeung Shum, Wei Liu, Mengyang Liu, Wenhan Luo, International Conference on Learning Representations (ICLR), 2025. [PDF] [Project Page] [API in Tencent Cloud]

Experience

I have studied/interned/worked in the following affiliations.

Wenhan Luo

Research

Updates

Recent Work [More]

Experience