I am now an Associate Professor with the Hong Kong University of Science and Technology (HKUST). Previously, I worked as an Associate Professor at Sun Yat-sen University, and an applied research scientist at Tencent, solving real-world problems using computer vision and machine learning techniques. Prior to Tencent, I worked for Amazon in Palo Alto, California, where I developed deep models for better visual search experience. Before that, I worked as a research scientist in Tencent AI Lab. The techniques I have developed/involved have been shipped to several products in Tencent such as WeChat, QQ, Tencent Video, Tencent Yuanbao, Tencent Cloud, and myapp. I received the Ph.D. degree from Imperial College London, UK, 2016, under the supervision of Prof. Tae-Kyun Kim, and working closely with Dr. Bjorn Stenger, M.E. degree from Institute of Automation, Chinese Academy of Sciences, China, 2012, under the supervision of Prof. Weiming Hu, and B.E. degree from Huazhong University of Science and Technology, China, 2009.
I have published over 90 peer-reviewed papers in top-tier conferences and journals, like ICML, NeurIPS, CVPR, ICCV, ECCV, SIGGRAPH, AAAI, ACL, ACMMM, ICLR, TPAMI, AI, IJCV. My work is selected into the CVPR 2019 Best Paper Finalist and I was awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter). I have served as Associate Editor for IEEE Transactions on Image Processing, Neurocomputing, IET Computer Vision, Guest Editor for CVIU, Area Chair for NeurIPS 2025, ACM MM 2025, ICML 2025, IJCAI 2025, IJCNN 2025, BMVC 2024, and Senior Program Committee member for AAAI and IJCAI, regular reviewer for top conferences and journals like TPAMI, IJCV, CVPR, ICML, ICCV. I have been elected among Top 2% Scientists worldwide (2023 & 2024) by Stanford/Elsevier. [Curriculum Vitae] [HKUST Profile]
Multiple PHD opennings in efficient large model, AIGC (image and video generation), content restoration and enhancement are available. Please check the Join Us page for details.
Research
I conduct research on creative AI. Specifically, my current research focuses on several topics, such as image/video generation and restoration/enhancement. My research is supported by the following sponsors/agencies.
Updates
- 2025/06: MaterialMVP and MOERL are accepted by ICCV 2025.
- 2025/06: Our group secured funding from Tencent (news), ByteDance, Huawei, Wiener Intelligence, Video Rebirth, Qingdao Municipal Bureau of Science and Technology.
- 2025/06: Paper to appear in TKDE.
- 2025/05: Papers accepted by TVCG and ACL 2025.
- 2025/05: Serve as Area Chair for BMVC 2025.
- 2025/04: Two papers are accepted by SIGGRAPH 2025.
- 2025/04: Serve as COI Coordinator for SIGGRAPH Asia 2025.
- 2025/04: I will serve as Area Chair for NeurIPS 2025.
- 2025/03: Content restoration works to appear in TPAMI and PR.
- 2025/03: I serve as Workshop Chair for CVM 2025.
- 2025/03: I will serve as Area Chair for ACM Multimedia 2025.
- 2025/02: Three papers accepted by CVPR 2025.
- 2025/01: I will serve as Associate Editor for IEEE Transactions on Image Processing.
Show more
- 2025/01: Two papers accepted by ICLR 2025.
- 2025/01: Uni-MoE accepted by TPAMI.
- 2024/12: Invited to serve as Area Chair for IJCAI 2025.
- 2024/12: Invited to serve as Area Chair for ICML 2025.
2024/11: Invited to serve as Senior Program Committe member (SPC) for IJCAI 2025.
- 2024/10: Awarded the CCF-Tencent Rhino-Bird Faculty Fund Excellence Award, see the news.
- 2024/09: Two Papers accepted by NeurIPS 2024.
- 2024/09: Elected among Stanford/Elsevier Top 2% Scientists List 2024.
- 2024/09: Paper to appear in TKDE.
- 2024/09: Paper to appear in IJCV.
- 2024/08: Paper to appear in TCSVT.
- 2024/07: Paper accepted by ACM MM 2024.
- 2024/07: Four papers accepted by ECCV 2024.
- 2024/06: Invited to serve as Senior Program Committe member (SPC) for AAAI 2025.
- 2024/05: Invited to serve as Area Chair for BMVC 2024.
- 2024/05: Paper accepted by ICML 2024.
- 2024/04: Our team win two championship awards in the two tracks of NTIRE 2024 challenges (Bracketing Image Restoration and Enhancement Challenge - Track 1 & 2) in conjunction with CVPR2024, see the award.
- 2024/04: Paper to appear in IEEE TGRS.
- 2024/03: Together with Samsung, we win the 2rd place in the competition of Few-shot RAW Image Denoising @ MIPI2024 in conjunction with CVPR2024, see the award.
- 2024/03: Paper to appear in TCSVT.
- 2024/03: Paper to appear in IJCV.
- 2024/02: Two papers accepted by CVPR 2024.
- 2024/02: Paper accepted by COLING 2024 (multi-modal in-context learning).
- 2024/02: I will be joining the Hong Kong University of Science and Technology (HKUST) as Associate Professor in 2024 spring/summer.
- 2024/01: Paper accepted by ICLR 2024.
- 2023/12: Paper to appear in TNNLS (blind face restoration).
- 2023/12: Invited to serve as Senior Program Committe member (SPC) for IJCAI 2024.
- 2023/11: Paper to appear in TCSVT (under-display camera face image restoration).
- 2023/10: Elected among Top 2% Scientists worldwide 2023 by Stanford University.
- 2023/09: Paper accepted by NeurIPS 2023 (punctuation-level attacks fooling text models).
- 2023/09: Paper to appear in TCSVT (image deblurring benchmark).
- 2023/09: Paper to appear in Pattern Recognition (image dehazing).
- 2023/07: I was granted the CCF-Tencent Rhino-Bird Faculty Research Fund, see the news.
- 2023/07: Paper to apppear in TCSVT (all-in-one weather-degraded image restoration).
- 2023/07: Four papers accepted by ICCV 2023.
- 2023/07: Invited to serve as a member of Senior Program Committe (SPC) for AAAI 2024.
- 2023/06: Paper accepted by IROS 2023 (hand interaction tracking).
- 2023/02: Paper accepted by CVPR 2023 (reflection removal against adversarial attacks).
- 2023/02: Elevation to IEEE Senior Member.
- 2023/01: Paper accepted by TNNLS (presentation attack detection).
- 2022/12: Invited to serve as a member of Senior Program Committe (SPC) for IJCAI 2023.
- 2022/11: Paper accepted by AAAI2023 (low-light image enhancement).
- 2022/11: TMM paper accepted (multi-modal retrieval).
- 2022/10: TMM paper accepted (image dehazing).
- 2022/09: Awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter), see the news.
- 2022/08: Paper accepted by WACV2023 (few-shot object counting).
- 2022/07: I joined Sun Yat-sen University as an associate professor.
- 2022/06: Paper accepted by ACM MM 2022 (multi-object tracking).
- 2022/05: TPAMI paper accepted (face hallucination).
- 2022/05: Paper accepted by IJCV (image deblurring survey).
- 2022/04: Paper accepted by IJCV (image deraining).
- 2022/03: Paper accepted by CVPR2022 (aesthetic text logo synthesis).
- 2022/01: TPAMI paper accepted (video deraining).
- 2021/09: TIP paper accepted (image deraining).
- 2021/08: Invited to serve as a Senior PC member for AAAI 2022.
- 2021/08: TIP paper accepted (image desnow).
- 2021/07: Paper accepted by ICCV2021 (image SR benchmarking).
- 2021/07: One paper (image deblur & SR) to appear in IEEE Transactions on Image Processing.
- 2021/06: TMM paper accepted (action recognition).
- 2021/05: Our work of active visual tracking is accepted by ICML2021.
- 2021/05: One paper of image dehazing to appear in IEEE Transactions on Image Processing.
- 2021/04: Our work of human image synthesis is accepted to appear in TPAMI.
- 2021/03: One paper to appear in IEEE Transactions on Geoscience and Remote Sensing.
- 2021/02: One paper to appear in IEEE Transactions on Multimedia.
- 2020/12: The paper “Multiple Object Tracking: A Literature Review” is accepted by Artificial Intelligence.
- 2020/12: Invited to serve as a Senior PC member for IJCAI 2021.
- 2020/11: One paper of pedestrian detection to appear in IEEE Transactions on Image Processing.
- 2020/09: An invited talk is given in SUSTech, hosted by Prof. Xiaoying Tang.
- 2020/09: One paper of optical flow estimation to appear in IEEE Transactions on Image Processing.
- 2020/07: One paper to appear in ACM MM 2020 (Oral).
- 2020/07: One paper to appear in ECCV2020.
- 2020/06: One paper of multiple object tracking to appear in Pattern Recognition.
- 2020/05: We are organizing a special issue of action recognition and detection on CVIU. Submission deadline is Sep 15th. See the CFP if interested
- 2020/02: Two papers (one oral + one poster) to appear in CVPR2020.
- 2019/10: One paper to appear in TPAMI, entitled “AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking”.
- 2019/09: Code and dataset of our ICCV2019 paper for motion imitation, appearance transfer and novel view synthesis are released. Check the project page here.
- 2019/09: One paper to appear in IJCV.
- 2019/07: One paper to appear in ICCV2019.
- 2019/07: The code of our ACL2019 paper is released. Check it here.
- 2019/07: The code of our ICLR2019 paper is released. Check it here.
- 2019/05: Join Amazon in California as a research scientist.
- 2019/06: Our CVPR paper entitled “Learning to Compose Dynamic Tree Structures for Visual Context” is selected as one of the best paper finalists (50 out of the 1294 accepted papers in CVPR2019).
- 2019/05: One paper of video grounding to appear in ACL2019 as a long paper, and oral presentation.
- 2019/05: Serve as program committee member of the workshop of Vision Meets Drones 2019: A Challenge in conjunction with ICCV2019.
- 2019/03: Four papers (2 orals + 2 posters) to appear in CVPR2019.
- 2019/02: Serve as program committee member of the 4th BMTT MOT Challenge Workshop and the 2nd Workshop and Challenge on Target Re-identification and Multi-Target Multi-Camera Tracking in conjunction with CVPR2019.
- 2019/02: The code of our CVPR 2018 paper is released. Check it here.
- 2019/01: Our work of “End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning” is accepted by TPAMI.
- 2018/12: One paper to appear in ICLR2019. Congratulations to Fangwei Zhong.
- 2018/11: One paper to appear in NIPS2018 workshop on Deep Reinforcement Learning.
- 2018/11: One paper to appear in AAAI2019. Congratulations to Kaihao Zhang.
- 2018/10: Welcome Tianrui Liu (Imperial College London) on board as a research intern.
- 2018/09: The code of our ICML2018 paper is released. Check it here.
- 2018/09: The code of our ECCV2018 paper “Bi-Real Net” is released. Check it here.
- 2018/08: Our work of video deblur is accepted by IEEE Transactions on Image Processing. Congratulations to Kaihao Zhang.
- 2018/08: One paper of multiple object tracking is accepted by IEEE Transactions on Image Processing.
- 2018/07: The dataset for our CVPR2018 paper, Sky Scene is released. See our project page for details.
- 2018/07: One paper to appear in ECCV2018. Congratulations to Zechun Liu.
- 2018/05: Our work of End-to-end Active Object Tracking via Reinforcement Learning is accepted by ICML2018. The camera-ready version will come soon.
- 2018/05: Serve as a member of the advisory committee of the workshop of Vision Meets Drone: A Challenge (VisDrone2018, for short) in conjunction with ECCV2018.
- 2018/04: Welcome Jia Wan (NWPU) on board as an intern.
- 2018/02: One paper to appear in CVPR2018. Congratulations to Wei Xiong.
- 2018/01: Welcome our intern Zechun Liu (HKUST) on board.
- 2017/10: Welcome Yiming Chen (Imperial College London) on board as an intern in Tencent AI Lab.
- 2017/08: Welcome Kaihao Zhang on board as intern in Tencent AI Lab. Kaihao is from Australian National University and will work close with me for about six months.
- 2017/06: Welcome Weiyue Su and Wei Xiong on board as intern in Tencent AI Lab. Weiyue is from South China University of Technology. Wei is from Wuhan University.
- 2017/05: Serve as a program committee member of the First Joint BMPP-PETS Workshop on Tracking and Surveillance in conjunction with CVPR2017.
- 2017/04: One paper to appear in CVPR2017.
- 2016/07: Join Tencent AI Lab as a research scientist.
- 2016/06: Pass viva exam and obtained the Ph.D. degree, examined by Tao (Tony) Xiang from Queen Mary University of London.
- 2016/05: Serve as a program committee member of the workshop Benchmarking Multi-Target Tracking: MOTChallenge in conjunction with ECCV2016.
- 2015/02: Start internship in Microsoft Research Asia with Dr. David Wipf (from Feb 2015 to June 2015).
Show less
|
UNIC: Unified In-Context Video Editing,
Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo,
arXiv:2506.04216.
[arXiv]
[Project Page]
|
|
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation,
Zhe Kong, Feng Gao, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Xunliang Cai, Guanying Chen, Wenhan Luo,
arXiv:2505.22647.
[arXiv]
[Project Page]
[Code]
|
|
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion,
Zebin He, Mingxin Yang, Shuhui Yang, Yixuan Tang, Tao Wang, Kaihao Zhang, Guanying Chen, Yuhong Liu, Jie Jiang, Chunchao Guo, Wenhan Luo,
Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025.
[arXiv]
[Project Page]
[Code]
|
|
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration,
Tao Wang, Peiwen Xia, Bo Li, Peng-Tao Jiang, Zhe Kong, Kaihao Zhang, Tong Lu, Wenhan Luo,
Proc. of International Conference on Computer Vision (ICCV), Hawaii, USA, 2025.
|
|
DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution,
Zhe Kong, Le Li, Yong Zhang, Feng Gao, Shaoshu Yang, Tao Wang, Kaihao Zhang, Zhuoliang Kang, Xiaoming Wei, Guanying Chen, Wenhan Luo,
ACM SIGGRAPH, 2025.
[PDF]
[Project Page]
|
|
MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration,
Zhi Jin, Yuwei Qiu, Kaihao Zhang, Hongdong Li, Wenhan Luo,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), to appear.
[arXiv]
[Code]
|
|
StyleMaster: Stylize Your Video with Artistic Generation and Translation,
Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo,
Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2025.
[arXiv]
[Github]
[Project Page]
|
|
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling,
Jingyun Xue, Hongfa Wang, Qi Tian, Yue Ma, Andong Wang, Zhiyuan Zhao, Shaobo Min, Wenzhe Zhao, Kaihao Zhang, Heung-Yeung Shum, Wei Liu, Mengyang Liu, Wenhan Luo,
International Conference on Learning Representations (ICLR), 2025.
[PDF]
[Project Page]
[API in Tencent Cloud]
|
Experience
I have studied/interned/worked in the following affiliations.
