I am looking for 3~4 PhD students (starting at 2024 autumn or 2025 spring), 1 post-doc (starting at any time), multiple interns and RA (starting at any time, can be onsite or remote) to work with me at HKUST. Please Email me (whluo.china AT gmail.com) with your CV and indicate the position you are interested. I will quickly get back to candidates if the fitting is well.

I will be joining the Hong Kong University of Science and Technology (HKUST) as Associate Professor in 2024 spring/summer.

I conduct research on creative AI. Previously, I worked as an Associate Professor at Sun Yat-sen University, and an applied research scientist at Tencent, solving real-world problems using computer vision and machine learning techniques. Prior to Tencent, I worked for Amazon (A9) in Palo Alto, California, where I developed deep models for better visual search experience. Before that, I worked as a research scientist in Tencent AI Lab. The techniques I have developed/involved have been shipped to several products in Tencent such as WeChat, QQ, Tencent Video, and myapp. I received the Ph.D. degree from Imperial College London, UK, 2016, M.E. degree from Institute of Automation, Chinese Academy of Sciences, China, 2012 and B.E. degree from Huazhong University of Science and Technology, China, 2009. I have published over 80 peer-reviewed papers, most of which are published in top-tier conferences and journals, like ICML, NeurIPS, CVPR, ICCV, ECCV, AAAI, ACL, ACMMM, ICLR, TPAMI, AI, IJCV, TIP. I received the CVPR 2019 Best Paper Nominee and was awarded the 2022 ACM China Rising Star Award (Guangzhou Chapter). I have been elected among Top 2% Scientists worldwide 2023 by Stanford university. [CV]

Research

I am interested in several topics in computer vision and machine learning. Specifically, my current research focuses on creative AI, such as image/video synthesis and enhancement.

Updates

  • 2024/04:   Our team win two championship awards in the two tracks of NTIRE 2024 challenges (Bracketing Image Restoration and Enhancement Challenge - Track 1 & 2) in conjunction with CVPR2024.
  • 2024/04:   Paper to appear in IEEE TGRS.
  • 2024/03:   Together with Samsung, we win the 2rd place in the competition of Few-shot RAW Image Denoising @ MIPI2024 in conjunction with CVPR2024, see the award.
  • 2024/03:   Paper to appear in TCSVT.
  • 2024/03:   Paper to appear in IJCV.
  • 2024/02:   Two papers accepted by CVPR 2024.
  • 2024/02:   Paper accepted by COLING 2024 (multi-modal in-context learning).
  • 2024/02:   I will be joining the Hong Kong University of Science and Technology (HKUST) as Associate Professor in 2024 spring/summer.
  • 2024/01:   Paper accepted by ICLR 2024.
  • 2023/12:   Paper to appear in TNNLS (blind face restoration).
  • 2023/12:   Invited to serve as Senior Program Committe member (SPC) for IJCAI 2024.
  • 2023/11:   Paper to appear in TCSVT (under-display camera face image restoration).
  • 2023/10:   Elected among Top 2% Scientists worldwide 2023 by Stanford University.

Show more

Publications [Full List]

(* indicates equal contribution, + indicates intern/student working with me, # indicates correspondence)

  • GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions,

    Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo#, Bjorn Stenger, Tong Lu, Tae-Kyun Kim, Wei Liu, Hongdong Li,

    International Journal of Computer Vision (IJCV), to appear.

    [PDF] [Code]

  • Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation,

    Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2024.

    [PDF] [Code]

  • Context-Aware Integration of Language and Visual References for Natural Language Tracking,

    Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2024.

    [PDF] [Code]

  • Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost,

    Yuan Gao, Weizhong Zhang, Wenhan Luo, Lin Ma, Jin-Gang Yu, Gui-Song Xia, Jiayi Ma,

    International Conference on Learning Representations (ICLR), 2024.

    [PDF] [Code]

  • Punctuation-level Attack: Single-shot and Single Punctuation Can Fool Text Models,

    Wenqiang Wang, Chongyang Du, Tao Wang, Kaihao Zhang, Wenhan Luo#, Lin Ma, Wei Liu, Xiaochun Cao,

    Neural Information Processing Systems (NeurIPS), 2023.

    [PDF]

  • FnF Attack: Adversarial Attack against Multiple Object Trackers by Inducing False Negatives and False Positives,

    Tao Zhou, Qi Ye#, Wenhan Luo#, Kaihao Zhang, Zhiguo Shi, Jiming Chen,

    Proc. of International Conference on Computer Vision (ICCV), Paris, France, 2023.

    [PDF] [Project] [Code]

  • PRIOR: Prototype Representation Joint Learning from Medical Images and Reports,

    Pujin Cheng, Li Lin, Junyan Lyu, Yijin Huang, Wenhan Luo, Xiaoying Tang,

    Proc. of International Conference on Computer Vision (ICCV), Paris, France, 2023.

    [PDF] [Code]

  • MB-TaylorFormer: Mutil-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing,

    Yuwei Qiu, Kaihao Zhang, Chenxi Wang, Wenhan Luo, Hongdong Li, Zhi Jin,

    Proc. of International Conference on Computer Vision (ICCV), Paris, France, 2023.

    [PDF] [Code]

  • Homography Guided Temporal Fusion for Road Line and Marking Segmentation,

    Shan Wang, Chuong Nguyen, Jiawei Liu, Kaihao Zhang, Wenhan Luo, Yanhao Zhang, Sundaram Muthu, Fahira Afzal Maken, Hongdong Li,

    Proc. of International Conference on Computer Vision (ICCV), Paris, France, 2023.

    [PDF] [Code]

  • Robust Single Image Reflection Removal Against Adversarial Attacks,

    Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo#, Zhaoxin Fan, Wenqi Ren, Jianfeng Lu,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2023.

    [PDF] [Code]

  • Benchmarking Ultra-High-Definition Low-Light Image Enhancement and A Transformer Method,

    Tao Wang, Kaihao Zhang, Tianrun Shen, Wenhan Luo#, Bjorn Stenger, Tong Lu#,

    Proc. of the Association for the Advancement of Artificial Intelligence (AAAI), USA, 2023. (Oral)

    [PDF] [Code]

  • APPTracker: Improving Tracking Multiple Objects in Low-Frame-Rate Videos,

    Tao Zhou, Wenhan Luo, Zhiguo Shi, Jiming Chen, Qi Ye,

    The 30th ACM International Conference on Multimedia (ACM MM), 2022.

    [PDF] [Project Page]

  • EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset,

    Kaihao Zhang, Dongxu Li, Wenhan Luo, Jingyu Liu, Jiankang Deng, Wei Liu, Stefanos Zafeiriou,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 45, pp. 3968-3978, 2023.

    [PDF] [Github] [Project Page]

  • Deep Image Deblurring: A Survey,

    Kaihao Zhang, Wenqi Ren, Wenhan Luo, Wei-Sheng Lai, Bjorn Stenger, Ming-Hsuan Yang, Hongdong Li,

    International Journal of Computer Vision (IJCV), vol. 130, pp. 2103-2130, 2022.

    [PDF]

  • Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior,

    Kaihao Zhang+, Wenhan Luo#, Yanjiang Yu, Wenqi Ren, Fang Zhao, Changsheng Li, Lin Ma, Wei Liu, Hongdong Li,

    International Journal of Computer Vision (IJCV), vol. 130, pp. 1754-1769, 2022.

    [PDF] [Github]

  • Aesthetic Text Logo Synthesis via Content-aware Layout Inferring,

    Yizhi Wang+, Guo Pu, Wenhan Luo, Yexin Wang, Pengfei Xiong, Hongwen Kang, Zhouhui Lian,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2022.

    [PDF] [Dataset/Code]

  • Enhanced Spatio-Temporal Interaction Learning for Video Deraining: A Faster and Better Framework,

    Kaihao Zhang+, Dongxu Li, Wenhan Luo, Wenqi Ren, Wei Liu,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 45, pp. 1287-1293, 2023.

    [arXiv] [Dataset/Code]

  • Benchmarking Ultra-High-Definition Image Super-resolution,

    Kaihao Zhang+, Dongxu Li, Wenhan Luo, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang,

    Proc. of International Conference on Computer Vision (ICCV), 2021.

    [PDF] [Dataset]

  • Towards Distraction-Robust Active Visual Tracking,

    Fangwei Zhong+, Peng Sun, Wenhan Luo, Tingyun Yan, Yizhou Wang,

    International Conference on Machine Learning (ICML), 2021.

    [PDF] [Code]

  • Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis,

    Wen Liu+, Zhixin Piao, Zhi Tu, Wenhan Luo, Lin Ma, Shenghua Gao,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, pp. 5114-5132, 2022.

    [PDF] [Code]

  • Multiple Object Tracking: A Literature Review,

    Wenhan Luo, Junliang Xing, Anton Milan, Xiaoqin Zhang, Wei Liu, Tae-Kyun. Kim,

    Artificial Intelligence, vol. 293, pp. 103448, 2021.

    [PDF]

  • Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive,

    Kaihao Zhang+, Wenhan Luo, Bjorn Stenger, Wenqi Ren, Lin Ma, Hongdong Li,

    The 28th ACM International Conference on Multimedia (ACM MM), 2020. (Oral)

    [PDF]

  • Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding,

    Kaihao Zhang+, Wenhan Luo, Wenqi Ren, Jingwen Wang, Fang Zhao, Lin Ma, Hongdong Li,

    European Conference on Computer Vision (ECCV), UK, 2020.

    [PDF] [Dataset (zzkd)] [Code (ehl2)] [Results (yb4y)] [Github]

  • Deblurring by Realistic Blurring,

    Kaihao Zhang+, Wenhan Luo, Yiran Zhong, Lin Ma, Bjorn Stenger, Wei Liu, Hongdong Li,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2020. (Oral)

    [PDF] [Dataset/Code]

  • Fine-grained Image-to-Image Transformation towards Visual Recognition,

    Wei Xiong, Yutong He, Yixuan Zhang, Wenhan Luo, Lin Ma, Jiebo Luo,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2020.

    [PDF] [Project Page]

  • AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking,

    Fangwei Zhong+, Peng Sun, Wenhan Luo, Tingyun Yan, Yizhou Wang,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 43, pp. 1467-1482, 2021.

    [PDF] [Code] [Demo] [Dataset]

  • Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis,

    Wen Liu+, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao,

    Proc. of International Conference on Computer Vision (ICCV), Korea, 2019.

    [PDF] [Project Page] [Code] [Dataset]

  • Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video,

    Zhenfang Chen+, Lin Ma#, Wenhan Luo#, Kwan-Yee K Wong,

    The 57th Annual Meeting of the Association for Computational Linguistics (ACL), Italy, 2019. (Oral)

    [PDF] [Code]

  • Face Anti-Spoofing: Model Matters, So Does Data,

    Xiao Yang*, Wenhan Luo*, Linchao Bao, Yuan Gao, Dihong Gong, Shibao Zheng, Zhifeng Li, Wei Liu,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2019.

    [PDF]

  • Learning Joint Gait Representation via Quintuplet Loss Minimization,

    Kaihao Zhang+, Wenhan Luo, Lin Ma, Wei Liu, Hongdong Li,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2019. (Oral)

    [PDF]

  • Residual Regression with Semantic Prior for Crowd Counting,

    Jia Wan+, Wenhan Luo, Baoyuan Wu, Antoni Chan, Wei Liu,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2019.

    [PDF] [Project Page] [Code]

  • Learning to Compose Dynamic Tree Structures for Visual Contexts,

    Kaihua Tang+, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2019. (Oral & Best Paper Nominee)

    [arXiv] [Code]

  • Bi-Real Net: Binarizing Deep Network towards Real-Network Performance,

    Zechun Liu+, Wenhan Luo, Baoyuan Wu, Xin Yang, Wei Liu, Kwang-Ting Cheng,

    International Journal of Computer Vision (IJCV), vol. 128, pp. 202-219, 2020.

    [PDF] [arXiv] [Code]

  • AD-VAT: An Asymmetric Dueling Mechanism for Learning Visual Active Tracking,

    Fangwei Zhong+, Peng Sun, Wenhan Luo, Tingyun Yan, Yizhou Wang,

    International Conference on Learning Representations (ICLR), New Orleans, USA, 2019.

    [OpenReview Link] [Code] [Dataset]

  • End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning,

    Wenhan Luo*, Peng Sun*, Fangwei Zhong*, Wei Liu, Tong Zhang, Yizhou Wang,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 42, pp. 1317-1332, 2020.

    [arXiv] [Project Page] [Code]

  • Cousin Network Guided Sketch Recognition via Latent Attribute Warehouse,

    Kaihao Zhang+, Wenhan Luo#, Lin Ma, Hongdong Li,

    Proc. of the Association for the Advancement of Artificial Intelligence (AAAI), Hawaii, USA, 2019. (Spotlight)

    [PDF]

  • Bi-Real Net: Enhancing the Performance of 1-bit CNNs with Improved Representational Capability and Advanced Training Algorithm,

    Zechun Liu+, Baoyuan Wu, Wenhan Luo, Xin Yang, Wei Liu, Kwang-Ting Cheng,

    European Conference on Computer Vision (ECCV), Germany, 2018.

    [PDF] [Code]

  • End-to-end Active Object Tracking via Reinforcement Learning,

    Wenhan Luo*, Peng Sun*, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang,

    International Conference on Machine Learning (ICML), Sweden, 2018.

    [PDF] [Project Page] [Code] [Demo]

  • Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks,

    Wei Xiong+, Wenhan Luo, Lin Ma, Wei Liu, Jiebo Luo,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2018.

    [arXiv] [Project Page] [Code] [Dataset]

  • Real-Time Neural Style Transfer for Videos,

    Haozhi Huang+, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), USA, 2017.

    [PDF]

  • Automatic Topic Discovery for Multi-object Tracking,

    Wenhan Luo, Bjorn Stenger, Xiaowei Zhao, Tae-Kyun Kim,

    Proc. of the Association for the Advancement of Artificial Intelligence (AAAI), Austin, Texas, USA, 2015. (Oral)

    [PDF]

  • Bi-label Propagation for Generic Multiple Object Tracking,

    Wenhan Luo, Tae-Kyun Kim, Bjorn Stenger, Xiaowei Zhao, Roberto Cipolla,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Columbus, Ohio, USA, 2014.

    [PDF]

  • Unified Face Analysis by Iterative Multi-Output Random Forests,

    Xiaowei Zhao, Tae-Kyun Kim, Wenhan Luo,

    Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Columbus, Ohio, USA, 2014.

    [PDF]

  • Single and Multiple Object Tracking Using Log-Euclidean Riemannian Subspace and Block-Division Appearance Model,

    Weiming Hu, Xi Li, Wenhan Luo, Xiaoqin Zhang, Steve Maybank, Zhongfei Zhang,

    IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 34, no. 12, pp. 2420-2440, 2012.

    [PDF]