Publication | Wayne Wu

2026

UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos

Mingxuan Liu*, Honglin He*, Elisa Ricci, Wayne Wu, and Bolei Zhou

International Conference on Learning Representations (ICLR), 2026

arXiv Project Page
From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning

Honglin He, Yukai Ma, Wayne Wu, and Bolei Zhou

International Conference on Learning Representations (ICLR), 2026

arXiv Project Page
Joint Optimization for 4D Human-Scene Reconstruction in the Wild

Zhizheng Liu, Joe Lin, Wayne Wu, and Bolei Zhou

International Conference on Learning Representations (ICLR), 2026

arXiv Project Page
Learning Sidewalk Autopilot from Multi-Scale Imitation with Corrective Behavior Expansion

Honglin He, Yukai Ma, Brad Squicciarini, Wayne Wu, and Bolei Zhou

International Conference on Robotics and Automation (ICRA), 2026
AURA: Multi-modal Shared Autonomy for Urban Navigation

Yukai Ma, Honglin He, Selina Song, Wayne Wu, and Bolei Zhou

Computer Vision and Pattern Recognition (CVPR), 2026

2025

Towards Autonomous Micromobility through Scalable Urban Simulation

Wayne Wu *, Honglin He*, Chaoyuan Zhang, Jack He, Seth Z. Zhao, Ran Gong, Quanyi Li, and Bolei Zhou

Computer Vision and Pattern Recognition (CVPR), 2025

arXiv Project Page
Highlight
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation

Ziyang Xie, Zhizheng Liu, Zhenghao Peng, Wayne Wu, and Bolei Zhou

Computer Vision and Pattern Recognition (CVPR), 2025

arXiv Project Page
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

Yuhan Wang, Fangzhou Hong, Shuai Yang, Liming Jiang, Wayne Wu, and Chen Change Loy

Computer Vision and Pattern Recognition (CVPR), 2025

arXiv Project Page
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Zhizheng Liu, Joe Lin, Wayne Wu, and Bolei Zhou

International Conference on Learning Representations (ICLR), 2025

arXiv Project Page
MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility

Wayne Wu *, Honglin He*, Jack He, Yiran Wang, Chenda Duan, Zhizheng Liu, Quanyi Li, and Bolei Zhou

International Conference on Learning Representations (ICLR), 2025

arXiv YouTube Project Page
Spotlight

2024

Parameterization-driven Neural Surface Reconstruction for Object-oriented Editing in Neural Rendering

Baixin Xu, Jiangbei Hu, Fei Hou, Kwan-Yee Lin, Wayne Wu, Chen Qian, and Ying He

European Conference on Computer Vision (ECCV), 2024

arXiv Project Page
CosmicMan: A Text-to-Image Foundation Model for Humans

Shikai Li, Jianglin Fu, Kaiyuan Liu, Wentao Wang, Kwan-Yee Lin, and Wayne Wu †

Computer Vision and Pattern Recognition (CVPR), 2024

arXiv YouTube Project Page
Highlight
PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation

Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, and Wayne Wu †

Association for the Advancement of Artificial Intelligence (AAAI), 2024

arXiv
ReliTalk: Relightable Talking Portrait Generation from a Single Video

Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, and Ziwei Liu

International Journal of Computer Vision (IJCV), 2024

arXiv YouTube Project Page
VLG: General Video Recognition with Web Textual Knowledge

Jintao Lin, Zhaoyang Liu, Wenhai Wang, Wayne Wu, and Limin Wang

International Journal of Computer Vision (IJCV), 2024

arXiv

2023

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks

Zhuo Chen, Xudong Xu, et al., Wayne Wu, Bo Dai, and Xiaokang Yang

Technical report, arXiv:2304.09463, 2023

arXiv
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

Contributors to RenderMe-360

Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks, 2023

arXiv YouTube Project Page
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

Contributors to DNA-Rendering

International Conference on Computer Vision (ICCV), 2023

arXiv YouTube Project Page
SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling

Zhitao Yang, Zhongang Cai, Haiyi Mei, Shuai Liu, Zhaoxi Chen, Weiye Xiao, Yukun Wei, Zhongfei Qing, Chen Wei, Bo Dai, Wayne Wu, Chen Qian, Dahua Lin, Ziwei Liu, and Lei Yang

International Conference on Computer Vision (ICCV), 2023

arXiv YouTube Project Page
UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation

Jianglin Fu, Shikai Li, Yuming Jiang, Kwan-Yee Lin, Ziwei Liu, and Wayne Wu †

International Conference on Computer Vision (ICCV), 2023

YouTube Project Page
OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs

Honglin He, Zhuoqian Yang, Shikai Li, Bo Dai, and Wayne Wu †

International Conference on Computer Vision (ICCV), 2023

YouTube Project Page
3DHumanGAN: Towards Photo-Realistic 3D-Aware Human Image Generation

Zhuoqian Yang, Shikai Li, Wayne Wu†, and Bo Dai

International Conference on Computer Vision (ICCV), 2023

arXiv YouTube Project Page
MotionBERT: Unified Pretraining for Human Motion Analysis

Wentao Zhu, Xiaoxuan Ma, Zhaoyang Liu, Libin Liu, Wayne Wu, and Yizhou Wang

International Conference on Computer Vision (ICCV), 2023

arXiv Project Page
Text2Performer: Text-Driven Human Video Generation

Yuming Jiang, Shuai Yang, Tong Liang Koh, Wayne Wu, Chen Change Loy, and Ziwei Liu

International Conference on Computer Vision (ICCV), 2023

arXiv YouTube Project Page
Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis

Yuxin Wang, Wayne Wu, and Dan Xu

International Conference on Computer Vision (ICCV), 2023

arXiv Project Page
MonoHuman: Animatable Human Neural Field from Monocular Video

Zhengming Yu, Wei Cheng, Xian Liu, Wayne Wu, and Kwan-Yee Lin

Conference on Computer Vision and Pattern Recognition (CVPR), 2023

arXiv Project Page
CelebV-Text: A Large-Scale Facial Text-Video Dataset

Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, and Wayne Wu †

Conference on Computer Vision and Pattern Recognition (CVPR), 2023

arXiv Project Page
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, and Ziwei Liu

Conference on Computer Vision and Pattern Recognition (CVPR), 2023

arXiv Project Page
Best Paper Candidate
Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation

Haoyue Cheng, Zhaoyang Liu, Wayne Wu, and Limin Wang

International Conference on Learning Representations (ICLR), 2023

Project Page

2022

Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis

Wei Cheng, Su Xu, Jingtan Piao, Chen Qian, Wayne Wu, Kwan-Yee Lin, and Hongsheng Li

Technical report, arXiv:2204.11798, 2022

arXiv YouTube Project Page
StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3

Haonan Qiu, Yuming Jiang, Hang Zhou, Wayne Wu, and Ziwei Liu

Technical report, arXiv:2208.07862, 2022

arXiv YouTube Project Page
Audio-Driven Co-Speech Gesture Video Generation

Xian Liu, Qianyi Wu, Hang Zhou, Yuanqi Du, Wayne Wu, Dahua Lin, and Ziwei Liu

Neural Information Processing Systems (NeurIPS), 2022

arXiv Project Page
Spotlight
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis

Long Zhuo, Guangcong Wang, Shikai Li, Wayne Wu, and Ziwei Liu

European Conference on Computer Vision (ECCV), 2022

arXiv YouTube Project Page
StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Jianglin Fu, Shikai Li, Yuming Jiang, Kwan-Yee Lin, Chen Qian, Chen Change Loy, Wayne Wu†, and Ziwei Liu

European Conference on Computer Vision (ECCV), 2022

arXiv YouTube Project Page
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, and Bolei Zhou

European Conference on Computer Vision (ECCV), 2022

arXiv Project Page
Oral
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Hao Zhu*, Wayne Wu *†, Wentao Zhu, Liming Jiang, Siwei Tang, Li Zhang, Ziwei Liu, and Chen Change Loy

European Conference on Computer Vision (ECCV), 2022

arXiv YouTube Project Page
Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing

Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, and Limin Wang

European Conference on Computer Vision (ECCV), 2022

arXiv
Text2Human: Text-Driven Controllable Human Image Generation

Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy, and Ziwei Liu

ACM Transaction on Graphics (SIGGRAPH), 2022

arXiv YouTube Project Page
EAMM: One-Shot Emotional Talking Face via Audio-based Emotion-Aware Motion Model

Xinya Ji, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Wayne Wu†, Feng Xu, and Xun Cao

ACM Transaction on Graphics (SIGGRAPH), 2022

arXiv YouTube
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

Yanbo Xu, Yueqin Yin, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, and Wayne Wu †

Conference on Computer Vision and Pattern Recognition (CVPR), 2022

arXiv Project Page
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, and Bolei Zhou

Conference on Computer Vision and Pattern Recognition (CVPR), 2022

arXiv YouTube Project Page
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, and Limin Wang

Conference on Computer Vision and Pattern Recognition (CVPR), 2022

arXiv
MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks

Wentao Zhu, Zhuoqian Yang, Ziang Di, Wayne Wu†, Yizhou Wang, and Chen Change Loy

Association for the Advancement of Artificial Intelligence (AAAI), 2022

arXiv Project Page
Everybody’s Talkin’: Let Me Talk as You Want

Linsen Song, Wayne Wu, Chen Qian, Ran He, and Chen Change Loy

Transactions on Information Forensics and Security (TIFS) 2022

arXiv YouTube Project Page
DeepFakes Detection: The DeeperForensics Dataset and Challenge

Liming Jiang, Wayne Wu, Chen Qian, and Chen Change Loy

Handbook of Digital Face Manipulation and Detection, Springer, 2022

Project Page
Talking Faces: Audio-to-Video Face Generation

Yuxin Wang, Linsen Song, Wayne Wu, Chen Qian, Ran He, and Chen Change Loy

Handbook of Digital Face Manipulation and Detection, Springer, 2022

Project Page

2021

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

Liming Jiang, Bo Dai, Wayne Wu, and Chen Change Loy

Neural Information Processing System (NeurIPS), 2021

arXiv YouTube Project Page
Focal Frequency Loss for Image Reconstruction and Synthesis

Liming Jiang, Bo Dai, Wayne Wu, and Chen Change Loy

International Conference on Computer Vision (ICCV), 2021

arXiv Project Page
TAM: Temporal Adaptive Module for Video Recognition

Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, and Tong Lu

International Conference on Computer Vision (ICCV), 2021

arXiv Project Page
Everything’s Talkin’: Pareidolia Face Reenactment

Linsen Song*, Wayne Wu *, Chaoyou Fu, Chen Qian, Chen Change Loy, and Ran He

Conference on Computer Vision and Pattern Recognition (CVPR), 2021

arXiv YouTube Project Page
Audio-Driven Emotional Video Portraits

Xinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu†, Chen Change Loy, Xun Cao, and Feng Xu

Conference on Computer Vision and Pattern Recognition (CVPR), 2021

arXiv YouTube Project Page
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, and Ziwei Liu

Conference on Computer Vision and Pattern Recognition (CVPR), 2021

arXiv YouTube Project Page

2020

AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection

Hao Zhu, Chaoyou Fu, Qianyi Wu, Wayne Wu, Chen Qian, and Ran He

Neural Information Processing System (NeurIPS), 2020

arXiv
Bi-directional Cross-Modality Feature Propagation with SA Gate for RGB-D Semantic Segmentation

Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, and Gang Zeng

European Conference on Computer Vision (ECCV), 2020

arXiv Project Page
MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation

Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu†, Chen Qian, Ran He, Yu Qiao, and Chen Change Loy

European Conference on Computer Vision (ECCV), 2020

YouTube PDF Project Page
DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection

Liming Jiang, Ren Li, Wayne Wu, Chen Qian, and Chen Change Loy

Conference on Computer Vision and Pattern Recognition (CVPR), 2020

arXiv YouTube Project Page
TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting

Zhuoqian Yang*, Wentao Zhu*, Wayne Wu *, Chen Qian, Qiang Zhou, Bolei Zhou, and Chen Change Loy

Conference on Computer Vision and Pattern Recognition (CVPR), 2020

arXiv YouTube Project Page

2019

TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation

Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, and Chen Change Loy

Conference on Computer Vision and Pattern Recognition (CVPR), 2019

arXiv Project Page
FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos

Keqiang Sun, Wayne Wu, Tinghao Liu, Shuo Yang, Quan Wang, Qiang Zhou, Zuochang Ye, and Chen Qian

International Conference on Computer Vision (ICCV), 2019

arXiv Project Page
Aggregation via Separation: Boosting Facial Landmark Detector with Self-Supervised Style Transition

Shengju Qian, Keqiang Sun, Wayne Wu, Chen Qian, and Jiaya Jia

International Conference on Computer Vision (ICCV), 2019

arXiv
Make a Face: Towards Arbitrary High Fidelity Face Manipulation

Shengju Qian, Kwan-Yee Lin, Wayne Wu, Yangxiaokang Liu, Quan Wang, Fumin Shen, Chen Qian, and Ran He

International Conference on Computer Vision (ICCV), 2019

arXiv

2018

ReenactGAN: Learning to Reenact Faces via Boundary Transfer

Wayne Wu, Yunxuan Zhang, Cheng Li, Chen Qian, and Chen Change Loy

European Conference on Computer Vision (ECCV), 2018

arXiv YouTube Project Page
Look at Boundary: A Boundary-Aware Face Alignment Algorithm

Wayne Wu, Chen Qian, Shuo Yang, Quan Wang, Yici Cai, and Qiang Zhou

Conference on Computer Vision and Pattern Recognition (CVPR), 2018

arXiv YouTube Project Page