I am currently a researcher at Baidu AMU. I received my B.S. and M.S. degrees from Tongji University, School of Automotive Engineering.

Since 2024, I have been working at Baidu VIS(now AMU), where I collaborate closely with Hang Zhou and Kaisiyuan Wang on high-fidelity and efficient human-centric video generation, with a particular focus on audio-driven and video-driven synthesis.

Previously, I worked at the Intelligent Driving Group (IDG), Baidu Inc., on end-to-end autonomous driving with Xiaoqing Ye and Yifu Zhang.

My research interests lie in human-centric generation, co-speech gesture synthesis, human video synthesis, and compositional video generation.

🔥 News

2025.07: GestureHYDRA was accepted to ICCV 2025.
2023.06: We won 4th place in the CVPR 2023 3D Occupancy Prediction Challenge.

📝 Publications

ICCV 2025

GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation

Quanwei Yang, Luying Huang, Kaisiyuan Wang†, Jiazhi Guan, Shengyi He, Fengluo Li, Hang Zhou, Lingyun Yu, Yingying Li, Haocheng Feng, Hongtao Xie†

[Project Page] [Code] [BibTeX]

ICCV 2025. Equal contribution. † Corresponding author.

arXiv 2026

ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration

Fengyuan Yang, Luying Huang†, Jiazhi Guan*, Quanwei Yang, Dongwei Pan, Jianglin Fu, Haocheng Feng, Wei He, Kaisiyuan Wang, Hang Zhou*, Angela Yao

[Project Page] [BibTeX]

arXiv preprint. † Project leader. * Corresponding authors.

arXiv 2026

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Jiazhi Guan, Quanwei Yang, Luying Huang, Junhao Liang, Borong Liang, Haocheng Feng, Wei He, Kaisiyuan Wang†, Hang Zhou†, Jingdong Wang

[Project Page] [BibTeX]

arXiv preprint. Equal contribution. † Corresponding author.

arXiv 2026

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Dongwei Pan, Longwei Guo, Jiazhi Guan, Luying Huang, Yiding Li, Haojie Liu, Haocheng Feng, Wei He, Kaisiyuan Wang, Hang Zhou

[Project Page] [BibTeX]

arXiv preprint.

CVPR Challenge 2023

Multi-Scale Occ: 4th Place Solution for CVPR 2023 3D Occupancy Prediction Challenge

Yangyang Ding, Luying Huang, Jiachen Zhong

[Challenge] [BibTeX]

CVPR 2023 3D Occupancy Prediction Challenge. Equal contribution.

🎖 Honors and Awards

2025.07 ICCV 2025 paper acceptance for GestureHYDRA.
2023.06 4th Place, CVPR 2023 3D Occupancy Prediction Challenge.

📖 Educations

Tongji University, School of Automotive Engineering, B.S. and M.S.

💼 Experiences

2024. - Present, Researcher, Baidu AMU.
Previously, Intelligent Driving Group (IDG), Baidu Inc.

🛠 Selected Projects

Baidu NOVA Digital Human, including high-fidelity digital human systems for long-form live streaming.
Baidu Apollo ADFM, end-to-end autonomous driving large model research and development.