2023.09 - Now Postdoc, PolyU
2017.09 - 2023.06 Ph.D., Wuhan University
2021.02 - 2022.06 Research Intern, MSRA
2019.07 - 2019.09 Intern, Tencent
2013.09 - 2017.06 B.Eng., Wuhan University
Personal Website Google ScholarComputer Vision, Generative Model, Quality Assessment
Yaosi Hu, Zhenzhong Chen, Chong Luo. LaMD: Latent Motion Diffusion for Image-Conditional Video Generation. IJCV, 2025. https://doi.org/10.1007/s11263-025-02386-7.
Binyuan Huang*, Yuqing Wen*, Yucheng Zhao*, Yaosi Hu*, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang. SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control. AAAI, 2025. (Co-first Authors)
Yaosi Hu, Chong Luo, Zhenzhong Chen. A Benchmark for Controllable Text‑Image‑to‑Video Generation. IEEE TMM, 2023. https://doi.org/10.1109/TMM.2023.3284989.
Yaosi Hu, Chong Luo, Zhenzhong Chen. Make It Move: Controllable Image-to-Video Generation with Text Descriptions. CVPR, 2022. https://doi.org/10.1109/CVPR52688.2022.01768.
Yaosi Hu, Zhenzhong Chen, Zheng‑Jun Zha, Feng Wu, Hierarchical Global‑Local Temporal Modeling for Video Captioning. ACM MM, 2019. https://doi.org/10.1145/3343031.3351072.