Hi, this is Xiaojie Xu(徐 啸捷). I am currently an M.Phil. student in Artificial Intelligence at The Hong Kong University of Science and Technology, Guangzhou advised by Prof. Ying-Cong Chen. Prior, I received a Bachelor’s degree in Automation from University of Science and Technology of China, advised by Prof. Ligang Liu.
My research focuses on Multimodal Understanding and Generation, including text, images, videos, and 3D data.
I am always open to interesting research topics. Please feel free to contact me if you want to collaborate🤠.
📝 Publications
* indicates equal contributions. For a complete list of publications, please refer to my Google Scholar profile.

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Ziqi Huang*, Fan Zhang*, Xiaojie Xu, Yinan He, Jiashuo Yu, Ziyue Dong, Qianli Ma, Nattapol Chanpaisit, Chenyang Si, Yuming Jiang, Yaohui Wang, Xinyuan Chen, Ying-Cong Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu
Submitted to some journal, Github stars > 1k

PreGenie: An Agentic Framework for High-quality Visual Presentation Generation
Xiaojie Xu, Xinli Xu, Sirui Chen, Haoyu Chen, Fan Zhang, Ying-Cong Chen
Conference on Empirical Methods in Natural Language Processing(EMNLP), Findings

Long-Video Audio Synthesis with Multi-Agent Collaboration
Yehang Zhang*, Xinli Xu*, Xiaojie Xu*, Doudou Zhang, Li Liu, Ying-Cong Chen
Conference on Empirical Methods in Natural Language Processing(EMNLP), Main

POSTA: A Go-to Framework for Customized Artistic Poster Generation
Haoyu Chen*, Xiaojie Xu*, Wenbo Li, Jingjing Ren, Tian Ye, Songhua Liu, Ying-Cong Chen, Lei Zhu, Xinchao Wang
Conference on Computer Vision and Pattern Recognition(CVPR)

Momentum Auxiliary Network for Supervised Local Learning
Junhao Su, Changpeng Cai, Feiyu Zhu, Chenghao He, Xiaojie Xu, Dongzhi Guan, Chenyang Si
European Conference on Computer Vision(ECCV), Oral Presentation, Top 2.3%

Xiaojie Xu, Tianshuo Xu, Fulong Ma and Ying-Cong Chen
International Conference on Robotics and Automation(ICRA)

3DCaricShop: A Dataset and A Baseline Method for Single-view 3D Caricature Face Reconstruction
Yuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, and Xiaoguang Han
Conference on Computer Vision and Pattern Recognition(CVPR)
📖 Education
- Bachelor of Engineering(B.Eng.) in Automation, University of Science and Technology of China
- Master of Philosophy(M.Phil.) in Artificial Intelligence, The Hong Kong University of Science and Technology, Guangzhou
💻 Research Experiences
- 2023.10 – 2024.05, Generative Models, Nanyang Technological University, with Prof. Chenyang Si and Prof. Ziwei Liu
- 2022.06 – 2023.03, Digital Human, Tencent AI Lab, with Dr. Di Kang and Prof. Linchao Bao
- 2021.06 – 2021.12, Autonomous Driving, Tsinghua University, with Prof. Hang Zhao
🎖 Honors and Awards
- Postgraduate Scholarship Award at HKUST
- Outstanding Undergraduate Scholarship Award at USTC
- Chinese Physics Olympiad(CPhO). First prize in Jiangxi Province