Welcome!
I am currently a Ph.D. student (since Fall 2023) at the Hong Kong University of Science and Technology, Guangzhou, under the supervision of Prof. Ying-Cong Chen. Prior to this, I began my research journey as a Master's student in 2020 at the MAC Lab of Xiamen University and received my Master's degree in 2023, advised by Prof. Rongrong Ji.
My research focuses on visual generative models, particularly their versatility (Lotus, Lotus-2), controllability (DisEnvisioner), and efficiency (PixelFolder). I am currently working on video generation and world models, with an emphasis on both realism and efficiency.
Feel free to contact me via e-mail if you are interested in discussing research or collaborating.
Internships
- 2026.01 - Present, Kuaishou, Kling Team, Shenzhen, China.
- 2021.08 - 2022.01, Tencent Youtu Lab, Shanghai, China.
Publications
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model
Jing He, Haodong Li, Mingzhi Sheng, Ying-Cong Chen†
arXiv 2025
Paper / Project Page / Github / Demo (D) / Demo (N)
LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He*†, Haodong Li*, Wei Yin, Yixun Liang, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen†
ICLR 2025
Paper / Project Page / Github / Demo (Depth) / Demo (Normal) / ComfyUI
DisEnvisioner: Disentangled and Enriched Visual Prompt for Image Customization
Jing He*†, Haodong Li*, Yongzhe Hu, Guibao Shen, Yingjie Cai, Weichao Qiu, Ying-Cong Chen†
ICLR 2025
Paper / Project Page / Github / Demo (Soon)
PixelFolder: An efficient progressive pixel synthesis network for image generation
Jing He, Yiyi Zhou†, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji
ECCV 2022
Paper / Github
DA$^2$: Depth Anything in Any Direction
Haodong Li, Wangguangdong Zheng, Jing He, Yuhao Liu, Xin Lin, Xin Yang, Ying-Cong Chen, Chunchao Guo
ICLR 2026
Paper / Project Page / GitHub / Demo / Data
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
Guibao Shen, Yihua Du, Wenhang Ge, Jing He, Chirui Chang, Donghao Zhou, Zhen Yang, Luozhou Wang, Xin Tao, Ying-Cong Chen†
arXiv 2025
Paper / Project Page / Github / Model
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction
Leheng Li, Weichao Qiu, Xu Yan, Jing He, Kaiqiang Zhou, Yingjie Cai, Qing Lian, Bingbing Liu, Ying-Cong Chen†
arXiv 2024
arXiv / Project Page / Github / Model
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun†
AAAI 2024
Paper / Github
Learning dynamic prior knowledge for text-to-face pixel synthesis
Jun Peng, Xiaoxiong Du, Yiyi Zhou, Jing He, Yunhang Shen, Xiaoshuai Sun, Rongrong Ji†
ACM MM 2022
Paper / Github
Towards open-ended text-to-face generation, combination and manipulation
Jun Peng, Han Pan, Yiyi Zhou, Jing He, Xiaoshuai Sun, Yan Wang, Yongjian Wu, Rongrong Ji†
ACM MM 2022
Paper / Github
Services
Reviewer:
- ICML: 2026
- ICCV/ECCV: 2025, 2026
- NeurIPS: 2025, 2026
- AAAI: 2024
- ICLR: 2025, 2026
- CVPR: 2026
Education
- 2023.09 - Present, Ph.D. Student, The Hong Kong University of Science and Technology, Guangzhou. Advisor: Ying-Cong Chen.
- 2020.09 - 2023.06, Master of Engineering, Xiamen University. Advisor: Rongrong Ji.
- 2016.09 - 2020.06, Bachelor of Engineering, Wuhan Institute of Technology.
Honors and Awards
- 2023-2025 Postgraduate Scholarship, HKUST-GZ.
- 2020 Outstanding Graduate Student, WIT.
- 2016-2020 First-Class Scholarship, WIT.