I am currently a visiting student at NLPR, Institute of Automation, Chinese Academy of Sciences (CASIA), supervised by Prof. Zhaoxiang Zhang. I am also a final-year MPhil student at BUPT in
China, supervised by Prof. Yonggang Qi. Before that, I
obtained my Bachelor's degree in Electronics Information Science and Technology from BUPT in 2022.
My research interests revolve around computer vision and generative
models, with a particular focus on image generation, video
generation, and 3D content creation.
NOVA is a non-quantized autoregressive model that enables efficient video generation
by reformulating video generation as frame-by-frame and set-by-set prediction.
See3D is a scalable visual-conditional multi-view diffusion (MVD) model for open-world 3D creation that can be trained on
web-scale video collections without camera pose annotations.
GeoDream is a 3D generation method that integrates explicit generalized 3D priors with 2D
diffusion priors to obtain unambiguous, 3D-consistent geometric
structures without sacrificing diversity or fidelity.
SketchKnitter is a vectorized sketch generation method that reverses the stroke
deformation process using a diffusion model learned from real sketches, producing
higher-quality, visually appealing sketches with fewer sampling steps.