The CVNext lab focuses on advancing general-purpose embodied intelligence, building upon foundations in long video
understanding and reasoning in dynamic, complex scenes. The core objective is to develop open, adaptive embodied agents
that tightly integrate environment perception, interactive reasoning, and personalized adaptation and decision-making.
Ultimately, the research aims to establish both theoretical frameworks and practical systems for general and
domain-specific embodied agents, contributing to scalable, transferable, and real-world embodied AI. Our main research
directions include:
- Interactive 3D Scene Reconstruction and Generation
- Unified World-Reasoning-Action Modeling for Embodied Agents
- Personalized Adaptation with Active Perception
Professor
Gaoang Wang [Web]
Assistant Professor
Office: C417, ZJUI Building
Email: gaoangwang@intl.zju.edu.cn
Research Interests:
- Visual Perception
- Transfer Learning
- Spatial Intelligence
- Embodied Intelligence
News:
[Apr. 2026] One paper was accepted by Findings of ACL 2026. Congratulations to Xuexiang!
[Mar. 2026] One paper was accepted by TMM, 2026. Congratulations to Guanhong!
[Mar. 2026] One paper was accepted by IJCV, 2026. Congratulations to Wenhao!
[Mar. 2026] Two papers were accepted by ICME 2026. Congratulations to Wendi and Xiaohan!
[Feb. 2026] One paper was accepted by TVCG, 2026. Congratulations to Chenlu!
[Feb. 2026] Two papers were accepted by CVPR 2026. Congratulations to Zhonghan and Jingyu!
[Jan. 2026] One paper was accepted by ICLR 2026. Congratulations to Zhonghan!
[Jan. 2026] One paper was accepted by ICASSP 2026.
[Nov. 2025] One paper was accepted by IJCV, 2026. Congratulations to Chenlu!
[Nov. 2025] Three papers were accepted by AAAI 2026, including one oral paper. Congratulations to Wenhao, Xuan, and Junsheng!
[Oct. 2025] We received an outstanding paper award at the ICCV KnowledgeMR Workshop. Congratulations to Enxin!
[Aug. 2025] One paper was accepted by TPAMI, 2025. Congratulations to Enxin!
[Jul. 2025] One paper was accepted by ECAI 2025. Congratulations to Boyi and Zhonghan!
[Jul. 2025] One paper was accepted by ICCV Findings Workshop, 2025. Congratulations to Enxin!
[Jun. 2025] One paper was accepted by TIP, 2025.
[Jun. 2025] One paper was accepted by ICCV 2025. Congratulations to Weili and Enxin!
[May 2025] One paper was accepted by Information Fusion, 2025. Congratulations to Xiaoyue!
[May 2025] One paper was accepted by ICML 2025.
[Apr. 2025] One paper was accepted by CVPR Workshop on Urban Scene Modeling, 2025. Congratulations to Jie!
[Mar. 2025] One paper was accepted by TVCG, 2025. Congratulations to Zhonghan!
[Feb. 2025] One paper was accepted by TCSVT, 2025.
[Feb. 2025] One paper was accepted by CVPR 2025. Congratulations to Xuan!
[Jan. 2025] One paper was accepted by MIA, 2025. Congratulations to Chenlu!
[Jan. 2025] One paper was accepted by TMM, 2025. Congratulations to Shidong!
[Dec. 2024] Two papers were accepted by ICASSP 2025. Congratulations to Xiaoyue and Jingyu!
[Dec. 2024] One paper was accepted by AAAI 2025.
[Sep. 2024] One paper was accepted by NeurIPS 2024.
[Jul. 2024] One paper was accepted by MICCAI Workshop on Deep Generative Models, 2024. Congratulations to Xiaoyue!
[Jun. 2024] Two papers were accepted by ACM MM 2024. Congratulations to Shengyu, Xuechen, and Wenhao!
[Jun. 2024] One paper was accepted by ECCV 2024. Congratulations to Zhonghan and Wenhao!
[Jun. 2024] One paper was accepted by PRCV 2024. Congratulations to Zhenyu and Wenhao!
[Apr. 2024] One paper was accepted by TMM, 2024. Congratulations to Chenlu!
[Mar. 2024] "Long-term Video Question Answering Competition (LOVEU@CVPR'24 Track 1)" was released. More details can be found here.
[Mar. 2024] One paper was accepted by ICLR Workshop on LLM Agents, 2024. Congratulations to Zhonghan!
[Feb. 2024] Three papers were accepted by CVPR 2024. Congratulations to Enxin, Wenhao, and Chenlu!
[Dec. 2023] Two papers were accepted by ICASSP 2024. Congratulations to Xuechen and Zhenyu!
[Dec. 2023] Two papers were accepted by AAAI 2024. Congratulations to Meiqi!
[Dec. 2023] One paper was accepted by Neurocomputing, 2023. Congratulations to Guanhong!
[Sep. 2023] One paper was accepted by IJCV, 2023. Congratulations to Shengyu!
[Sep. 2023] One paper was accepted by TMM, 2023. Congratulations to Shidong!
[Aug. 2023] One paper was accepted by PRCV 2023. Congratulations to Xuan!
[Jul. 2023] Three papers were accepted by ICCV 2023. Congratulations to Wenhao!
[Jun. 2023] One paper was accepted by MICCAI 2023. Congratulations to Chenlu!
[May 2023] One paper was accepted by Findings of ACL 2023. Congratulations to Qi!
[Apr. 2023] Two papers were accepted by IJCAI 2023.
[Apr. 2023] One paper was accepted by the CVPR Workshop on Computer Vision for Fashion, Art, and Design, 2023. Congratulations to Shidong!
[Mar. 2023] One paper was accepted by ICME 2023. Congratulations to Wenhao!
[Mar. 2023] One paper was accepted by ICASSP 2023.
[Feb. 2023] One paper was accepted by CVPR 2023.
[Feb. 2023] One paper was accepted by TAI, 2023. Congratulations to Wenhao!
[Nov. 2022] One paper was accepted by TMI, 2022.
[Jul. 2022] One paper was accepted by ECCV 2022.
[Apr. 2022] One paper was accepted by the 2nd CVPR Workshop on Sketch-Oriented Deep Learning, 2022. Congratulations to Kairong!
[Mar. 2022] One paper was accepted by ICME 2022. Congratulations to Guanhong!
[Jan. 2022] One paper was accepted by TMM, 2022.
[Aug. 2021] One paper was accepted by CVIU, 2021. Congratulations to Shengyu!
[Jul. 2021] One paper was accepted by ICCV 2021.
[Apr. 2021] One paper was accepted by the CVPR Workshop on Autonomous Driving, 2021.
[Jan. 2021] ROD2021 Challenge @ICMR 2021 was released.
Ph.D. Students
guanhongwang@zju.edu.cn
Multi-modality Learning
Video Understanding
Vision and Language
whu@zju.edu.cn
3D Vision
Generative Models
Anomaly Detection
Zhonghan Zhao
zhaozhonghan@zju.edu.cn
Embodied AI
Reinforcement Learning
In-context Learning
Chenlu Zhan
(Main Advisor: Hongwei Wang)
chenlu.22@intl.zju.edu.cn
Medical Vision Language
Medical Multimodality
Visual-Language Pretraining
Wendi Hu
3200105651@zju.edu.cn
Multi-object Tracking
Kewei Wei
3200104125@zju.edu.cn
Multi-modality Learning
Tielong Cai
tielong.22@intl.zju.edu.cn
Generative Models
Embodied AI
Master Students
Enxin Song [Web]
enxin.23@intl.zju.edu.cn
Video Understanding
Image Generation
Xuan Wang
xuanw@zju.edu.cn
Multi-modality Learning
Embodied AI
Fang Liang
3D Vision
Image Reconstruction
Dongping Li
dongping.23@intl.zju.edu.cn
Multi-modality Learning
Active Perception
Unified Model
Junsheng Huang
junsheng.24@intl.zju.edu.cn
3D Vision
Multi-modality Learning
Tianci Tang
tianci_tang@tiu.edu.cn
Embodied AI
Diffusion Model
Yizhi Li
yizhi.20@intl.zju.edu.cn
Multi-modality Learning
Computer Vision
Xuexiang Wen
xuexiang.24@intl.zju.edu.cn
Multi-modality Learning
Jiawu Zhang
2540614031@qq.com
Multi-modal Logistics Large Models
Bocheng Hu
bocheng.25@intl.zju.edu.cn
Motion Generation
Vision–Language Models (VLMs)
Vision–Language–Action Models (VLAs)
Jie Cao
jie.25@intl.zju.edu.cn
Multi-modality Learning
Haonan Zhou
haonan1.25@intl.zju.edu.cn
3D Scene Generation
Xiaohan Chen
xiaohan.25@intl.zju.edu.cn
Multi-modality Learning
Large Language Models (LLMs)
Alumni
Shengyu Hao
shengyuhao@zju.edu.cn
Multi-object Tracking
Representation Learning
Domain Adaptation
Xiaoyue Li
(Main Advisor: Mark Butala)
xiaoyue98@zju.edu.cn
Image Generation
Image Reconstruction
Medical Image Inverse Problems
Shidong Cao
shidong.22@intl.zju.edu.cn
Generative Models
Multi-modality Learning
Graph Machine Learning
Yichen Ouyang [Web]
22271110@zju.edu.cn
Generative Models
3D Vision
Multi-modality Learning
Meiqi Sun
meiqi.22@intl.zju.edu.cn
Animal Action Recognition
Animal Pose Estimation
Xuechen Guo
xuechen.22@intl.zju.edu.cn
Computer Vision
Multi-modality Learning
Jianshu Guo
jianshu.22@intl.zju.edu.cn
Diffusion Model
Vision Language
Chang Su
changs.19@intl.zju.edu.cn
Smart City
Yichen Xu
Wenhao Chai (Alumni) [Web]
wchai@uw.edu
Multi-modality Representation
Unified Perception Model
Embodied Intelligence
Jie Deng
dengj325@gmail.com
3D Scene Generation