
Di Huang


DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling

Apr 14, 2024
Xuening Yuan, Hongyu Yang, Yueming Zhao, Di Huang

Via arXiv

iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection

Apr 08, 2024
Nan Zhou, Jiaxin Chen, Di Huang

Via arXiv

InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

Apr 06, 2024
Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang

Via arXiv

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

Apr 02, 2024
Haoxiang Ma, Modi Shi, Boyang Gao, Di Huang

Via arXiv

A Survey on Long Video Generation: Challenges, Methods, and Prospects

Mar 25, 2024
Chengxuan Li, Di Huang, Zeyu Lu, Yang Xiao, Qingqi Pei, Lei Bai

Via arXiv

GVGEN: Text-to-3D Generation with Volumetric Representation

Mar 19, 2024
Xianglong He, Junyi Chen, Sida Peng, Di Huang, Yangguang Li, Xiaoshui Huang, Chun Yuan, Wanli Ouyang, Tong He

Via arXiv

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Mar 18, 2024
Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He, Yanyong Zhang

Via arXiv

Sim-to-Real Grasp Detection with Global-to-Local RGB-D Adaptation

Mar 18, 2024
Haoxiang Ma, Ran Qin, Modi Shi, Boyang Gao, Di Huang

Via arXiv

DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition

Mar 13, 2024
Hebeizi Li, Hongyu Yang, Di Huang

Via arXiv

Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision

Mar 06, 2024
Yajie Liu, Pu Ge, Qingjie Liu, Di Huang

Via arXiv