Alert button
Picture for Zhihao Yuan

Zhihao Yuan

Alert button

GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance

Add code
Bookmark button
Alert button
Dec 12, 2023
Haiming Zhang, Zhihao Yuan, Chaoda Zheng, Xu Yan, Baoyuan Wang, Guanbin Li, Song Wu, Shuguang Cui, Zhen Li

Figure 1 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 2 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 3 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Figure 4 for GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
Viaarxiv icon

Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding

Add code
Bookmark button
Alert button
Nov 26, 2023
Zhihao Yuan, Jinke Ren, Chun-Mei Feng, Hengshuang Zhao, Shuguang Cui, Zhen Li

Viaarxiv icon

Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases

Add code
Bookmark button
Alert button
Jul 05, 2022
Zhihao Yuan, Xu Yan, Zhuo Li, Xuhao Li, Yao Guo, Shuguang Cui, Zhen Li

Figure 1 for Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Figure 2 for Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Figure 3 for Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Figure 4 for Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Viaarxiv icon

X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning

Add code
Bookmark button
Alert button
Apr 06, 2022
Zhihao Yuan, Xu Yan, Yinghong Liao, Yao Guo, Guanbin Li, Zhen Li, Shuguang Cui

Figure 1 for X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Figure 2 for X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Figure 3 for X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Figure 4 for X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Viaarxiv icon

CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes

Add code
Bookmark button
Alert button
Dec 31, 2021
Xu Yan, Zhihao Yuan, Yuhao Du, Yinghong Liao, Yao Guo, Zhen Li, Shuguang Cui

Figure 1 for CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes
Figure 2 for CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes
Figure 3 for CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes
Figure 4 for CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes
Viaarxiv icon

InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring

Add code
Bookmark button
Alert button
Mar 01, 2021
Zhihao Yuan, Xu Yan, Yinghong Liao, Ruimao Zhang, Zhen Li, Shuguang Cui

Figure 1 for InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Figure 2 for InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Figure 3 for InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Figure 4 for InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Viaarxiv icon