Picture for Gang Yu

Gang Yu

Department of Biomedical Engineering, School of Basic Medical Sciences, Central South University, Changsha, China

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Add code
Dec 17, 2023
Figure 1 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 2 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 3 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 4 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Viaarxiv icon

FaceStudio: Put Your Face Everywhere in Seconds

Add code
Dec 06, 2023
Viaarxiv icon

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

Add code
Dec 01, 2023
Figure 1 for ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
Figure 2 for ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
Figure 3 for ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
Figure 4 for ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
Viaarxiv icon

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Add code
Nov 30, 2023
Figure 1 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 2 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 3 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 4 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Viaarxiv icon

ChartLlama: A Multimodal LLM for Chart Understanding and Generation

Add code
Nov 27, 2023
Figure 1 for ChartLlama: A Multimodal LLM for Chart Understanding and Generation
Figure 2 for ChartLlama: A Multimodal LLM for Chart Understanding and Generation
Figure 3 for ChartLlama: A Multimodal LLM for Chart Understanding and Generation
Figure 4 for ChartLlama: A Multimodal LLM for Chart Understanding and Generation
Viaarxiv icon

VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations

Add code
Oct 23, 2023
Figure 1 for VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations
Figure 2 for VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations
Figure 3 for VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations
Figure 4 for VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations
Viaarxiv icon

TapMo: Shape-aware Motion Generation of Skeleton-free Characters

Add code
Oct 19, 2023
Viaarxiv icon

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

Add code
Sep 18, 2023
Viaarxiv icon

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Add code
Sep 06, 2023
Figure 1 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 2 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 3 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 4 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Viaarxiv icon

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

Add code
Aug 22, 2023
Figure 1 for IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Figure 2 for IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Figure 3 for IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Figure 4 for IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Viaarxiv icon