Picture for Xiaodan Liang

Xiaodan Liang

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis

Add code
Feb 27, 2024
Figure 1 for AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
Figure 2 for AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
Figure 3 for AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
Figure 4 for AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
Viaarxiv icon

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data

Add code
Feb 14, 2024
Viaarxiv icon

GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data

Add code
Feb 13, 2024
Figure 1 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Figure 2 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Figure 3 for GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Viaarxiv icon

MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation

Add code
Jan 14, 2024
Figure 1 for MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation
Figure 2 for MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation
Figure 3 for MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation
Figure 4 for MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation
Viaarxiv icon

Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models

Add code
Jan 02, 2024
Viaarxiv icon

3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands

Add code
Jan 02, 2024
Figure 1 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 2 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 3 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 4 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Viaarxiv icon

Monocular 3D Hand Mesh Recovery via Dual Noise Estimation

Add code
Dec 26, 2023
Figure 1 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 2 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 3 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 4 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Viaarxiv icon

Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model

Add code
Dec 18, 2023
Viaarxiv icon

WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on

Add code
Dec 06, 2023
Figure 1 for WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on
Figure 2 for WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on
Figure 3 for WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on
Figure 4 for WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on
Viaarxiv icon

DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance

Add code
Dec 05, 2023
Viaarxiv icon