Xiaowei Chi

M-LRM: Multi-view Large Reconstruction Model

Jun 11, 2024
Via arXiv

LLMs Meet Multimodal Generation and Editing: A Survey

May 29, 2024
Via arXiv

CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild

May 27, 2024
Via arXiv

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments

Feb 29, 2024
Via arXiv

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Feb 25, 2024
Via arXiv

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

Nov 29, 2023
Via arXiv

ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model

Nov 29, 2023
Via arXiv

Unimodal Training-Multimodal Prediction: Cross-modal Federated Learning with Hierarchical Aggregation

Mar 27, 2023
Via arXiv

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks

Dec 02, 2022
Via arXiv

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection

Nov 30, 2022
Via arXiv