Picture for Xiaowei Chi

Xiaowei Chi

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon

BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection

Add code
Sep 17, 2025
Viaarxiv icon

MinD: Unified Visual Imagination and Control via Hierarchical World Models

Add code
Jun 23, 2025
Viaarxiv icon

ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance

Add code
Apr 23, 2025
Viaarxiv icon

MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation

Add code
Mar 26, 2025
Viaarxiv icon

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Add code
Mar 11, 2025
Viaarxiv icon

RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency

Add code
Jan 15, 2025
Viaarxiv icon

Large Motion Video Autoencoding with Cross-modal Video VAE

Add code
Dec 23, 2024
Figure 1 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 2 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 3 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 4 for Large Motion Video Autoencoding with Cross-modal Video VAE
Viaarxiv icon

EVA: An Embodied World Model for Future Video Anticipation

Add code
Oct 20, 2024
Figure 1 for EVA: An Embodied World Model for Future Video Anticipation
Figure 2 for EVA: An Embodied World Model for Future Video Anticipation
Figure 3 for EVA: An Embodied World Model for Future Video Anticipation
Figure 4 for EVA: An Embodied World Model for Future Video Anticipation
Viaarxiv icon

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Add code
Sep 16, 2024
Figure 1 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 2 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 3 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 4 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Viaarxiv icon