Chenyang Gu

Causal Inspired Multi Modal Recommendation

Oct 14, 2025
Via arXiv

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

Sep 30, 2025
Via arXiv

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Aug 18, 2025
Via arXiv

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

Jul 02, 2025
Via arXiv

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Mar 13, 2025
Via arXiv

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation

Jan 28, 2025
Via arXiv

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation

Dec 18, 2024
Via arXiv

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Nov 27, 2024
Via arXiv

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

Jul 08, 2024
Via arXiv

Large Motion Model for Unified Multi-Modal Motion Generation

Apr 01, 2024
Via arXiv