Yikang Ding

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Mar 26, 2026

Kling-MotionControl Technical Report

Mar 03, 2026

CLAIM: Camera-LiDAR Alignment with Intensity and Monodepth

Dec 16, 2025

KlingAvatar 2.0 Technical Report

Dec 15, 2025

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Sep 11, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Jul 03, 2025

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Mar 19, 2025

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction

Mar 13, 2025

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Jan 24, 2025

UniScene: Unified Occupancy-centric Driving Scene Generation

Dec 06, 2024