Picture for Xiaopeng Zhang

Xiaopeng Zhang

MASH: Cooperative-Heterogeneous Multi-Agent Reinforcement Learning for Single Humanoid Robot Locomotion

Add code
Aug 14, 2025
Viaarxiv icon

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Add code
Jul 28, 2025
Viaarxiv icon

Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models

Add code
Jul 10, 2025
Viaarxiv icon

OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding

Add code
Jul 03, 2025
Viaarxiv icon

Tackling View-Dependent Semantics in 3D Language Gaussian Splatting

Add code
May 30, 2025
Viaarxiv icon

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision

Add code
Apr 03, 2025
Viaarxiv icon

RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing

Add code
Mar 14, 2025
Viaarxiv icon

DehazeGS: Seeing Through Fog with 3D Gaussian Splatting

Add code
Jan 07, 2025
Viaarxiv icon

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Add code
Dec 12, 2024
Viaarxiv icon

Stepping Forward on the Last Mile

Add code
Nov 06, 2024
Figure 1 for Stepping Forward on the Last Mile
Figure 2 for Stepping Forward on the Last Mile
Figure 3 for Stepping Forward on the Last Mile
Figure 4 for Stepping Forward on the Last Mile
Viaarxiv icon