Picture for Xiaopeng Zhang

Xiaopeng Zhang

CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing

Add code
Dec 15, 2025
Viaarxiv icon

Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction

Add code
Nov 13, 2025
Viaarxiv icon

Temporal Action Selection for Action Chunking

Add code
Nov 06, 2025
Viaarxiv icon

MASH: Cooperative-Heterogeneous Multi-Agent Reinforcement Learning for Single Humanoid Robot Locomotion

Add code
Aug 14, 2025
Viaarxiv icon

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Add code
Jul 28, 2025
Viaarxiv icon

Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models

Add code
Jul 10, 2025
Viaarxiv icon

OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding

Add code
Jul 03, 2025
Viaarxiv icon

Tackling View-Dependent Semantics in 3D Language Gaussian Splatting

Add code
May 30, 2025
Viaarxiv icon

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision

Add code
Apr 03, 2025
Viaarxiv icon

RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing

Add code
Mar 14, 2025
Viaarxiv icon