Picture for Xiaopeng Zhang

Xiaopeng Zhang

FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions

Add code
Mar 18, 2026
Viaarxiv icon

HMR-1: Hierarchical Massage Robot with Vision-Language-Model for Embodied Healthcare

Add code
Mar 09, 2026
Viaarxiv icon

AI Decodes Historical Chinese Archives to Reveal Lost Climate History

Add code
Jan 30, 2026
Viaarxiv icon

CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing

Add code
Dec 15, 2025
Viaarxiv icon

Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction

Add code
Nov 13, 2025
Viaarxiv icon

Temporal Action Selection for Action Chunking

Add code
Nov 06, 2025
Viaarxiv icon

MASH: Cooperative-Heterogeneous Multi-Agent Reinforcement Learning for Single Humanoid Robot Locomotion

Add code
Aug 14, 2025
Viaarxiv icon

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Add code
Jul 28, 2025
Viaarxiv icon

Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models

Add code
Jul 10, 2025
Viaarxiv icon

OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding

Add code
Jul 03, 2025
Viaarxiv icon