Picture for Xiaokang Yang

Xiaokang Yang

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Add code
Aug 27, 2025
Viaarxiv icon

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

Add code
Aug 06, 2025
Viaarxiv icon

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding

Add code
Aug 06, 2025
Viaarxiv icon

MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding

Add code
Jul 08, 2025
Viaarxiv icon

MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration

Add code
May 30, 2025
Viaarxiv icon

Weight Spectra Induced Efficient Model Adaptation

Add code
May 29, 2025
Viaarxiv icon

MAP: Revisiting Weight Decomposition for Low-Rank Adaptation

Add code
May 29, 2025
Viaarxiv icon

Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning

Add code
May 27, 2025
Viaarxiv icon

HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

Add code
May 26, 2025
Viaarxiv icon