Picture for Xiaokang Yang

Xiaokang Yang

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

Add code
Aug 06, 2025
Figure 1 for Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Figure 2 for Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Figure 3 for Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Figure 4 for Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Viaarxiv icon

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding

Add code
Aug 06, 2025
Figure 1 for NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Figure 2 for NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Figure 3 for NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Figure 4 for NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Viaarxiv icon

MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding

Add code
Jul 08, 2025
Figure 1 for MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding
Figure 2 for MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding
Figure 3 for MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding
Figure 4 for MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding
Viaarxiv icon

MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models

Add code
Jun 12, 2025
Figure 1 for MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
Figure 2 for MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
Figure 3 for MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
Figure 4 for MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
Viaarxiv icon

ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration

Add code
May 30, 2025
Figure 1 for ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration
Figure 2 for ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration
Figure 3 for ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration
Figure 4 for ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration
Viaarxiv icon

MAP: Revisiting Weight Decomposition for Low-Rank Adaptation

Add code
May 29, 2025
Figure 1 for MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Figure 2 for MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Figure 3 for MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Figure 4 for MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Viaarxiv icon

Weight Spectra Induced Efficient Model Adaptation

Add code
May 29, 2025
Viaarxiv icon

Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning

Add code
May 27, 2025
Figure 1 for Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Figure 2 for Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Figure 3 for Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Figure 4 for Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Viaarxiv icon

HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

Add code
May 26, 2025
Viaarxiv icon

Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition

Add code
May 25, 2025
Figure 1 for Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition
Figure 2 for Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition
Figure 3 for Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition
Figure 4 for Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition
Viaarxiv icon