Kun Yuan

Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion

Jan 18, 2026

SurgGoal: Rethinking Surgical Planning Evaluation via Goal-Satisfiability

Jan 15, 2026

Mixture of Distributions Matters: Dynamic Sparse Attention for Efficient Video Diffusion Transformers

Jan 14, 2026

LapFM: A Laparoscopic Segmentation Foundation Model via Hierarchical Concept Evolving Pre-training

Dec 09, 2025

Mixture-of-Channels: Exploiting Sparse FFNs for Efficient LLMs Pre-Training and Inference

Nov 12, 2025

An All-Reduce Compatible Top-K Compressor for Communication-Efficient Distributed Learning

Oct 30, 2025

An Efficient Subspace Algorithm for Federated Learning on Heterogeneous Data

Sep 05, 2025

Bridging Video Quality Scoring and Justification via Large Multimodal Models

Jun 26, 2025

Efficient Long-Context LLM Inference via KV Cache Clustering

Jun 13, 2025

EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy

May 21, 2025