Picture for Xiaodong Yu

Xiaodong Yu

Residual Skill Optimization for Text-to-SQL Ensembles

Add code
May 20, 2026
Viaarxiv icon

KV-RM: Regularizing KV-Cache Movement for Static-Graph LLM Serving

Add code
May 10, 2026
Viaarxiv icon

Stabilizing Efficient Reasoning with Step-Level Advantage Selection

Add code
Apr 27, 2026
Viaarxiv icon

VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking

Add code
Mar 20, 2026
Viaarxiv icon

Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density

Add code
Feb 11, 2026
Viaarxiv icon

Reliable Use of Lemmas via Eligibility Reasoning and Section$-$Aware Reinforcement Learning

Add code
Feb 01, 2026
Viaarxiv icon

CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models

Add code
Jan 05, 2026
Viaarxiv icon

Instella: Fully Open Language Models with Stellar Performance

Add code
Nov 14, 2025
Viaarxiv icon

An Efficient Gradient-Aware Error-Bounded Lossy Compressor for Federated Learning

Add code
Nov 07, 2025
Viaarxiv icon

Learning from Online Videos at Inference Time for Computer-Use Agents

Add code
Nov 06, 2025
Figure 1 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 2 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 3 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 4 for Learning from Online Videos at Inference Time for Computer-Use Agents
Viaarxiv icon