Picture for Ruobing Xie

Ruobing Xie

MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

Add code
May 14, 2026
Viaarxiv icon

Hybrid Policy Distillation for LLMs

Add code
Apr 22, 2026
Viaarxiv icon

Negative Advantage Is a Double-Edged Sword: Calibrating Advantage in GRPO for Deep Search

Add code
Apr 20, 2026
Viaarxiv icon

Self-Distillation for Multi-Token Prediction

Add code
Mar 25, 2026
Viaarxiv icon

Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity

Add code
Mar 18, 2026
Viaarxiv icon

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning

Add code
Mar 16, 2026
Viaarxiv icon

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Add code
Feb 12, 2026
Viaarxiv icon

D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs

Add code
Jan 30, 2026
Viaarxiv icon

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Add code
Jan 20, 2026
Viaarxiv icon

Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval

Add code
Sep 05, 2025
Viaarxiv icon