Picture for Ruobing Xie

Ruobing Xie

Self-Distillation for Multi-Token Prediction

Add code
Mar 25, 2026
Viaarxiv icon

Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity

Add code
Mar 18, 2026
Viaarxiv icon

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning

Add code
Mar 16, 2026
Viaarxiv icon

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Add code
Feb 12, 2026
Viaarxiv icon

D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs

Add code
Jan 30, 2026
Viaarxiv icon

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Add code
Jan 20, 2026
Viaarxiv icon

Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval

Add code
Sep 05, 2025
Viaarxiv icon

Proximal Supervised Fine-Tuning

Add code
Aug 25, 2025
Figure 1 for Proximal Supervised Fine-Tuning
Figure 2 for Proximal Supervised Fine-Tuning
Figure 3 for Proximal Supervised Fine-Tuning
Figure 4 for Proximal Supervised Fine-Tuning
Viaarxiv icon

Flexible Realignment of Language Models

Add code
Jun 15, 2025
Figure 1 for Flexible Realignment of Language Models
Figure 2 for Flexible Realignment of Language Models
Figure 3 for Flexible Realignment of Language Models
Figure 4 for Flexible Realignment of Language Models
Viaarxiv icon

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Add code
May 28, 2025
Viaarxiv icon