Picture for Qingyi Si

Qingyi Si

Near-Future Policy Optimization

Add code
Apr 22, 2026
Viaarxiv icon

Self-Distilled RLVR

Add code
Apr 03, 2026
Viaarxiv icon

Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection

Add code
Mar 28, 2026
Viaarxiv icon

A Closer Look into LLMs for Table Understanding

Add code
Mar 16, 2026
Viaarxiv icon

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Add code
Mar 16, 2026
Viaarxiv icon

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System

Add code
Mar 16, 2026
Viaarxiv icon

System 1&2 Synergy via Dynamic Model Interpolation

Add code
Jan 29, 2026
Viaarxiv icon

Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning

Add code
Jan 12, 2026
Viaarxiv icon

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models

Add code
Aug 26, 2025
Viaarxiv icon

Stable Reinforcement Learning for Efficient Reasoning

Add code
May 23, 2025
Figure 1 for Stable Reinforcement Learning for Efficient Reasoning
Figure 2 for Stable Reinforcement Learning for Efficient Reasoning
Figure 3 for Stable Reinforcement Learning for Efficient Reasoning
Figure 4 for Stable Reinforcement Learning for Efficient Reasoning
Viaarxiv icon